[go: up one dir, main page]

CN111915116A - Electric power resident user classification method based on K-means clustering - Google Patents

Electric power resident user classification method based on K-means clustering Download PDF

Info

Publication number
CN111915116A
CN111915116A CN201910389901.7A CN201910389901A CN111915116A CN 111915116 A CN111915116 A CN 111915116A CN 201910389901 A CN201910389901 A CN 201910389901A CN 111915116 A CN111915116 A CN 111915116A
Authority
CN
China
Prior art keywords
load
data
users
electricity
type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910389901.7A
Other languages
Chinese (zh)
Inventor
李成仁
何永秀
高效
陈奋开
尤培培
张文月
周树鹏
王伟劼
焦哲
刘思佳
陈国平
梁宝全
张超
马凤云
肖广宇
邵洁
张立岩
王美艳
何青
杨光
马朝
毕娜
刘培良
尤立莎
周艳艳
李海杰
闫晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Tianjin Electric Power Co Ltd
State Grid Energy Research Institute Co Ltd
North China Electric Power University
Original Assignee
State Grid Tianjin Electric Power Co Ltd
State Grid Energy Research Institute Co Ltd
North China Electric Power University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Tianjin Electric Power Co Ltd, State Grid Energy Research Institute Co Ltd, North China Electric Power University filed Critical State Grid Tianjin Electric Power Co Ltd
Priority to CN201910389901.7A priority Critical patent/CN111915116A/en
Publication of CN111915116A publication Critical patent/CN111915116A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06315Needs-based resource requirements planning or analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • Health & Medical Sciences (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Primary Health Care (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Evolutionary Biology (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • General Engineering & Computer Science (AREA)
  • Game Theory and Decision Science (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Supply And Distribution Of Alternating Current (AREA)

Abstract

本发明公开了一种基于K‑means聚类的电力居民用户分类方法,包括以下步骤:确定区域电力居民用户负荷数据;对数据进行预处理;对处理后的数据,根据K‑means聚类算法对数据进行分类;基于聚类结果,分析每类电力居民用户负荷特性;分析区域电力系统负荷特性;对比分析电力系统负荷曲线和每类电力居民用户负荷曲线,确定每类电力居民用户用电类型。本发明采用K‑means聚类算法对电力居民用户的用电负荷数据进行相似度度量,以及对用户进行聚类分析,能够对用户的用电负荷曲线特征进行良好的度量和区分,实现用户负荷曲线的聚类和负荷特性分析,且K‑means聚类算法在对电力居民用户负荷聚类分析时具有可伸缩性和高效性。

Figure 201910389901

The invention discloses a K-means clustering-based electricity residential user classification method, comprising the following steps: determining regional electricity residential user load data; preprocessing the data; Classify the data; based on the clustering results, analyze the load characteristics of each type of electricity residential users; analyze the regional power system load characteristics; compare and analyze the power system load curve and each type of electricity residential user load curve to determine the electricity consumption type of each type of electricity residential user . The invention adopts the K-means clustering algorithm to measure the similarity of the power consumption load data of the electric residential users, and perform cluster analysis on the users, so that the characteristics of the power consumption load curve of the users can be well measured and distinguished, and the user load can be realized. The clustering of curves and the analysis of load characteristics, and the K-means clustering algorithm is scalable and efficient in the clustering analysis of electric residential users' loads.

Figure 201910389901

Description

一种基于K-means聚类的电力居民用户分类方法A Classification Method of Electricity Residential Users Based on K-means Clustering

技术领域technical field

本发明涉及电力系统分析技术领域,具体涉及一种基于K-means聚类的电力居民用户分类方法。The invention relates to the technical field of power system analysis, in particular to a K-means clustering-based method for classifying electricity residential users.

背景技术Background technique

售电侧放开模式下,电力市场中将会出现多元化的售电主体,其将会展开电力零售业务竞争。通过对电力市场中的用户细分,详尽的了解不同类型用户的负荷特性,以及不同用户的差异化需求,电力市场中售电主体可以科学安排购售电方案,制定出多元化及多样化的电价套餐体系,给用户提供更多可选择性,有益于售电主体吸引更多用户,且抢占售电市场。Under the mode of opening up the electricity sales side, there will be diversified electricity sellers in the electricity market, and they will compete in the electricity retail business. By subdividing users in the power market, and understanding the load characteristics of different types of users in detail, as well as the differentiated needs of different users, the electricity sellers in the electricity market can scientifically arrange electricity purchase and sales plans, and formulate diversified and diversified The electricity price package system provides users with more options, which is beneficial to the main body of electricity sales to attract more users and seize the electricity sales market.

目前对于基于聚类方法实现用户细分的方法理论已经相对成熟,如模糊聚类法和Kohonen神经网络(KNN)聚类法。但模糊聚类法易受主观因素的干扰,使聚类结果存在局部差异,且算法相对复杂;KNN聚类法的实际应用表明其不能满足负荷预测聚类对曲线形态细节识别的要求;随着售电侧改革的深入,电力市场中居民用户已达到相当大的规模,若对这些用户进行一对一的负荷特性分析已不具备可行性,因此,在进行负荷特性分析时,电力居民用户细分尤显得十分必要,而目前尚未有技术涉及该领域。At present, the method theory for realizing user segmentation based on clustering method has been relatively mature, such as fuzzy clustering method and Kohonen neural network (KNN) clustering method. However, the fuzzy clustering method is susceptible to the interference of subjective factors, resulting in local differences in the clustering results, and the algorithm is relatively complex; the practical application of the KNN clustering method shows that it cannot meet the requirements of load forecasting clustering for the identification of curve shape details; With the deepening of the reform of the electricity sales side, the residential users in the electricity market have reached a considerable scale, and it is no longer feasible to carry out one-to-one load characteristic analysis on these users. It is very necessary to distinguish, and there is no technology involved in this field at present.

发明内容SUMMARY OF THE INVENTION

为了解决上述技术问题,本发明所采用的技术方案是提供了一种基于K-means聚类的电力居民用户分类方法,包括以下步骤:In order to solve the above-mentioned technical problems, the technical solution adopted in the present invention is to provide a method for classifying electricity residential users based on K-means clustering, including the following steps:

确定区域电力居民用户负荷数据;Determine the load data of regional electricity residential users;

对数据进行预处理;preprocessing the data;

根据K-means聚类算法对对预处理后的数据进行分类;Classify the preprocessed data according to the K-means clustering algorithm;

基于聚类结果,分析每类电力居民用户负荷特性;分析区域电力系统负荷特性;Based on the clustering results, analyze the load characteristics of each type of electricity residential users; analyze the regional power system load characteristics;

对比分析电力系统负荷曲线和每类电力居民用户负荷曲线,确定每类电力居民用户用电类型。Compare and analyze the load curve of the power system and the load curve of each type of electricity residential user, and determine the electricity consumption type of each type of electricity residential user.

在上述方法中,还包括根据每类电力居民用户用电类型,推送合理的售电方案。In the above method, it also includes pushing a reasonable electricity sales plan according to the electricity consumption type of each type of electricity residential user.

在上述方法中,所述对数据进行预处理包括以下步骤:In the above method, the data preprocessing includes the following steps:

A1、删除负荷数据中无效记录;包括负值数据与0;A1. Delete invalid records in load data; including negative data and 0;

A2、删除连续缺失较多的数据;A2. Delete the data with more continuous missing;

A3、个别缺失值处理;A3. Handling of individual missing values;

对于首末端数据空缺,以趋势比例法进行补全;For the first and last data gaps, the trend ratio method is used to fill in;

对于中间数据空缺,以非邻均值生成法进行补全。For the gaps in the intermediate data, the non-neighboring mean generation method is used for filling.

在上述方法中,所述删除连续缺失较多的数据具体为:In the above method, the deletion of continuously missing data is specifically:

以15min为时间间隔点,每天的负荷值有96个点,删除连续缺失超过30个负荷值的样本。With a time interval of 15 minutes, there are 96 load values per day, and samples with consecutively missing more than 30 load values are deleted.

在上述方法中,所述根据K-means聚类算法对对处理后的数据进行分类;包括以下步骤:In the above method, the described data is classified according to the K-means clustering algorithm; including the following steps:

对预处理后的负荷数据进行归一化处理,并形成负荷数据样本集;Normalize the preprocessed load data and form a load data sample set;

设定聚类个数,根据K-means聚类算法对负荷数据样本集进行分类。Set the number of clusters, and classify the load data sample set according to the K-means clustering algorithm.

在上述方法中,所述分析电力居民用户负荷特性包括:In the above method, the analyzing the load characteristics of electric residential users includes:

基于聚类结果,分析每类用户的负荷特性,以日负荷特性曲线描述电力居民用户负荷特性。Based on the clustering results, the load characteristics of each type of users are analyzed, and the daily load characteristic curve is used to describe the load characteristics of electric residential users.

在上述方法中,所述对预处理后的负荷数据进行归一化处理包括:In the above method, the normalization processing of the preprocessed load data includes:

采用极值法对预处理后的负荷数据进行归一化处理,具体公式如下:The preprocessed load data is normalized by the extreme value method, and the specific formula is as follows:

Figure BDA0002056138930000031
Figure BDA0002056138930000031

式中,X*为经归一化处理后的值,X为预处理后的负荷数据,Xmax为预处理后的负荷数据中的最大值。In the formula, X * is the value after normalization, X is the load data after preprocessing, and Xmax is the maximum value in the load data after preprocessing.

在上述方法中,所述电力居民用户用电类型包括:迎峰用电型用户,部分迎峰用电型用户和平稳用电型用户。In the above method, the electricity consumption types of the electricity residential users include: peak-peak electricity users, partial peak-peak electricity users and stable electricity users.

在上述方法中,所述聚类个数K为3~10个。In the above method, the number of clusters K is 3-10.

在上述方法中,所述聚类个数K优选为5。In the above method, the number of clusters K is preferably 5.

本发明采用K-means聚类算法对电力居民用户的用电负荷数据进行相似度度量,以及对用户进行聚类分析,能够对用户的用电负荷曲线特征进行良好的度量和区分,实现用户负荷曲线的聚类和负荷特性分析,且K-means聚类算法在对电力居民用户负荷聚类分析时具有可伸缩性和高效性。The invention adopts the K-means clustering algorithm to measure the similarity of the power consumption load data of the electric residential users, and perform cluster analysis on the users, so as to measure and distinguish the characteristics of the power consumption load curve of the users well, and realize the user load. Curve clustering and load characteristic analysis, and the K-means clustering algorithm is scalable and efficient in the clustering analysis of electric residential users' loads.

附图说明Description of drawings

图1为本发明提供的流程图;Fig. 1 is the flow chart provided by the present invention;

图2为本发明提供的数据预处理流程示意图;Fig. 2 is a schematic diagram of a data preprocessing process flow provided by the present invention;

图3为本发明提供的案例中第一类负荷聚类中心曲线图;Fig. 3 is the first kind of load cluster center curve diagram in the case provided by the present invention;

图4为本发明提供的案例中第二类负荷聚类中心曲线图;Fig. 4 is the second kind of load cluster center curve diagram in the case provided by the present invention;

图5为本发明提供的案例中第三类负荷聚类中心曲线图;Fig. 5 is the third type of load clustering center curve diagram in the case provided by the present invention;

图6为本发明提供的案例中第四类负荷聚类中心曲线图;Fig. 6 is the fourth type of load clustering center curve diagram in the case provided by the present invention;

图7为本发明提供的案例中第五类负荷聚类中心曲线图;Fig. 7 is the fifth kind of load clustering center curve diagram in the case provided by the present invention;

图8为本发明提供的案例中地区电力系统一周内日负荷特性曲线图。FIG. 8 is a daily load characteristic curve diagram of the regional power system in one week in the case provided by the present invention.

具体实施方式Detailed ways

本发明采用K-means聚类方法高效地实现了电力居民用户细分,详尽分析不同类型用户的负荷特性,一方面可以指导售电主体依据不同电力居民用户的差异化需求,提出相应的零售电价套餐;另一方面可以指导电力居民用户合理用电如避免在系统负荷高峰时用电,以优化电力居民用户用电行为,降低系统负荷的波动。下面结合具体实施方式和说明书附图对本发明做出详细的说明。The invention adopts the K-means clustering method to efficiently realize the subdivision of electricity residential users, and analyzes the load characteristics of different types of users in detail. Package; on the other hand, it can guide electricity residential users to use electricity reasonably, such as avoiding electricity consumption during peak system load, so as to optimize the electricity consumption behavior of electricity residential users and reduce the fluctuation of system load. The present invention will be described in detail below with reference to the specific embodiments and the accompanying drawings.

如图1所示,本发明提供了一种基于K-means聚类的电力居民用户分类方法,其特征在于,包括以下步骤:As shown in FIG. 1 , the present invention provides a method for classifying electricity residential users based on K-means clustering, which is characterized in that it includes the following steps:

S1、确定区域电力居民用户负荷数据;S1. Determine the load data of regional electricity residential users;

在搜集负荷数据时,要尽量保证数据的完整性及时间上的连续性。When collecting load data, try to ensure data integrity and time continuity.

S2、对数据进行预处理;S2. Preprocess the data;

数据预处理的方法如下,具体过程如附图2所示:The method of data preprocessing is as follows, and the specific process is shown in Figure 2:

在数据预处理时,负荷数据的缺失或者有误时会影响聚类结果的正确性,因此在聚类之前首先应当对负荷数据进行预处理,采用方法如下:During data preprocessing, the lack or error of load data will affect the correctness of the clustering results. Therefore, the load data should be preprocessed before clustering. The methods are as follows:

(1)删除负荷数据中无效记录(1) Delete invalid records in load data

本实施例所需要的负荷值是正向有功负荷,因此搜集的数据均不低于0。首先删除表中含有负值的数据;其次为了使得聚类结果有代表性,再删除全为0的数据;The load value required in this embodiment is the forward active load, so the collected data are not lower than 0. First, delete the data with negative values in the table; secondly, in order to make the clustering results representative, delete the data with all 0s;

(2)删除连续缺失太多的数据(2) Delete too many consecutive missing data

以15min为时间间隔点,每天的负荷值有96个点,删除连续缺失超过30个负荷值的样本。With a time interval of 15 minutes, there are 96 load values per day, and samples with consecutively missing more than 30 load values are deleted.

(3)个别缺失值处理(3) Handling of individual missing values

在聚类前,需要对数据进行补遗和修正。本实施例对于首末端数据空缺,以趋势比例法进行补全;对于中间数据空缺,以非邻均值生成法进行补全。所谓非邻均值生成法,即对于非等时距的数列,或虽为等时距数列,但剔除异常值之后出现空穴的数列,用空穴两边的数据求平均值构造新的数据以填补空穴。Before clustering, the data needs to be supplemented and corrected. In this embodiment, for the data gaps at the beginning and end, the trend ratio method is used to fill in; for the intermediate data gaps, the non-neighboring mean generation method is used to fill in. The so-called non-neighboring mean generation method, that is, for a sequence that is not equidistant in time, or a sequence that is equidistant, but has holes after removing outliers, the data on both sides of the hole is averaged to construct new data to fill the gap. hole.

S3、根据K-means聚类算法对预处理后的数据进行分类;具体包括以下步骤:S3. Classify the preprocessed data according to the K-means clustering algorithm; specifically, the following steps are included:

S31、对预处理后的数据进行归一化处理,形成负荷数据样本集D={x1,x2,…,xm};S31. Normalize the preprocessed data to form a load data sample set D={x 1 ,x 2 ,...,x m };

由于不同电力居民用户的负荷大小相差甚远,则为了更好地将用电行为相似的用户聚为一类,以及确保本数据训练的有效性,需对预处理后的数据进行归一化处理,再进行分析。本实施例采用极值法对预处理后的数据进行归一化处理,该方法可实现对原始数据的等比例伸缩,具体公式如下:Since the loads of different residential electricity users are quite different, in order to better group users with similar electricity consumption behaviors into one group and ensure the effectiveness of this data training, it is necessary to normalize the preprocessed data. , and then analyze. This embodiment adopts the extreme value method to normalize the preprocessed data. This method can realize the proportional expansion and contraction of the original data. The specific formula is as follows:

Figure BDA0002056138930000051
Figure BDA0002056138930000051

式中,X*为经归一化处理后的值,X为预处理后的负荷数据,Xmax为预处理后的负荷数据中的最大值。预处理后的负荷数据经归一化处理后,将限定在[0,1]区间内。In the formula, X * is the value after normalization, X is the load data after preprocessing, and Xmax is the maximum value in the load data after preprocessing. After the preprocessed load data is normalized, it will be limited to the interval [0,1].

S32、设定聚类个数,根据K-means聚类算法对负荷数据样本集D={x1,x2,…,xm}进行分类;S32. Set the number of clusters, and classify the load data sample set D={x 1 ,x 2 ,...,x m } according to the K-means clustering algorithm;

K-means聚类,是聚类算法中相对简单的聚类算法,以对负荷曲线进行相关的聚类。K-means clustering is a relatively simple clustering algorithm in the clustering algorithm to perform related clustering on the load curve.

聚类个数是用户设定,对于负荷数据样本集,首先随机选取K个点作为簇的中心点;其次根据相关的距离度量找到离中心点最近的样本点,将其划归到最近的簇中,同时更新簇的中心;迭代执行上述操作,直到数据不再发生变化或者簇的中心点不再变化,此时输出划分好的簇C={C1,C2,…,Ck},且聚类的损失函数为如下:The number of clusters is set by the user. For the load data sample set, K points are randomly selected as the center point of the cluster; secondly, the sample point closest to the center point is found according to the relevant distance metric, and it is classified into the nearest cluster. , update the center of the cluster at the same time; perform the above operations iteratively until the data no longer changes or the center point of the cluster no longer changes, and the divided cluster C={C 1 ,C 2 ,...,C k } is output at this time, And the loss function of clustering is as follows:

Figure BDA0002056138930000061
Figure BDA0002056138930000061

式中,μi是簇Ci均值向量,其具体表示如下:In the formula, μ i is the mean vector of cluster C i , and its specific expression is as follows:

Figure BDA0002056138930000062
Figure BDA0002056138930000062

本实施例中,使用K-means聚类算法进行聚类分析时,首先需要指定聚类的数量;聚类个数过多,会出现负荷曲线形状类似,分类代表性不强的情况;聚类个数过少,会出现负荷特性结果较少,分析不完备的情况,因此,聚类个数K范围为3~10个;为了使分类更具有代表性,经多次试验,发现当分类数K=5时,分类结果更为合理。In this embodiment, when using the K-means clustering algorithm to perform cluster analysis, the number of clusters needs to be specified first; if the number of clusters is too large, the shape of the load curve will be similar, and the representativeness of the classification will not be strong; If the number of clusters is too small, there will be less load characteristic results and incomplete analysis. Therefore, the number of clusters K ranges from 3 to 10; in order to make the classification more representative, after many experiments, it is found that when the number of When K=5, the classification result is more reasonable.

S4、基于聚类结果,分析每类电力居民用户负荷特性;具体包括:S4. Based on the clustering results, analyze the load characteristics of each type of electricity residential users; specifically include:

依据步骤S3中的聚类结果,分析电力居民用户的负荷特性,采用的方法为:According to the clustering result in step S3, analyze the load characteristics of electric power residential users, and the adopted method is:

通常情况下,分析电力居民用户的负荷特性,是通过计算负荷特性指标来实现的。负荷特性指标以时间为单位,不同时限可以划分为不同的负荷特性指标,如年负荷特性指标、月负荷特性指标和日负荷特性指标,且在不同的负荷特性指标中可以进一步细分为曲线类指标、比较类指标和描述类指标。在实施例中,为了快速且精确得到负荷特性分析的结果,将采用日负荷特性指标中的曲线类指标如日负荷特性曲线,实现对电力居民用户负荷特性的分析。Usually, the analysis of the load characteristics of electricity residential users is realized by calculating the load characteristic index. The load characteristic index takes time as the unit, and different time limits can be divided into different load characteristic indexes, such as annual load characteristic index, monthly load characteristic index and daily load characteristic index, and can be further subdivided into curve types in different load characteristic indexes Metrics, comparative metrics, and descriptive metrics. In the embodiment, in order to obtain the result of the load characteristic analysis quickly and accurately, the curve-type index in the daily load characteristic index, such as the daily load characteristic curve, will be used to analyze the load characteristic of the electric residential user.

S5、分析区域电力系统负荷特性;本实施例分析区域电力系统负荷特性方法与步骤S4中分析每类电力居民用户负荷特性一致,以日负荷特性曲线描述区域电力系统负荷特性。S5. Analyze the load characteristics of the regional power system; the method for analyzing the load characteristics of the regional power system in this embodiment is consistent with the analysis of the load characteristics of each type of electricity residential user in step S4, and the daily load characteristic curve is used to describe the load characteristics of the regional power system.

S6、对比分析电力系统负荷曲线和每类电力居民用户负荷曲线,确定每类电力居民用户用电类型,包括迎峰用电型用户,部分迎峰用电型用户和平稳用电型用户。S6. Comparatively analyze the load curve of the power system and the load curve of each type of electricity residential user, and determine the electricity consumption type of each type of electricity residential user, including peak-consuming electricity users, some peak-consuming electricity users and stable electricity users.

本实施例基于上述分类结果,在售电侧放开的市场环境下,针对不同类型的用户群体,设计差异化的用电套餐及服务策略,为市场化售电未雨绸缪,具体包括:Based on the above classification results, this embodiment designs differentiated electricity consumption packages and service strategies for different types of user groups under the market environment where the electricity sales side is open, so as to plan ahead for market-oriented electricity sales, specifically including:

S7、根据每类电力居民用户用电类型,推送合理的售电方案。S7. According to the electricity consumption type of each type of electricity residential user, push a reasonable electricity sales plan.

下面通过具体案例说明本实施例。The present embodiment is described below through a specific case.

本案例以某地区为例,按照上述实施例的思路,首先搜集电力居民用户2498个日负荷曲线,采样时间间隔为1小时,日负荷数据共包含24个有功功率点;其次对负荷数据样本进行数据填补、删除无效数据样本并归一化处理。经数据处理,该样本数据中存在30个无效样本,2468个有效样本。This case takes a certain area as an example. According to the idea of the above embodiment, firstly collect 2498 daily load curves of electric residential users, the sampling interval is 1 hour, and the daily load data contains 24 active power points in total; Data padding, removal of invalid data samples and normalization. After data processing, there are 30 invalid samples and 2468 valid samples in the sample data.

将归一化处理后的2468个样本输入到K-means聚类模型。使用K-means聚类进行聚类分析时,首先需要指定K值取5进行聚类分析,则各个聚类类别中所包含的个案数目如表1所示。The normalized 2468 samples were input into the K-means clustering model. When using K-means clustering for cluster analysis, it is first necessary to specify a K value of 5 for cluster analysis, and the number of cases included in each cluster category is shown in Table 1.

表1、各个聚类中的个案数目Table 1. Number of cases in each cluster

Figure BDA0002056138930000071
Figure BDA0002056138930000071

基于聚类结果,得到确定的各个聚类中心曲线及其对应的用户差异化用电特征如下。Based on the clustering results, the determined cluster center curves and their corresponding differentiated power consumption characteristics of users are as follows.

(1)第一类结果用户数量为810,占有效样本数的32.8%。如图3所示,该类用户的日负荷曲线表现较为平稳,在全天24小时内基本稳定,波动幅度不大。(1) The number of users in the first category of results is 810, accounting for 32.8% of the valid samples. As shown in Figure 3, the daily load curve of this type of users is relatively stable, basically stable within 24 hours of the day, and the fluctuation range is not large.

(2)第二类结果用户数量为20,占有效样本数的0.8%。如图4所示,该类用户的日负荷曲线表现为“两峰一谷”,且白天用电负荷较低,晚上用电负荷较高。(2) The number of users of the second type of results is 20, accounting for 0.8% of the valid samples. As shown in Figure 4, the daily load curve of such users is "two peaks and one valley", and the electricity load is low during the day and high at night.

(3)第三类结果用户数量为955,占有效样本数的38.7%。如图5所示,该类用户的日负荷曲线表现为“两峰一平一谷”,用电高峰分别出现在11点和20点左右。该类用户的日负荷曲线属于典型的迎峰负荷曲线,低谷时段用电比例较小,所以该类用户的基本负荷具有较大的负荷调控能力,应加强宣传教育,实施分时电价,提高居民参与错峰管理的意愿。(3) The number of users in the third category is 955, accounting for 38.7% of the valid samples. As shown in Figure 5, the daily load curve of such users is "two peaks, one level and one valley", and the peak electricity consumption occurs at around 11:00 and 20:00 respectively. The daily load curve of this type of user belongs to the typical peak load curve, and the proportion of electricity consumption during the trough period is small, so the basic load of this type of user has a large load regulation ability. Willingness to participate in peak shifting management.

(4)第4类结果用户数量为85,占有效样本数的3.4%。如图6所示,该类用户的日负荷曲线表现较为平稳,与第一类相似,波动幅度不大,但与第一类不同的是八点以后负荷呈下降趋势。(4) The number of users in the fourth category is 85, accounting for 3.4% of the valid samples. As shown in Figure 6, the daily load curve of this type of users is relatively stable, similar to the first type, and the fluctuation range is not large, but the difference from the first type is that the load shows a downward trend after eight o'clock.

(5)第五类结果用户数量为598,占有效样本数的24.3%,如图7所示,该类用户的日负荷曲线表现为“一峰一平一谷”,其负荷特性与第三类用户相似。该类型的负荷具有较大的负荷调控潜力。(5) The number of users in the fifth category is 598, accounting for 24.3% of the number of valid samples. As shown in Figure 7, the daily load curve of this category of users is “one peak, one level and one valley”, and its load characteristics are similar to those of the third category of users. resemblance. This type of load has great potential for load regulation.

在提取居民用户的典型负荷曲线后,即可依据相对个体的用电特征与群体用电特征相比较。本案例与采集当地的地区总负荷曲线进行比较,提出一种便于未来售电主体进行零售电价套餐设计的分类规则。本案例主要从衡量用户用电策略引导潜力出发,将用户的负荷曲线与地区总负荷曲线进行比较,重新梳理分类。对售电主体而言,这种分类方式有助于其对用户的分类管理,实现电力负荷结构优化,节省运行成本。After the typical load curve of residential users is extracted, the power consumption characteristics of relative individuals can be compared with the power consumption characteristics of groups. This case is compared with the collected local regional total load curve, and a classification rule is proposed to facilitate the design of retail electricity price packages for future electricity sellers. This case mainly starts from measuring the guiding potential of the user's electricity consumption strategy, compares the user's load curve with the regional total load curve, and re-categorizes. For the main body of electricity sales, this classification method is helpful for the classification management of users, realizes the optimization of the power load structure, and saves the operation cost.

对电力系统进行负荷分析时,提取该地区一周内电力系统的日负荷,以绘制成周负荷曲线,如图8所示,可以看出该地区日负荷曲线形状均呈现“三峰一谷”形态,能够显现出早、中、晚三个高峰,早高峰集中出现在9:00-11:00,持续时间较短;午高峰出现在14:00-18:00之间,持续时间较长;晚高峰出现在19:00-21:00之间,持续时间较短;低谷出现在1:00-6:00之间,持续时间较长。When carrying out load analysis on the power system, the daily load of the power system in the region within a week is extracted to draw a weekly load curve, as shown in Figure 8. It can be seen that the shape of the daily load curve in this region is in the form of "three peaks and one valley". It can show three peaks in the morning, middle and evening. The morning peak appears at 9:00-11:00 and lasts for a short time; the afternoon peak appears between 14:00-18:00 and lasts for a long time; The peak appears between 19:00-21:00 and the duration is shorter; the trough appears between 1:00-6:00 and the duration is longer.

通过对比分析居民日负荷特性曲线与系统日负荷特性曲线,从个体曲线形态与群体曲线形态的不同角度出发,考虑系统负荷特性的用户负荷特性分析可以更好的满足售电主体的需求,为其制定零售电价套餐以及增值服务提供依据,可以从将居民用户分为三类,即迎峰用电型用户、部分迎峰用电型用户和平稳用电型用户。By comparing and analyzing the daily load characteristic curve of residents and the daily load characteristic curve of the system, from the different perspectives of the individual curve shape and the group curve shape, the user load characteristic analysis considering the system load characteristic can better meet the needs of the main body of electricity sales. To formulate retail electricity price packages and provide basis for value-added services, residential users can be divided into three categories, namely, peak-consuming electricity users, partial peak-consuming electricity users, and stable electricity users.

(1)迎峰用电型用户:该类型的用户主要对应的是上述第三类用户和第五类用户,曲线峰段时间区间与地区总负荷曲线高度吻合,误差在30分钟内,早高峰与午高峰相对晚高峰最大值差距较大。(1) Peak power users: This type of users mainly corresponds to the third and fifth types of users mentioned above. The time interval of the peak period of the curve is highly consistent with the regional total load curve. The error is within 30 minutes. Compared with the afternoon peak, there is a big gap between the maximum value of the evening peak.

(2)部分迎峰用电型用户:该类型的用户主要对应的是第二类用户,曲线峰段时间区间与地区总负荷曲线基本吻合,相对总体负荷而言有避峰用电行为。(2) Some peak-consuming electricity users: This type of users mainly corresponds to the second type of users. The peak time interval of the curve is basically consistent with the regional total load curve, and there is a behavior of avoiding peak electricity consumption relative to the overall load.

(3)平稳用电型用户:该类型的用户主要对应的是第一类用户和第四类用户,一天中用电波动幅度不大。(3) Users of stable electricity consumption: This type of users mainly corresponds to the first and fourth types of users, and the fluctuation of electricity consumption in a day is not large.

实例结果表明,采用K-means聚类算法对电力居民用户的用电负荷数据进行相似度度量,以及对用户进行聚类分析,能够对用户的用电负荷曲线特征进行良好的度量和区分,实现用户负荷曲线的聚类和负荷特性分析,且K-means聚类算法在对电力居民用户负荷聚类分析时具有可伸缩性和高效性。在售电侧放开的市场环境下,针对不同类型的用户群体,设计差异化的用电套餐及服务策略,为市场化售电未雨绸缪,对提升企业竞争力具有非常重要的现实意义。The example results show that the K-means clustering algorithm is used to measure the similarity of the electricity load data of the electric residential users, and the cluster analysis of the users can be used to measure and distinguish the characteristics of the electricity load curve of the users. The clustering and load characteristic analysis of the user load curve, and the K-means clustering algorithm has scalability and high efficiency in the cluster analysis of the electricity residential user load. In the market environment where the electricity sales side is open, it is of great practical significance to improve the competitiveness of enterprises to design differentiated electricity consumption packages and service strategies for different types of user groups, and plan ahead for market-oriented electricity sales.

本发明不局限于上述最佳实施方式,任何人应该得知在本发明的启示下作出的结构变化,凡是与本发明具有相同或相近的技术方案,均落入本发明的保护范围之内。The present invention is not limited to the above-mentioned best embodiment, and anyone should know that structural changes made under the inspiration of the present invention, and all technical solutions that are the same or similar to the present invention, fall within the protection scope of the present invention.

Claims (10)

1. A method for classifying users of electric power residents based on K-means clustering is characterized by comprising the following steps:
determining regional electric power resident user load data;
preprocessing the data;
classifying the preprocessed data according to a K-means clustering algorithm;
analyzing the load characteristics of each type of electric power resident users based on the clustering result; analyzing the load characteristics of the regional power system;
and comparing and analyzing the load curve of the power system and the load curve of each type of power residential users, and determining the power consumption type of each type of power residential users.
2. The method as claimed in claim 1, further comprising pushing a reasonable electricity selling plan according to the electricity consumption type of each type of electric residents.
3. The method of claim 1, wherein the pre-processing the data comprises:
a1, deleting invalid records in the load data; negative value data and 0;
a2, deleting data with more continuous missing;
a3, processing individual missing values;
for the vacancy of the head-end and the tail-end data, completing by a trend proportion method;
and for the vacancy of the intermediate data, completing the vacancy by a non-adjacent mean generation method.
4. The method according to claim 3, wherein the deleting of the data with a large number of consecutive deletions is specifically:
at 15min intervals, 96 load values per day, samples with continuously missing more than 30 load values were deleted.
5. The method of claim 1, wherein the processed data is classified according to a K-means clustering algorithm; the method comprises the following steps:
carrying out normalization processing on the preprocessed load data, and forming a load data sample set;
and setting the clustering number, and classifying the load data sample set according to a K-means clustering algorithm.
6. The method according to claim 1, wherein said analyzing electrical residential customer load characteristics comprises:
and analyzing the load characteristics of each type of users based on the clustering result, and describing the load characteristics of the electric power residential users by daily load characteristic curves.
7. The method of claim 5, wherein normalizing the pre-processed load data comprises:
carrying out normalization processing on the preprocessed load data by adopting an extreme method, wherein a specific formula is as follows:
Figure FDA0002056138920000021
in the formula, X*Is normalized value, X is preprocessed load data, XmaxIs the maximum value in the preprocessed load data.
8. The method of claim 1 wherein said electrical residential consumer electricity usage pattern comprises: the peak-meeting power utilization type users, part of the peak-meeting power utilization type users and the stable power utilization type users.
9. The method according to claim 5, wherein the number of clusters K is 3-10.
10. Method according to claim 9, characterized in that the cluster number K is preferably 5.
CN201910389901.7A 2019-05-10 2019-05-10 Electric power resident user classification method based on K-means clustering Pending CN111915116A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910389901.7A CN111915116A (en) 2019-05-10 2019-05-10 Electric power resident user classification method based on K-means clustering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910389901.7A CN111915116A (en) 2019-05-10 2019-05-10 Electric power resident user classification method based on K-means clustering

Publications (1)

Publication Number Publication Date
CN111915116A true CN111915116A (en) 2020-11-10

Family

ID=73241909

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910389901.7A Pending CN111915116A (en) 2019-05-10 2019-05-10 Electric power resident user classification method based on K-means clustering

Country Status (1)

Country Link
CN (1) CN111915116A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114638284A (en) * 2022-02-17 2022-06-17 广西电网有限责任公司南宁供电局 Power utilization behavior characterization method considering external influence factors
CN114662761A (en) * 2022-03-24 2022-06-24 国网江苏省电力有限公司南通供电分公司 Load prediction-based participatory demand response load identification method and system
CN114966275A (en) * 2022-05-20 2022-08-30 国网江苏省电力有限公司苏州供电分公司 Non-invasive resident load monitoring method and device
CN115374857A (en) * 2022-08-23 2022-11-22 国网山东省电力公司烟台供电公司 Classification method and device for power consumption behaviors of power customers
CN115481786A (en) * 2022-08-30 2022-12-16 国网天津市电力公司 Power distribution network user load characteristic classification method based on data acquisition
CN116257770A (en) * 2022-11-25 2023-06-13 云南电网有限责任公司电力科学研究院 User clustering method and device, equipment and storage medium based on power load

Citations (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6233719B1 (en) * 1997-10-27 2001-05-15 Kla-Tencor Corporation System and method for analyzing semiconductor production data
FR3001850A1 (en) * 2013-02-04 2014-08-08 Commissariat Energie Atomique METHOD FOR MODIFYING THE STATE OF LOCAL ACCESS POINTS IN A CELLULAR NETWORK
US20150235143A1 (en) * 2003-12-30 2015-08-20 Kantrack Llc Transfer Learning For Predictive Model Development
CN105117810A (en) * 2015-09-24 2015-12-02 国网福建省电力有限公司泉州供电公司 Residential electricity consumption mid-term load prediction method under multistep electricity price mechanism
CN106022578A (en) * 2016-05-12 2016-10-12 国网天津市电力公司 Residential electricity peak-valley-flat dividing method based on data dimension increasing and K-means clustering
CN106096805A (en) * 2016-05-10 2016-11-09 华北电力大学 A kind of residential electricity consumption load classification method based on entropy assessment feature selection
CN106127360A (en) * 2016-06-06 2016-11-16 国网天津市电力公司 A kind of multi-model load forecasting method analyzed based on user personality
CN106600119A (en) * 2016-11-30 2017-04-26 南方电网科学研究院有限责任公司 K-means-based power user clustering method and device
CN107358368A (en) * 2017-07-21 2017-11-17 国网四川省电力公司眉山供电公司 A kind of robust k means clustering methods towards power consumer subdivision
US20180189612A1 (en) * 2016-12-29 2018-07-05 Paypal, Inc. System and method for learning from the images of raw data
CN108256724A (en) * 2017-11-27 2018-07-06 国网浙江省电力公司嵊泗县供电公司 A kind of power distribution network based on dynamic industry coefficient can open method for planning capacity
CN108334887A (en) * 2017-01-19 2018-07-27 腾讯科技(深圳)有限公司 A kind of user's choosing method and device
CN108681973A (en) * 2018-05-14 2018-10-19 广州供电局有限公司 Sorting technique, device, computer equipment and the storage medium of power consumer
CN108875801A (en) * 2018-05-29 2018-11-23 深圳万智联合科技有限公司 A kind of Classification of Load Curves system based on smart grid
CN109146252A (en) * 2018-07-25 2019-01-04 国家电网有限公司 Cluster analysis method and system of user's electricity consumption behavior based on regulation potential index
CN109508858A (en) * 2018-09-30 2019-03-22 国网安徽省电力有限公司经济技术研究院 Residential households electricity consumption Intervention Strategy recommended method under a kind of timesharing step price policy
CN109522934A (en) * 2018-10-22 2019-03-26 云南电网有限责任公司 A kind of power consumer clustering method based on clustering algorithm
CN109634940A (en) * 2018-11-12 2019-04-16 国网天津市电力公司电力科学研究院 A kind of typical low pressure platform area's electricity consumption model building method based on magnanimity low-voltage platform area electricity consumption data
CN109685581A (en) * 2019-01-03 2019-04-26 江苏方天电力技术有限公司 A kind of large power customers electricity consumption behavior analysis method based on label clustering technology

Patent Citations (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6233719B1 (en) * 1997-10-27 2001-05-15 Kla-Tencor Corporation System and method for analyzing semiconductor production data
US20150235143A1 (en) * 2003-12-30 2015-08-20 Kantrack Llc Transfer Learning For Predictive Model Development
FR3001850A1 (en) * 2013-02-04 2014-08-08 Commissariat Energie Atomique METHOD FOR MODIFYING THE STATE OF LOCAL ACCESS POINTS IN A CELLULAR NETWORK
CN105117810A (en) * 2015-09-24 2015-12-02 国网福建省电力有限公司泉州供电公司 Residential electricity consumption mid-term load prediction method under multistep electricity price mechanism
CN106096805A (en) * 2016-05-10 2016-11-09 华北电力大学 A kind of residential electricity consumption load classification method based on entropy assessment feature selection
CN106022578A (en) * 2016-05-12 2016-10-12 国网天津市电力公司 Residential electricity peak-valley-flat dividing method based on data dimension increasing and K-means clustering
CN106127360A (en) * 2016-06-06 2016-11-16 国网天津市电力公司 A kind of multi-model load forecasting method analyzed based on user personality
CN106600119A (en) * 2016-11-30 2017-04-26 南方电网科学研究院有限责任公司 K-means-based power user clustering method and device
US20180189612A1 (en) * 2016-12-29 2018-07-05 Paypal, Inc. System and method for learning from the images of raw data
CN108334887A (en) * 2017-01-19 2018-07-27 腾讯科技(深圳)有限公司 A kind of user's choosing method and device
CN107358368A (en) * 2017-07-21 2017-11-17 国网四川省电力公司眉山供电公司 A kind of robust k means clustering methods towards power consumer subdivision
CN108256724A (en) * 2017-11-27 2018-07-06 国网浙江省电力公司嵊泗县供电公司 A kind of power distribution network based on dynamic industry coefficient can open method for planning capacity
CN108681973A (en) * 2018-05-14 2018-10-19 广州供电局有限公司 Sorting technique, device, computer equipment and the storage medium of power consumer
CN108875801A (en) * 2018-05-29 2018-11-23 深圳万智联合科技有限公司 A kind of Classification of Load Curves system based on smart grid
CN109146252A (en) * 2018-07-25 2019-01-04 国家电网有限公司 Cluster analysis method and system of user's electricity consumption behavior based on regulation potential index
CN109508858A (en) * 2018-09-30 2019-03-22 国网安徽省电力有限公司经济技术研究院 Residential households electricity consumption Intervention Strategy recommended method under a kind of timesharing step price policy
CN109522934A (en) * 2018-10-22 2019-03-26 云南电网有限责任公司 A kind of power consumer clustering method based on clustering algorithm
CN109634940A (en) * 2018-11-12 2019-04-16 国网天津市电力公司电力科学研究院 A kind of typical low pressure platform area's electricity consumption model building method based on magnanimity low-voltage platform area electricity consumption data
CN109685581A (en) * 2019-01-03 2019-04-26 江苏方天电力技术有限公司 A kind of large power customers electricity consumption behavior analysis method based on label clustering technology

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
朱天博;傅军;杨一帆;孙志杰;周辛南;: "基于用电信息采集系统用户负荷特性聚类分析", 电测与仪表, no. 1, pages 70 *
韦钢, 王飞, 张永健, 符杨: "负荷预测中历史数据缺损处理", 电力科学与工程, no. 01, pages 16 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114638284A (en) * 2022-02-17 2022-06-17 广西电网有限责任公司南宁供电局 Power utilization behavior characterization method considering external influence factors
CN114662761A (en) * 2022-03-24 2022-06-24 国网江苏省电力有限公司南通供电分公司 Load prediction-based participatory demand response load identification method and system
CN114966275A (en) * 2022-05-20 2022-08-30 国网江苏省电力有限公司苏州供电分公司 Non-invasive resident load monitoring method and device
CN115374857A (en) * 2022-08-23 2022-11-22 国网山东省电力公司烟台供电公司 Classification method and device for power consumption behaviors of power customers
CN115481786A (en) * 2022-08-30 2022-12-16 国网天津市电力公司 Power distribution network user load characteristic classification method based on data acquisition
CN116257770A (en) * 2022-11-25 2023-06-13 云南电网有限责任公司电力科学研究院 User clustering method and device, equipment and storage medium based on power load

Similar Documents

Publication Publication Date Title
CN111915116A (en) Electric power resident user classification method based on K-means clustering
Yang et al. Identifying household electricity consumption patterns: A case study of Kunshan, China
CN110781332A (en) Clustering method of daily load curve of electric residential users based on compound clustering algorithm
CN111860600A (en) A method for selecting user power consumption characteristics based on maximum correlation and minimum redundancy criterion
CN112819299A (en) Differential K-means load clustering method based on center optimization
CN109472404A (en) A method, model, device and system for short-term prediction of power load
CN110119948B (en) Power user credit evaluation method and system based on dynamic combination of time-varying weights
CN114519514B (en) A method, system and computer equipment for calculating reasonable line loss values in low-voltage station areas
CN111832861A (en) Residential load variable scale portrait method and system based on big data platform
CN108280541A (en) Customer service strategies formulating method, device based on random forest and decision tree
CN103577883A (en) Grid-load intelligent interaction method and device
CN108846530A (en) One kind being based on the short-term load forecasting method of " cluster-recurrence " model
CN108389069A (en) Top-tier customer recognition methods based on random forest and logistic regression and device
CN108492134A (en) The big data user power utilization behavior analysis system integrated based on multicycle regression tree
CN110321934A (en) Method and system for detecting abnormal data of user electricity consumption
WO2020125106A1 (en) Similarity model-based data processing method and system
Shamim et al. Multi-domain feature extraction for improved clustering of smart meter data
CN113094448B (en) Analysis methods and analysis devices and electronic equipment for residential vacancy status
CN111612228A (en) An analysis method of user's electricity consumption behavior based on electricity consumption information
CN109934301A (en) A kind of power load aggregation analysis method, device and equipment
CN108364191A (en) Top-tier customer Optimum Identification Method and device based on random forest and logistic regression
CN118052348A (en) Low-carbon optimization method, medium and system for green electricity consumption based on electricity consumption characteristics
CN111897888B (en) Household change relationship identification method based on Spark framework and agglomerative hierarchical clustering algorithm
CN113935399B (en) A two-stage ensemble clustering algorithm for mining regional energy load power consumption patterns
CN119622383A (en) An industrial park load analysis method and system based on optimized K-means clustering

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201110

RJ01 Rejection of invention patent application after publication