[go: up one dir, main page]

CN115422146B - A Construction and Application Method of Standardized Database for Enterprises in Xinjiang Region - Google Patents

A Construction and Application Method of Standardized Database for Enterprises in Xinjiang Region Download PDF

Info

Publication number
CN115422146B
CN115422146B CN202210649087.XA CN202210649087A CN115422146B CN 115422146 B CN115422146 B CN 115422146B CN 202210649087 A CN202210649087 A CN 202210649087A CN 115422146 B CN115422146 B CN 115422146B
Authority
CN
China
Prior art keywords
data
under
current
enterprise
residual
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210649087.XA
Other languages
Chinese (zh)
Other versions
CN115422146A (en
Inventor
赵佳琪
李晶
侯韩芳
王春艳
韩冰
张雪飞
王玉鸾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China National Institute of Standardization
Original Assignee
China National Institute of Standardization
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China National Institute of Standardization filed Critical China National Institute of Standardization
Priority to CN202210649087.XA priority Critical patent/CN115422146B/en
Publication of CN115422146A publication Critical patent/CN115422146A/en
Application granted granted Critical
Publication of CN115422146B publication Critical patent/CN115422146B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/211Schema design and management
    • G06F16/212Schema design and management with details for data modelling support
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to the technical field of enterprise standardized database construction, and discloses a method for constructing and applying an enterprise standardized database in Xinjiang, which comprises the following steps: s1, acquiring a data feature a for constructing a standardized database and a data feature b under the current enterprise running state in the Internet, and generating a data table under the relevance of the current data feature a and the data feature b. According to the method, through the relevance among different data characteristics, an abnormal data generation source is generated according to a real digital interval, an abnormal data cleaning instruction is obtained, abnormal data is cleaned at regular time, so that an enterprise database can be constructed in a standardized mode, the decision efficiency of an enterprise is greatly improved, and the normalized data characteristics under the current construction standard can be supplemented by automatically repairing the residual data information which does not reach the fit standard, so that a normal operation work frame under the current enterprise standardized database is established.

Description

一种新疆地区企业标准化数据库构建及应用方法A Construction and Application Method of Standardized Database for Enterprises in Xinjiang Region

技术领域technical field

本发明涉及企业标准化数据库构建技术领域,具体为一种新疆地区企业标准化数据库构建及应用方法。The invention relates to the technical field of enterprise standardization database construction, in particular to a construction and application method of enterprise standardization database in Xinjiang region.

背景技术Background technique

新疆维吾尔自治区,简称新,中国5个少数民族自治区之一,位于西北地区,首府乌鲁木齐,新疆作为西部大开发的重点地区和主要的能源生产基地之一,有着丰富的风能和太阳能资源,是我国八大千万千瓦风电基地之一。但是新疆风电场和光伏电站建设地点大都远离负荷中心,和负荷中心的地理位置差异较大,呈现逆向分布的特点。该特点决定了新疆风电/光伏开发的模式是以大规模集中开发,大量风电场、光伏电站汇集至同一母线,由变压器升至高电压后,远距离输送至负荷中心进行消纳为主。目前,对于上千公里的中间无落点的电力输送来说,直流输电相对于交流输电是一种更可靠、经济的输电方式,其输送距离可达到2500km以上,能满足风电和光伏基地远距离输送电能至负荷中心的需要,而随着新疆地区的经济不断发展,众多的企业应运而生,而众多的企业在发展的同时,数据库构建和应用问题显得格外重要。Xinjiang Uygur Autonomous Region, referred to as Xin, is one of the five ethnic minority autonomous regions in China. It is located in the northwest region and its capital is Urumqi. One of the eight 10 million-kilowatt wind power bases. However, the construction sites of wind farms and photovoltaic power stations in Xinjiang are mostly far away from the load center, and the geographical location of the load center is quite different, showing the characteristics of reverse distribution. This feature determines that the wind power/photovoltaic development model in Xinjiang is based on large-scale centralized development. A large number of wind farms and photovoltaic power stations are gathered on the same bus, and after being raised to high voltage by transformers, they are transported to the load center for consumption over long distances. At present, DC transmission is a more reliable and economical power transmission method than AC transmission for thousands of kilometers of power transmission without a drop point in the middle. The need to transmit electric energy to the load center, and with the continuous development of the economy in Xinjiang, many enterprises have emerged as the times require, and while many enterprises are developing, the issue of database construction and application is particularly important.

目前的数据库难以建立在标准化的模式下进行构建,企业无法建立完整的大数据信息指标数据库,无法提高企业决策效率,而且难以对未达吻合标准的剩余数据信息进行自动修复,无法补齐当前构建标准下的常态化数据特征,进而无法建立起当前企业标准化数据库下的常态运行工作框架。The current database is difficult to build in a standardized mode. Enterprises cannot establish a complete database of big data information indicators, which cannot improve the efficiency of corporate decision-making. Moreover, it is difficult to automatically repair the remaining data information that does not meet the standards, and it is impossible to complete the current construction. The normalized data characteristics under the standard make it impossible to establish a normal operation work framework under the current enterprise standardized database.

发明内容Contents of the invention

本发明的目的在于提供一种新疆地区企业标准化数据库构建及应用方法,以解决上述背景技术中提出的问题。The purpose of the present invention is to provide a method for constructing and applying a standardized database for enterprises in Xinjiang, so as to solve the problems raised in the above-mentioned background technology.

为实现上述目的,本发明提供如下技术方案:一种新疆地区企业标准化数据库构建及应用方法,包括以下步骤:In order to achieve the above object, the present invention provides the following technical solutions: a method for constructing and applying a standardized database for enterprises in Xinjiang, comprising the following steps:

S1、采集互联网中具备构建标准化数据库的数据特征a以及当前企业运行状态下的数据特征b,生成当前数据特征a和数据特征b具有关联性下的数据表,在数据表上建立多组数据展示模型,并按照1、2......N的数字编号赋予多组数据展示模型唯一的编号;S1. Collect the data feature a and the data feature b under the current operating state of the enterprise in the Internet, generate a data table with the correlation between the current data feature a and data feature b, and create multiple sets of data display on the data table Model, and assign unique numbers to multiple sets of data display models according to the numbers of 1, 2...N;

S2、依据建立的多组数据展示模型对当前数据特征a和数据特征b进行深层次的解析,产生关联性下的变量类型以及产生相同属性的MDS元素,将变量类型通过转化器进行数据转化后,复制在MDS元素中生成标准格式下的数据源;S2. Perform in-depth analysis of the current data feature a and data feature b based on the established multi-group data display model, generate the variable type under the correlation and the MDS element with the same attribute, and convert the variable type through the converter for data conversion , copy the data source in the standard format generated in the MDS element;

S3、查询未产生关联性下的剩余变量类型以及未产生相同属性的剩余元素,将剩余变量类型以及剩余元素在数据展示模型上进行划分和分区域标记,通过冥函数模拟演示剩余变量类型下的剩余变量数据未来变化趋势线,并通过导函数模拟演示剩余元素下的剩余数据未来变化趋势线,计算剩余变量数据未来变化趋势线和剩余数据未来变化趋势线之间的真实数字区间,并建立二者之间的阈值范围;S3. Query the remaining variable types without correlation and the remaining elements without the same attributes, divide and mark the remaining variable types and remaining elements on the data display model, and demonstrate the remaining variable types through the ghost function simulation The future change trend line of the remaining variable data, and simulate and demonstrate the future change trend line of the remaining data under the remaining elements through the derivative function, calculate the real digital interval between the future change trend line of the remaining variable data and the future change trend line of the remaining data, and establish two the threshold range between

S4、依据互联网中标准化数据库的构建标准,对剩余变量类型以及剩余元素进行梳理,选取与构建标准吻合度超过85-95%的剩余数据信息,采用数据循环策略的方式剔除未达吻合标准的数据信息,并对剩余数据信息进行自动修复,补齐当前构建标准下的常态化数据特征,并结合数据源,建立起当前企业标准化数据库下的常态运行工作框架;S4. According to the construction standard of the standardized database in the Internet, sort out the remaining variable types and remaining elements, select the remaining data information with a degree of agreement with the construction standard of more than 85-95%, and use the data circulation strategy to eliminate the data that does not meet the standard. Information, and automatically repair the remaining data information, complete the normalized data characteristics under the current construction standard, and combine the data source to establish a normal operation work framework under the current enterprise standardized database;

S5、将建立的常态运行工作框架传输至企业工作模型架构中,间歇性发送正常操作指令,观察企业运行数据是否出现异常,并通过编辑常态运行工作框架下的工作逻辑,建立与企业工作模型架构中的数据采集端口,通过远程终端采集的方式对日常运行数据进行实时采集和监测,生成采集点和监测点,并通过数据编码解析出最终的指标库。S5. Transfer the established normal operation work framework to the enterprise work model architecture, send normal operation instructions intermittently, observe whether the enterprise operation data is abnormal, and establish a work model architecture consistent with the enterprise work model by editing the work logic under the normal operation work framework The data collection port in the remote terminal collects and monitors the daily operation data in real time, generates collection points and monitoring points, and analyzes the final index library through data coding.

可选的,所述阈值范围内,根据真实数字区间生成异常数据发生源,按照异常数据发生源出现的次数,对异常数据发生源和标准化数据库的数据特征a进行对比,得到异常数据清洗指令,并对异常数据进行定时清洗。Optionally, within the threshold range, the source of abnormal data is generated according to the real number interval, and the source of abnormal data is compared with the data feature a of the standardized database according to the number of occurrences of the source of abnormal data to obtain an abnormal data cleaning instruction, And regularly clean abnormal data.

可选的,所述数据源在生成后,通过分布式网络探针搜寻到当前数据特征下的数据信息,同时记录下网络探针的所有搜寻路径,根据得到的所有搜寻路径建立当前数据信息运行轨迹下的基本逻辑。Optionally, after the data source is generated, the distributed network probes search for the data information under the current data characteristics, record all the search paths of the network probes at the same time, and establish the current data information operation according to all the obtained search paths The basic logic under the trajectory.

可选的,所述补齐当前构建标准下的常态化数据特征后,还应对常态化数据特征进行质量检查和标准化数据训练,并得到标准化数据训练集。Optionally, after completing the normalized data features under the current construction standard, quality inspection and standardized data training should be performed on the normalized data features to obtain a standardized data training set.

可选的,所述指标库在解析前,通过对日常运行数据进行采集和监测,并创建当前日常运行数据下的多个服务终端以及服务接口,对日常运行数据进行脱敏和信号放大后,形成最终的指标库。Optionally, before parsing, the indicator library collects and monitors the daily operation data, creates multiple service terminals and service interfaces under the current daily operation data, desensitizes the daily operation data and amplifies the signal, Form the final indicator library.

可选的,所述采集点和监测点在生成后,还应自动梳理出当前数据采集端口下的多组特征信息,按照类型和存储大小对多组特征信息进行重排和分类后,建立多组特征信息下的独立传输通道。Optionally, after the collection points and monitoring points are generated, multiple sets of feature information under the current data collection port should be automatically sorted out, and multiple sets of feature information are rearranged and classified according to type and storage size, and multiple sets of feature information are established. Independent transmission channels under group feature information.

可选的,所述指标库在解析后,在指标库中生成日常运行数据下的索引栏,根据索引栏中对应的数据信息建立数据触发按钮,当数据触发按钮产生指令后,自动选取指标库中的重复数据并产生额外参数,并对额外参数进行格式化处理。Optionally, after the analysis of the index library, an index column under the daily operation data is generated in the index library, and a data trigger button is established according to the corresponding data information in the index column. When the data trigger button generates an instruction, the index library is automatically selected Duplicate data in and generate extra parameters, and format the extra parameters.

可选的,所述常态运行工作框架是基于数据源下的多个数据元素进行创建的,其中数据元素包含数据长度、数据名称、数据类型、数据格式、数据起始位置和数据终端位置。Optionally, the normal operation work frame is created based on multiple data elements under the data source, where the data elements include data length, data name, data type, data format, data start position and data end position.

与现有技术相比,本发明的有益效果是:该方法通过不同数据特征之间的关联性,根据真实数字区间生成异常数据发生源,得到异常数据清洗指令,并对异常数据进行定时清洗,使得企业数据库得以在标准化的模式下进行构建,有利于建立完整的大数据信息指标数据库,大大提高企业决策效率,而且通过对未达吻合标准的剩余数据信息进行自动修复,能够补齐当前构建标准下的常态化数据特征,进而建立起当前企业标准化数据库下的常态运行工作框架,使得企业数据库更加标准化和流程化,有利于企业各部门信息的精准管理,具备一定的市场推广前景。Compared with the prior art, the beneficial effect of the present invention is: the method generates abnormal data occurrence sources according to the real digital interval through the correlation between different data features, obtains abnormal data cleaning instructions, and regularly cleans the abnormal data, It enables the enterprise database to be constructed in a standardized mode, which is conducive to the establishment of a complete big data information index database, greatly improving the efficiency of enterprise decision-making, and by automatically repairing the remaining data information that does not meet the standard, it can complement the current construction standard The characteristics of normalized data under the normalized data, and then establish the normal operation framework under the current enterprise standardized database, making the enterprise database more standardized and streamlined, which is conducive to the precise management of information in various departments of the enterprise, and has certain market promotion prospects.

具体实施方式Detailed ways

下面对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The following clearly and completely describes the technical solutions in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

本发明提供一种新疆地区企业标准化数据库构建及应用方法,包括以下步骤:The invention provides a method for constructing and applying a standardized database for enterprises in Xinjiang, comprising the following steps:

S1、采集互联网中具备构建标准化数据库的数据特征a以及当前企业运行状态下的数据特征b,生成当前数据特征a和数据特征b具有关联性下的数据表,在数据表上建立多组数据展示模型,并按照1、2......N的数字编号赋予多组数据展示模型唯一的编号;S1. Collect the data feature a and the data feature b under the current operating state of the enterprise in the Internet, generate a data table with the correlation between the current data feature a and data feature b, and create multiple sets of data display on the data table Model, and assign unique numbers to multiple sets of data display models according to the numbers of 1, 2...N;

S2、依据建立的多组数据展示模型对当前数据特征a和数据特征b进行深层次的解析,产生关联性下的变量类型以及产生相同属性的MDS元素,将变量类型通过转化器进行数据转化后,复制在MDS元素中生成标准格式下的数据源;S2. Perform in-depth analysis of the current data feature a and data feature b based on the established multi-group data display model, generate the variable type under the correlation and the MDS element with the same attribute, and convert the variable type through the converter for data conversion , copy the data source in the standard format generated in the MDS element;

S3、查询未产生关联性下的剩余变量类型以及未产生相同属性的剩余元素,将剩余变量类型以及剩余元素在数据展示模型上进行划分和分区域标记,通过冥函数模拟演示剩余变量类型下的剩余变量数据未来变化趋势线,并通过导函数模拟演示剩余元素下的剩余数据未来变化趋势线,计算剩余变量数据未来变化趋势线和剩余数据未来变化趋势线之间的真实数字区间,并建立二者之间的阈值范围,阈值范围内,根据真实数字区间生成异常数据发生源,按照异常数据发生源出现的次数,对异常数据发生源和标准化数据库的数据特征a进行对比,得到异常数据清洗指令,并对异常数据进行定时清洗;S3. Query the remaining variable types without correlation and the remaining elements without the same attributes, divide and mark the remaining variable types and remaining elements on the data display model, and demonstrate the remaining variable types through the ghost function simulation The future change trend line of the remaining variable data, and simulate and demonstrate the future change trend line of the remaining data under the remaining elements through the derivative function, calculate the real digital interval between the future change trend line of the remaining variable data and the future change trend line of the remaining data, and establish two Within the threshold range, the abnormal data source is generated according to the real number interval, and the abnormal data source is compared with the data feature a of the standardized database according to the number of occurrences of the abnormal data source, and the abnormal data cleaning instruction is obtained. , and regularly clean abnormal data;

S4、依据互联网中标准化数据库的构建标准,对剩余变量类型以及剩余元素进行梳理,选取与构建标准吻合度超过85-95%的剩余数据信息,采用数据循环策略的方式剔除未达吻合标准的数据信息,并对剩余数据信息进行自动修复,补齐当前构建标准下的常态化数据特征,并结合数据源,建立起当前企业标准化数据库下的常态运行工作框架;S4. According to the construction standard of the standardized database in the Internet, sort out the remaining variable types and remaining elements, select the remaining data information with a degree of agreement with the construction standard of more than 85-95%, and use the data circulation strategy to eliminate the data that does not meet the standard. Information, and automatically repair the remaining data information, complete the normalized data characteristics under the current construction standard, and combine the data source to establish a normal operation work framework under the current enterprise standardized database;

S5、将建立的常态运行工作框架传输至企业工作模型架构中,间歇性发送正常操作指令,观察企业运行数据是否出现异常,并通过编辑常态运行工作框架下的工作逻辑,建立与企业工作模型架构中的数据采集端口,通过远程终端采集的方式对日常运行数据进行实时采集和监测,生成采集点和监测点,并通过数据编码解析出最终的指标库。S5. Transfer the established normal operation work framework to the enterprise work model architecture, send normal operation instructions intermittently, observe whether the enterprise operation data is abnormal, and establish a work model architecture consistent with the enterprise work model by editing the work logic under the normal operation work framework The data collection port in the remote terminal collects and monitors the daily operation data in real time, generates collection points and monitoring points, and analyzes the final index library through data coding.

进一步的:数据源在生成后,通过分布式网络探针搜寻到当前数据特征下的数据信息,同时记录下网络探针的所有搜寻路径,根据得到的所有搜寻路径建立当前数据信息运行轨迹下的基本逻辑,同时构建基本逻辑下的编辑机制和操作权限,对数据信息的内容判断和流程判断,分析出当前数据信息运行轨迹是否发生偏移,其中内容判断和流程判断前,还应对当前检测人员基本信息进行采集,并分析当前检测人员是否具有判断权限。Further: After the data source is generated, the data information under the current data characteristics is searched through the distributed network probes, and all the search paths of the network probes are recorded at the same time, and the current data information running track is established according to all the search paths obtained. Basic logic, at the same time build the editing mechanism and operation authority under the basic logic, judge the content and process of data information, and analyze whether the current data information running track has shifted. Before the content judgment and process judgment, the current inspection personnel should also be dealt with Collect basic information and analyze whether the current testing personnel have judgment authority.

进一步的:补齐当前构建标准下的常态化数据特征后,还应对常态化数据特征进行质量检查和标准化数据训练,并得到标准化数据训练集。Further: After completing the normalized data features under the current construction standards, quality inspection and standardized data training should be carried out on the normalized data features, and a standardized data training set should be obtained.

进一步的:指标库在解析前,通过对日常运行数据进行采集和监测,并创建当前日常运行数据下的多个服务终端以及服务接口,对日常运行数据进行脱敏和信号放大后,形成最终的指标库。Further: before the analysis of the indicator library, the daily operation data is collected and monitored, and multiple service terminals and service interfaces under the current daily operation data are created, and the daily operation data is desensitized and the signal is amplified to form the final Indicator library.

进一步的:采集点和监测点在生成后,还应自动梳理出当前数据采集端口下的多组特征信息,按照类型和存储大小对多组特征信息进行重排和分类后,建立多组特征信息下的独立传输通道,将该独立传输通道与外部服务器进行网络连接,利用外部服务器中最优的传输线路对独立传输通道进行实时优化,获知当前独立传输通道下的传输路径,将最优传输线路发送至独立传输通道内进行通道构建。Further: After the collection points and monitoring points are generated, they should also automatically sort out multiple sets of feature information under the current data collection port, rearrange and classify multiple sets of feature information according to type and storage size, and establish multiple sets of feature information Under the independent transmission channel, connect the independent transmission channel with the external server, use the optimal transmission line in the external server to optimize the independent transmission channel in real time, know the transmission path under the current independent transmission channel, and connect the optimal transmission line Send to an independent transmission channel for channel construction.

进一步的:指标库在解析后,在指标库中生成日常运行数据下的索引栏,根据索引栏中对应的数据信息建立数据触发按钮,当数据触发按钮产生指令后,自动选取指标库中的重复数据并产生额外参数,并对额外参数进行格式化处理,然后对额外参数进行可视化界面查询,同时添加新的资源参数对可视化界面进行渲染和制作,其中,可视化查询前需要对当前页面进行数据编辑和数据规划,数据规划包含数据参数和数据属性。Further: After the indicator library is analyzed, the index column under the daily operation data is generated in the indicator library, and the data trigger button is established according to the corresponding data information in the index column. When the data trigger button generates an instruction, the repetition in the indicator library is automatically selected. Data and generate additional parameters, and format the additional parameters, then perform visual interface query on the additional parameters, and add new resource parameters to render and make the visual interface. Among them, the data editing of the current page is required before the visual query And data planning, data planning includes data parameters and data attributes.

进一步的:常态运行工作框架是基于数据源下的多个数据元素进行创建的,其中数据元素包含数据长度、数据名称、数据类型、数据格式、数据起始位置和数据终端位置。Further: The normal operation work framework is created based on multiple data elements under the data source, where the data elements include data length, data name, data type, data format, data start position and data end position.

以新疆地区的某化工企业A为例进行数据库的构建说明:Take a chemical company A in Xinjiang as an example to explain the construction of the database:

企业数据特征分类众多,标准数据库的构建从不同的数据归属进行,以其中人力数据为例进行具体说明,There are many classifications of enterprise data characteristics, and the construction of standard databases is carried out from different data attributions. Take the human data as an example for specific illustrations.

首先,设定化工企业中具备构建人力标准化数据库的数据特征a为标准人员分布,具体为研发人员20~30%,普通技术人员30~40%,操作工人40~50%,后勤保障人员10~20%;当前A企业运行状态下的数据特征b为实际人员分布,具体为研发人员20%,普通技术人员30%,操作工人40%,后勤保障人员10%;First of all, set the data characteristic a of building a standardized human resource database in chemical enterprises as the distribution of standard personnel, specifically 20-30% of R&D personnel, 30-40% of ordinary technical personnel, 40-50% of operating workers, and 10-50% of logistics support personnel. 20%; the data feature b under the current operating status of enterprise A is the actual personnel distribution, specifically 20% for R&D personnel, 30% for ordinary technical personnel, 40% for operating workers, and 10% for logistics support personnel;

根据上述的数据特征生成当前人力分布数据特征a和实际人员数据特征b具有关联性下的数据表,在数据表上建立1、研发人员,2、普通技术人员,3、操作工人,4、后勤保障人员的多组数据展示模型;According to the above data characteristics, generate a data table under the correlation between the current manpower distribution data feature a and the actual personnel data feature b, and establish 1. R&D personnel, 2. Ordinary technical personnel, 3. Operators, 4. Logistics on the data table Multiple sets of data display models for security personnel;

依据建立的上述4组数据展示模型对当前标准人员分布数据特征a和实际人员分布数据特征b进行深层次的解析,发现学历分布为与人员分布产生关联性下的变量类型以及产生相同属性的MDS元素,将变量类型通过转化器进行数据转化后,将学历分布的数据复制在MDS元素中生成标准格式下的数据源;产生相同属性的MDS元素的学历为本科以上、本科;本科以上的基本进入研发人员;本科基本进入普通技术人员;Based on the above-mentioned 4 sets of data display models established above, the current standard personnel distribution data feature a and the actual personnel distribution data feature b are deeply analyzed, and it is found that the educational background distribution is a variable type that is related to the personnel distribution and MDS that produces the same attribute Element, after the variable type is converted into data by the converter, copy the data of educational background distribution in the MDS element to generate a data source in a standard format; the educational background of the MDS element with the same attribute is undergraduate or above; undergraduate or above basic entry R & D personnel; undergraduates basically enter ordinary technical personnel;

查询未产生关联性下的剩余变量类型以及未产生相同属性的剩余元素为本科以下,将剩余变量类型以及剩余元素即本科以下的人员在数据展示模型上进行划分和分区域标记,本科以下的人员可作为助理进入研发人员中,可做为助理进入普通技术人员中,可进入操作工人中,可进入后勤保障人员中;Query the remaining variable types without correlation and the remaining elements that do not produce the same attributes as undergraduates, and divide and mark the remaining variable types and remaining elements, that is, the personnel below the undergraduates on the data display model, and the personnel below the undergraduates Can be used as an assistant to join the research and development personnel, can be used as an assistant to join the ordinary technicians, can join the operation workers, and can join the logistics support staff;

通过冥函数模拟演示剩余变量类型下的剩余变量数据未来变化趋势线,并通过导函数模拟演示剩余元素本科以下人员的剩余数据未来变化趋势线,计算剩余变量数据未来变化趋势线和剩余数据未来变化趋势线之间的真实数字区间,建立二者之间的阈值范围,例如在本企业A中,本科以下人员在科研人员中比例阈值范围为0~5%,根据真实数字区间生成异常数据发生源,按照异常数据发生源出现的次数,对本科以下学历人员在科研人员中占比异常数据发生源和标准化数据库的数据特征a进行对比,得到异常数据清洗指令,并对异常数据进行定时清洗;提示企业本科以下人员在科研人员中数据异常,需清理调整;Demonstrate the future change trend line of the remaining variable data under the remaining variable type through the ghost function simulation, and demonstrate the future change trend line of the remaining data of the remaining elements under the undergraduate degree through the derivative function simulation, and calculate the future change trend line of the remaining variable data and the future change of the remaining data For the real number interval between the trend lines, establish the threshold range between the two. For example, in this enterprise A, the threshold range of the proportion of scientific research personnel with undergraduates and below is 0-5%, and the source of abnormal data is generated according to the real number interval , according to the number of occurrences of abnormal data sources, compare the proportion of personnel with a bachelor’s degree or below in the proportion of abnormal data sources among scientific researchers and the data feature a of the standardized database, get the abnormal data cleaning instructions, and regularly clean the abnormal data; prompt The data of the scientific research personnel of the personnel below the undergraduate degree in the enterprise is abnormal, which needs to be cleaned up and adjusted;

依据互联网中标准化数据库的构建标准,采用上述的方式对剩余变量类型以及剩余元素进行梳理,即对本科以下人员在科研人员中的占比进行梳理,选取与构建标准吻合度超过85-95%的剩余数据信息,采用数据循环策略的方式剔除未达吻合标准的数据信息,并对剩余数据信息进行提醒,自动修复,补齐当前构建标准下的常态化数据特征,并结合数据源,建立起当前企业标准化数据库下的人员配置常态运行工作框架;According to the construction standard of the standardized database in the Internet, use the above method to sort out the remaining variable types and remaining elements, that is, to sort out the proportion of scientific research personnel with undergraduates and below, and select those with a degree of agreement with the construction standard of more than 85-95%. The remaining data information adopts the data circulation strategy to eliminate the data information that does not meet the standard, and reminds the remaining data information, automatically repairs, complements the normalized data characteristics under the current construction standard, and combines the data source to establish the current Work framework for normal operation of staffing under the enterprise standardized database;

将建立的人员配置常态运行工作框架传输至企业工作模型架构中,间歇性发送正常操作指令,观察企业运行数据是否出现异常,并通过编辑常态运行工作框架下的工作逻辑,建立与企业工作模型架构中的数据采集端口,通过远程终端采集的方式对日常运行数据进行实时采集和监测,生成采集点和监测点,并通过数据编码解析出最终的指标库。Transfer the established staffing normal operation work framework to the enterprise work model architecture, send normal operation instructions intermittently, observe whether there is any abnormality in the enterprise operation data, and establish an enterprise work model architecture by editing the work logic under the normal operation work framework The data collection port in the remote terminal collects and monitors the daily operation data in real time, generates collection points and monitoring points, and analyzes the final index library through data coding.

该方法通过不同数据特征之间的关联性,根据真实数字区间生成异常数据发生源,得到异常数据清洗指令,并对异常数据进行定时清洗,使得企业数据库得以在标准化的模式下进行构建,有利于建立完整的大数据信息指标数据库,大大提高企业决策效率,而且通过对未达吻合标准的剩余数据信息进行自动修复,能够补齐当前构建标准下的常态化数据特征,进而建立起当前企业标准化数据库下的常态运行工作框架,使得企业数据库更加标准化和流程化,有利于企业各部门信息的精准管理,具备一定的市场推广前景。Through the correlation between different data features, this method generates abnormal data sources according to the real digital interval, obtains abnormal data cleaning instructions, and regularly cleans abnormal data, so that enterprise databases can be constructed in a standardized mode, which is beneficial Establish a complete big data information index database, which greatly improves the decision-making efficiency of enterprises, and by automatically repairing the remaining data information that does not meet the standards, it can complement the normalized data characteristics under the current construction standards, and then establish the current enterprise standardized database Under the normal operation framework, the enterprise database is more standardized and streamlined, which is conducive to the precise management of information in various departments of the enterprise, and has certain market promotion prospects.

尽管已经示出和描述了本发明的实施例,对于本领域的普通技术人员而言,可以理解在不脱离本发明的原理和精神的情况下可以对这些实施例进行多种变化、修改、替换和变型,本发明的范围由所附权利要求及其等同物限定。Although the embodiments of the present invention have been shown and described, those skilled in the art can understand that various changes, modifications and substitutions can be made to these embodiments without departing from the principle and spirit of the present invention. and modifications, the scope of the invention is defined by the appended claims and their equivalents.

Claims (5)

1. The method for constructing and applying the enterprise standardized database in the Xinjiang area is characterized by comprising the following steps of:
s1, acquiring a data feature a for constructing a standardized database and a data feature b under the current enterprise running state in the Internet, generating a data table under the correlation of the current data feature a and the data feature b, establishing a plurality of groups of data display models on the data table, and endowing the plurality of groups of data display models with unique numbers according to the number numbers of 1 and 2.
S2, carrying out deep analysis on the current data feature a and the data feature b according to the established multi-group data display model, generating variable types under relevance and MDS elements with the same attribute, carrying out data conversion on the variable types through a converter, and copying the variable types into the MDS elements to generate a data source under a standard format;
s3, inquiring the residual variable types under the condition that no relevance is generated and the residual elements with the same attribute are not generated, dividing and zoning the residual variable types and the residual elements on a data display model, simulating and demonstrating the residual variable data future change trend line under the residual variable types through the meditation function, simulating and demonstrating the residual variable data future change trend line under the residual elements through the guide function, calculating the real digital interval between the residual variable data future change trend line and the residual variable data future change trend line, and establishing a threshold range between the residual variable data future change trend line and the residual variable data future change trend line;
s4, carding the types of the residual variables and the residual elements according to the construction standard of the standardized database in the Internet, selecting residual data information which has the coincidence degree of more than 85-95% with the construction standard, removing the data information which does not reach the coincidence standard by adopting a data circulation strategy mode, automatically repairing the residual data information, supplementing the normalized data characteristics under the current construction standard, and establishing a normal operation working frame under the current enterprise standardized database by combining with a data source;
s5, transmitting the established normal operation work frame to an enterprise work model framework, intermittently transmitting normal operation instructions, observing whether abnormal operation data of the enterprise occur, establishing a data acquisition port in the enterprise work model framework by editing work logic under the normal operation work frame, acquiring and monitoring daily operation data in real time by a remote terminal acquisition mode, generating acquisition points and monitoring points, and analyzing a final index library by data coding;
generating an abnormal data generating source according to the real digital interval in the threshold range, comparing the abnormal data generating source with the data characteristic a of the standardized database according to the occurrence times of the abnormal data generating source to obtain an abnormal data cleaning instruction, and cleaning the abnormal data at fixed time;
after the data source is generated, searching data information under the current data characteristics through the distributed network probe, recording all searching paths of the network probe, and establishing basic logic under the current data information running track according to all the obtained searching paths;
after the normalized data features under the current construction standard are complemented, quality inspection and normalized data training are also carried out on the normalized data features, and a normalized data training set is obtained.
2. The method for building and applying the standardized database of the enterprises in the Xinjiang area according to claim 1, wherein the index library is used for forming a final index library after desensitizing and amplifying signals of daily operation data by collecting and monitoring daily operation data and creating a plurality of service terminals and service interfaces under the current daily operation data before analysis.
3. The method for constructing and applying the standardized database of the enterprises in the Xinjiang area according to claim 1, wherein after the acquisition points and the monitoring points are generated, a plurality of sets of characteristic information under the current data acquisition port are automatically carded out, and after the plurality of sets of characteristic information are rearranged and classified according to types and storage sizes, independent transmission channels under the plurality of sets of characteristic information are established.
4. The method for building and applying the standardized database of the enterprises in the Xinjiang area according to claim 1, wherein after the analysis of the index database, an index column under daily operation data is generated in the index database, a data trigger button is built according to corresponding data information in the index column, after an instruction is generated by the data trigger button, repeated data in the index database are automatically selected, additional parameters are generated, and formatting processing is carried out on the additional parameters.
5. The method for building and applying the standardized database of the enterprises in the Xinjiang area according to claim 1, wherein the normal operation work framework is created based on a plurality of data elements under a data source, wherein the data elements comprise a data length, a data name, a data type, a data format, a data starting position and a data terminal position.
CN202210649087.XA 2022-06-09 2022-06-09 A Construction and Application Method of Standardized Database for Enterprises in Xinjiang Region Active CN115422146B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210649087.XA CN115422146B (en) 2022-06-09 2022-06-09 A Construction and Application Method of Standardized Database for Enterprises in Xinjiang Region

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210649087.XA CN115422146B (en) 2022-06-09 2022-06-09 A Construction and Application Method of Standardized Database for Enterprises in Xinjiang Region

Publications (2)

Publication Number Publication Date
CN115422146A CN115422146A (en) 2022-12-02
CN115422146B true CN115422146B (en) 2023-05-16

Family

ID=84197401

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210649087.XA Active CN115422146B (en) 2022-06-09 2022-06-09 A Construction and Application Method of Standardized Database for Enterprises in Xinjiang Region

Country Status (1)

Country Link
CN (1) CN115422146B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120067093B (en) * 2025-04-29 2025-08-15 四川创力科技有限责任公司 Enterprise internal control digital management method based on AI large model

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114298412A (en) * 2021-12-28 2022-04-08 问策师信息科技南京有限公司 A standardized operation method of enterprise security based on artificial intelligence and big data

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2602309C (en) * 2007-09-13 2015-10-13 Semiconductor Insights, Inc. A method of bibliographic field normalization
US20170034200A1 (en) * 2015-07-30 2017-02-02 Federal Reserve Bank Of Atlanta Flaw Remediation Management
CN107526820A (en) * 2017-08-29 2017-12-29 广东省技术经济研究发展中心 A kind of more storehouse enterprise innovation monitoring big data normal data base construction methods of multi-source
CN111444293B (en) * 2020-04-17 2023-03-14 重庆市勘测院 Intelligent report generation method for multi-source heterogeneous safety monitoring data
CN113641659B (en) * 2021-08-30 2024-10-29 深圳平安医疗健康科技服务有限公司 Medical characteristic database construction method, device, equipment and storage medium

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114298412A (en) * 2021-12-28 2022-04-08 问策师信息科技南京有限公司 A standardized operation method of enterprise security based on artificial intelligence and big data

Also Published As

Publication number Publication date
CN115422146A (en) 2022-12-02

Similar Documents

Publication Publication Date Title
CN113159339B (en) One-area one-index line loss management method and system based on big data
CN109359950B (en) A method for whole-process management and control of power grid monitoring information
CN106154209A (en) Electrical energy meter fault Forecasting Methodology based on decision Tree algorithms
CN113420162A (en) Equipment operation chain state monitoring method based on knowledge graph
CN118863057A (en) A low voltage management decision-making aid method integrating large language model and knowledge graph
CN112270619A (en) Intelligent lean overhaul management system for power transformation equipment based on big data
CN111046519A (en) Application analysis method of artificial intelligence technology in electric energy meter error diagnosis
CN115422146B (en) A Construction and Application Method of Standardized Database for Enterprises in Xinjiang Region
CN112485593B (en) An intelligent diagnosis method for distribution network problems based on big data
CN115603464A (en) Intelligent power grid operation ticket generation management system based on digital twinning
CN117614141B (en) A multi-voltage level coordinated management method for distribution network
CN108335231A (en) A kind of power distribution network data diagnosis method of Auto-matching
CN118644209A (en) Method and system for automatically generating power outage plans based on multi-dimensional big data fusion
CN117094688B (en) Digital control method and system for power supply station
CN118536936A (en) Engineering technology consultation service system
CN112269822A (en) High-voltage power equipment management system
CN111049157B (en) Distribution network transformer reactive compensation condition analysis method
CN113872200B (en) Medium-voltage distribution network power failure event diagnosis and identification method, system and storage medium
CN104200338A (en) Line loss statistics and decision analysis system
CN110852606B (en) An object-oriented analysis method for production early report data based on control cloud
CN114386787A (en) Multi-layer coordination comprehensive evaluation system and method for power grid planning
CN114049082A (en) Power grid technical improvement overhaul cost data structuring method
CN107194529A (en) A kind of distribution network reliability analysis method of economic benefit based on Text Mining Technology
CN112700127A (en) Method for implementing enterprise safety production operation management and control system
CN113034304A (en) Panoramic analysis system and method based on indexes of power distribution network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant