[go: up one dir, main page]

CN115729998A - Large-scale processing and analyzing system for arbitrary data hybrid optimization - Google Patents

Large-scale processing and analyzing system for arbitrary data hybrid optimization Download PDF

Info

Publication number
CN115729998A
CN115729998A CN202211478698.9A CN202211478698A CN115729998A CN 115729998 A CN115729998 A CN 115729998A CN 202211478698 A CN202211478698 A CN 202211478698A CN 115729998 A CN115729998 A CN 115729998A
Authority
CN
China
Prior art keywords
data
module
parameter field
sub
interpretation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211478698.9A
Other languages
Chinese (zh)
Inventor
史普力
张林林
周训游
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Testor Technology Co ltd
Original Assignee
Beijing Testor Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Testor Technology Co ltd filed Critical Beijing Testor Technology Co ltd
Priority to CN202211478698.9A priority Critical patent/CN115729998A/en
Publication of CN115729998A publication Critical patent/CN115729998A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Computer And Data Communications (AREA)

Abstract

The invention discloses a large-scale processing and analyzing system for arbitrary data mixing optimization, and relates to the field of data large-scale processing and analyzing systems. The method comprises a data acquisition module, a data conversion module, a data processing module, a learning module, a data management module and a data interpretation analysis module, wherein the parameter field of a source file is determined to be in a first data format and the parameter field of a target file is in a second data format by setting the corresponding relation between the parameter field of the source file and the parameter field of the target file, a format conversion protocol corresponding to each first data format is generated according to the similarity between the parameter field of the source file and the parameter field of the target file, each format conversion protocol is issued to the corresponding data conversion module, the data of the parameter field of the source file is matched to the parameter field corresponding to the target file according to the format conversion protocol, the data integration difficulty of the system facing multi-source heterogeneous mass data is reduced, and the analysis efficiency is improved.

Description

一种任意数据混合优化的大规模处理分析系统A Large-Scale Processing and Analysis System for Arbitrary Data Mixing Optimization

技术领域technical field

本发明涉及数据大规模处理分析系统领域,特别是涉及一种任意数据混合优化的大规模处理分析系统。The invention relates to the field of large-scale data processing and analysis systems, in particular to a large-scale processing and analysis system for arbitrary data mixing optimization.

背景技术Background technique

数据是事实或观察的结果,是对客观事物的逻辑归纳,是用于表示客观事物的未经加工的原始素材。数据可以是连续的值,比如声音、图像,称为模拟数据;也可以是离散的,如符号、文字,称为数字数据;在计算机系统中,数据以二进制信息单元0、1的形式表示。Data is the result of facts or observations, a logical induction of objective things, and unprocessed raw materials used to represent objective things. Data can be continuous values, such as sounds and images, which are called analog data; they can also be discrete, such as symbols and text, which are called digital data; in computer systems, data are represented in the form of binary information units 0 and 1.

在公开号“CN108427709B”公开的“一种多源海量数据处理系统及方法”,所述系统包括计算模块和任务管理模块,其中:所述计算模块用于接收多源海量数据,并调取数据接收服务解析所述多源海量数据;所述多源海量数据是根据预先在所述任务管理模块中配置的任务所产生的开源数据;所述计算模块若接收到外界选择的预设模型的确认动作,将解析后的多源海量数据输入所述预设模型,以供分析所述预设模型的输出结果。所述方法使用所述系统。本发明实施例提供的多源海量数据处理系统及方法,通过调取数据接收服务解析多源海量数据,将解析后的多源海量数据输入预设模型,并根据预设模型的输出结果分析多源海量数据,不仅使企业系统高效兼容多源海量数据,还能够有效利用该多源海量数据进行数据分析。In the "multi-source mass data processing system and method" disclosed in the publication number "CN108427709B", the system includes a calculation module and a task management module, wherein: the calculation module is used to receive multi-source mass data and retrieve data The receiving service parses the multi-source massive data; the multi-source massive data is open-source data generated according to tasks pre-configured in the task management module; if the calculation module receives the confirmation of the preset model selected by the outside An action of inputting the analyzed multi-source mass data into the preset model for analyzing the output result of the preset model. The method uses the system. The multi-source mass data processing system and method provided by the embodiments of the present invention analyze the multi-source mass data by calling the data receiving service, input the parsed multi-source mass data into the preset model, and analyze the multi-source mass data according to the output result of the preset model. The source of massive data not only makes the enterprise system efficiently compatible with multi-source massive data, but also can effectively use the multi-source massive data for data analysis.

随着大数据技术的发展,数据的来源也越来越广泛,现有的处理分析系统在面对多源异构的海量数据时往往存在数据整合困难,分析效率低;并且不同的业务场景需要基于业务进行编码的工作,当业务发生轻微变化时需要进行相应的需求评审、设计、开发、上线、部署等一系列的操作,效率低下,且过程繁琐。With the development of big data technology, data sources are becoming more and more extensive. Existing processing and analysis systems often have difficulties in data integration and low analysis efficiency when faced with multi-source heterogeneous massive data; and different business scenarios require Coding based on the business requires a series of operations such as requirements review, design, development, launch, and deployment when there is a slight change in the business, which is inefficient and cumbersome.

发明内容Contents of the invention

本发明的目的在于提供一种任意数据混合优化的大规模处理分析系统,解决现有的处理分析系统在面对多源异构的海量数据时往往存在数据整合困难,分析效率低;并且不同的业务场景需要基于业务进行编码的工作,当业务发生轻微变化时需要进行相应的需求评审、设计、开发、上线、部署等一系列的操作,效率低下,且过程繁琐的问题:The purpose of the present invention is to provide a large-scale processing and analysis system for arbitrary data mixing and optimization, which solves the problem of data integration difficulties and low analysis efficiency in existing processing and analysis systems when faced with multi-source heterogeneous massive data; and different The business scenario requires coding based on the business. When the business changes slightly, a series of operations such as requirements review, design, development, launch, and deployment need to be carried out, which is inefficient and cumbersome:

本发明为一种任意数据混合优化的大规模处理分析系统,包括数据采集模块、数据转换模块、数据处理模块、学习模块、数据管理模块以及数据判读分析模块;The present invention is a large-scale processing and analysis system for arbitrary data mixing and optimization, including a data acquisition module, a data conversion module, a data processing module, a learning module, a data management module and a data interpretation and analysis module;

所述数据采集模块用于通过利用大数据平台的计算能力,采用分布式的方式并行执行采集任务;The data acquisition module is used to execute acquisition tasks in parallel in a distributed manner by utilizing the computing power of the big data platform;

所述数据转换模块用于将海量异构数据转换为同构数据,并传输给数据处理模块;The data conversion module is used to convert massive heterogeneous data into isomorphic data and transmit it to the data processing module;

所述数据处理模块用于对数据包进行解析,按照用户配置的参数信息处理出结果,并发送至数据管理模块;The data processing module is used to analyze the data packet, process the result according to the parameter information configured by the user, and send it to the data management module;

所述数据管理模块用于接收并存储数据处理模块发送的原始数据,按试验项目建立存储结构,完成数据实时存储;实时接收数据处理模块通过网络发布的数据处理结果,并实时显示;The data management module is used to receive and store the original data sent by the data processing module, establish a storage structure according to the test items, and complete the real-time data storage; receive the data processing results released by the data processing module through the network in real time, and display them in real time;

所述学习模块用于将现有的算法模型整合,形成算法模型数据库,基于海量的样本数据训练不断优化模型正确率;The learning module is used to integrate existing algorithm models to form an algorithm model database, and continuously optimize the accuracy of the model based on massive sample data training;

其中,算法模型数据库的底层融入开源技术组件,如Impala、YARN、Spark、Hbase、HDFS、Hive、Kafka、Flink、ElasticSearch、ZooKeeper等。针对不同应用领域,以插件的方式对功能组件进行扩展,快速响应特定的计算需求;Among them, the bottom layer of the algorithm model database is integrated with open source technology components, such as Impala, YARN, Spark, Hbase, HDFS, Hive, Kafka, Flink, ElasticSearch, ZooKeeper, etc. For different application fields, the functional components are extended in the form of plug-ins to quickly respond to specific computing needs;

且支持PB量级复杂的查询和分析,单集群部署规模超过1000个节点。并提供多种格式的数据文件高效转换及自定义格式的解析服务加载,支持数据与应用分离管理、应用无缝的数据平移。提供算法模型仓库,基于海量的样本数据训练不断优化模型正确率。And it supports PB-level complex query and analysis, and the single-cluster deployment scale exceeds 1,000 nodes. It also provides efficient conversion of data files in various formats and analysis service loading of custom formats, supports separate management of data and applications, and seamless data translation of applications. Provide an algorithm model warehouse, and continuously optimize the model accuracy rate based on massive sample data training.

所述数据判读分析模块用于根据不同的测试状态和测试流程,自动完成遥测参数的判读工作;The data interpretation and analysis module is used to automatically complete the interpretation of telemetry parameters according to different test states and test procedures;

所述数据采集模块与数据判读分析模块之间设置有能够相互传输数据的传输子模块,且数据采集模块包括字段子模块,所述数据采集模块采集源文件数据,并分析所述源文件的参数字段,提取每一参数字段的数据;所述字段子模块用于设置源文件的参数字段与目标文件的参数字段的对应关系,确定源文件的参数字段为第一数据格式,目标文件的参数字段为第二数据格式,所述字段子模块根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,并将每个格式转换协议下发至对应数据转换模块,根据格式转换协议,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段的步骤包括将一个或多个所述源文件的参数字段的数据进行计算和判断后,生成匹配至所述目标文件对应的参数字段的数据,并对数据进行整合为数据包;A transmission sub-module capable of transmitting data to each other is set between the data acquisition module and the data interpretation and analysis module, and the data acquisition module includes a field sub-module, the data acquisition module collects source file data, and analyzes the parameters of the source file field, to extract the data of each parameter field; the field submodule is used to set the corresponding relationship between the parameter field of the source file and the parameter field of the target file, and determine that the parameter field of the source file is the first data format, and the parameter field of the target file For the second data format, the field submodule generates a format conversion protocol corresponding to each first data format according to the similarity between the parameter field of the source file and the parameter field of the target file, and sends each format conversion protocol To the corresponding data conversion module, according to the format conversion protocol, match the data of the parameter field of the source file to the parameter field corresponding to the target file, and match the data of the parameter field of the source file to the corresponding parameter field of the target file The step of the parameter field includes calculating and judging the data of one or more parameter fields of the source file, generating data matched to the corresponding parameter field of the target file, and integrating the data into a data package;

通过设置数据采集模块、数据转换模块、数据处理模块、学习模块、数据管理模块以及数据判读分析模块,并在数据采集模块中设置字段子模块,通过设置源文件的参数字段与目标文件的参数字段的对应关系,确定源文件的参数字段为第一数据格式,目标文件的参数字段为第二数据格式,根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,并将每个格式转换协议下发至对应数据转换模块,根据格式转换协议,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段的步骤包括将一个或多个所述源文件的参数字段的数据进行计算和判断后,生成匹配至所述目标文件对应的参数字段的数据,并对数据进行整合为数据包,降低系统面对多源异构的海量数据时数据整合难度,提高分析效率;By setting the data acquisition module, data conversion module, data processing module, learning module, data management module and data interpretation and analysis module, and setting the field sub-module in the data acquisition module, by setting the parameter field of the source file and the parameter field of the target file The corresponding relationship, determine the parameter field of the source file is the first data format, the parameter field of the target file is the second data format, generate each first data according to the similarity between the parameter field of the source file and the parameter field of the target file The format conversion protocol corresponding to the format, and each format conversion protocol is sent to the corresponding data conversion module, according to the format conversion protocol, the data in the parameter field of the source file is matched to the parameter field corresponding to the target file, and the The step of matching the data of the parameter field of the source file to the corresponding parameter field of the target file includes calculating and judging the data of one or more parameter fields of the source file, and generating a corresponding Data in the parameter field, and integrate the data into data packets, reducing the difficulty of data integration when the system faces multi-source heterogeneous massive data, and improving analysis efficiency;

并且,当业务发生轻微变化时,由于数据的转换是根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,数据之间的相似度变化较小,所以需求评审、设计、开发、上线、部署等一系列的操作也变化较小,效率较高,过程较为方便,大大减小了耗时,使用者的使用更为灵活、方便。And, when the business changes slightly, since the conversion of data is based on the similarity between the parameter fields of the source file and the parameter fields of the target file to generate a format conversion protocol corresponding to each first data format, the similarity between the data The changes are small, so a series of operations such as requirements review, design, development, launch, and deployment are also small, with high efficiency and convenient processes, greatly reducing time-consuming, and making users more flexible and convenient to use.

其中,系统的工作步骤如下:Among them, the working steps of the system are as follows:

S1:数据采集模块采集源文件数据,并分析所述源文件的参数字段,提取每一参数字段的数据;所述字段子模块用于设置源文件的参数字段与目标文件的参数字段的对应关系,确定源文件的参数字段为第一数据格式,目标文件的参数字段为第二数据格式;S1: the data acquisition module collects the source file data, and analyzes the parameter fields of the source file, and extracts the data of each parameter field; the field sub-module is used to set the corresponding relationship between the parameter fields of the source file and the parameter fields of the target file , determining that the parameter field of the source file is the first data format, and the parameter field of the target file is the second data format;

S2:根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,并将每个格式转换协议下发至对应数据转换模块;S2: Generate a format conversion protocol corresponding to each first data format according to the similarity between the parameter field of the source file and the parameter field of the target file, and deliver each format conversion protocol to the corresponding data conversion module;

S3:根据格式转换协议,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段的步骤包括将一个或多个所述源文件的参数字段的数据进行计算和判断后,生成匹配至所述目标文件对应的参数字段的数据,并对数据进行整合为数据包;S3: According to the format conversion protocol, the step of matching the data of the parameter field of the source file to the parameter field corresponding to the target file, and matching the data of the parameter field of the source file to the parameter field corresponding to the target file After calculating and judging the data of one or more parameter fields of the source file, generating data matched to the parameter field corresponding to the target file, and integrating the data into a data package;

S4:接收到数据后,数据处理模块根据配置的参数处理信息完成数据处理,并将处理结果转发至数据管理软件;S4: After receiving the data, the data processing module completes the data processing according to the configured parameter processing information, and forwards the processing results to the data management software;

S5:数据管理模块根据试验项目建立存储结构,实时接收到数据处理与发布软件发送的处理结果后,对数据进行存储;用户可通过数据管理软件的实时监测模块对测试过程中的数据进行实时监测;S5: The data management module establishes a storage structure according to the test items, and stores the data after receiving the processing results sent by the data processing and publishing software in real time; the user can monitor the data in the test process in real time through the real-time monitoring module of the data management software ;

S6:试验完成后,数据判读分析模块读取数据管理模块存储在硬盘上的数据文件,并调用判据对数据进行自动判读,并生成报告;软件也可调用之前存储的历次试验数据,对数据进行不同试验的横向比对。S6: After the test is completed, the data interpretation and analysis module reads the data files stored in the hard disk by the data management module, and invokes the criteria to automatically interpret the data and generate a report; the software can also call the previously stored test data to analyze the data A horizontal comparison of different experiments was carried out.

其中,系统的数据交互涉及了从数据的生成到数据的计算处理、分发及数据的存储、调用的整个过程。整个数据传输的过程中,涉及了数据与各业务单元及上级之间的上传,需根据各级对数据的需求不同,进行系统之前的对接及数据的传输,预留各类数据接口是数据整个生命周期的必要环节。Among them, the data interaction of the system involves the whole process from data generation to data calculation, distribution, data storage and call. The entire process of data transmission involves the uploading of data with various business units and superiors. According to the different requirements for data at all levels, the connection before the system and data transmission should be carried out. Various data interfaces are reserved for the entire data. necessary part of the life cycle.

优选地,所述数据采集模块还包括微处理器单元、发送接口控制子模块以及接收接口控制子模块;所述微处理器单元,用于根据使用需求实现接口控制;所述发送接口控制子模块,用于实现基于总线协议的数据包发送;所述接收接口控制子模块,用于实现基于总线协议的数据包接收。Preferably, the data acquisition module further includes a microprocessor unit, a sending interface control submodule and a receiving interface control submodule; the microprocessor unit is used to implement interface control according to usage requirements; the sending interface control submodule is used to realize data packet sending based on bus protocol; the receiving interface control submodule is used to realize data packet reception based on bus protocol.

优选地,所述数据处理模块包括总线数据处理子模块、辅助判读子模块和实时数据处理子模块;所述总线数据处理子模块用于实现基于总线协议进行数据解析;所述辅助判读子模块用于辅助数据判读分析模块对数据进行判读分析;所述实时数据处理子模块用于进行实时数据的解析。Preferably, the data processing module includes a bus data processing sub-module, an auxiliary interpretation sub-module and a real-time data processing sub-module; the bus data processing sub-module is used to implement data analysis based on the bus protocol; the auxiliary interpretation sub-module uses Interpret and analyze the data in the auxiliary data interpretation and analysis module; the real-time data processing sub-module is used to analyze the real-time data.

优选地,所述数据管理模块包括数据库管理子模块、实时检测参数配置子模块、数据存储和发布子模块以及数据库配置导入子模块;所述数据管理模块中建立实时数据库,所述实时数据库用于存储数据处理模块传递来的实时数据;所述实时检测参数配置子模块用于参数信息的装订导入,对参数信息进行正确性校验;所述数据存储和发布子模块用于通过网络发布数据处理结果,并实时显示;所述数据库配置导入子模块用于对历次试验数据进行迁移和备份,支持相同格式单元测试数据的导入。Preferably, the data management module includes a database management submodule, a real-time detection parameter configuration submodule, a data storage and publishing submodule, and a database configuration import submodule; a real-time database is established in the data management module, and the real-time database is used for The real-time data delivered by the storage data processing module; the real-time detection parameter configuration sub-module is used for binding and importing parameter information, and correctness verification of the parameter information; the data storage and publishing sub-module is used for publishing data processing through the network The results are displayed in real time; the database configuration import sub-module is used for migrating and backing up previous test data, and supports the import of unit test data in the same format.

优选地,所述数据判读分析模块包括数据自动判读子模块和数据比对分析子模块;所述数据自动判读子模块用于根据不同的测试状态和测试流程,自动完成遥测参数的判读工作;所述数据比对分析子模块用于不同试验数据的横向比对,存储不同任务、不同状态下的判据,对判据进行创建、编辑、删除和复制。Preferably, the data interpretation and analysis module includes an automatic data interpretation sub-module and a data comparison analysis sub-module; the automatic data interpretation sub-module is used to automatically complete the interpretation of telemetry parameters according to different test states and test procedures; The data comparison and analysis sub-module is used for horizontal comparison of different test data, storage of criteria under different tasks and different states, and creation, editing, deletion and copying of criteria.

优选地,所述数据采集模块包括自动采集子模块,用于自动接收外部所传输的所述源文件的数据。Preferably, the data collection module includes an automatic collection sub-module for automatically receiving the data of the source file transmitted from outside.

优选地,所述数据判读分析模块还包括用户子模块以及报告子模块;所述用户子模块用于用户、角色和权限的分级数据、判据分级管理;所述报告子模块用于判读结果报告自动生成,并基于网络完成签署、确认。Preferably, the data interpretation and analysis module further includes a user sub-module and a report sub-module; the user sub-module is used for hierarchical data and criterion hierarchical management of users, roles and permissions; the report sub-module is used for interpretation result reporting Automatically generated, signed and confirmed based on the network.

优选地,所述数据处理模块能够进行动态编译、打包以及动态调度,实现热部署能力;Preferably, the data processing module can perform dynamic compilation, packaging and dynamic scheduling to achieve hot deployment capabilities;

其中,通过灵活配置数据处理逻辑及数据处理流程及热部署的方式节省了大量研发资源及繁琐的开发流程。Among them, a large amount of R&D resources and tedious development process are saved through flexible configuration of data processing logic and data processing flow and hot deployment.

本发明具有以下有益效果:The present invention has the following beneficial effects:

1、本发明通过设置数据采集模块、数据转换模块、数据处理模块、学习模块、数据管理模块以及数据判读分析模块,并在数据采集模块中设置字段子模块,通过设置源文件的参数字段与目标文件的参数字段的对应关系,确定源文件的参数字段为第一数据格式,目标文件的参数字段为第二数据格式,根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,并将每个格式转换协议下发至对应数据转换模块,根据格式转换协议,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段的步骤包括将一个或多个所述源文件的参数字段的数据进行计算和判断后,生成匹配至所述目标文件对应的参数字段的数据,并对数据进行整合为数据包,降低系统面对多源异构的海量数据时数据整合难度,提高分析效率。1. The present invention sets the data acquisition module, data conversion module, data processing module, learning module, data management module and data interpretation and analysis module, and sets the field sub-module in the data acquisition module, by setting the parameter field and target of the source file The corresponding relationship of the parameter field of the file, determine the parameter field of the source file is the first data format, the parameter field of the target file is the second data format, generate each parameter field according to the similarity between the parameter field of the source file and the target file A format conversion protocol corresponding to the first data format, and each format conversion protocol is sent to the corresponding data conversion module, and according to the format conversion protocol, the data in the parameter field of the source file is matched to the parameter corresponding to the target file field, the step of matching the data of the parameter field of the source file to the corresponding parameter field of the target file includes calculating and judging the data of one or more parameter fields of the source file, and generating the data matched to the The data in the parameter field corresponding to the target file is integrated into a data package, which reduces the difficulty of data integration when the system faces multi-source heterogeneous massive data and improves analysis efficiency.

2、本发明在当业务发生轻微变化时,由于数据的转换是根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,数据之间的相似度变化较小,所以需求评审、设计、开发、上线、部署等一系列的操作也变化较小,效率较高,过程较为方便,大大减小了耗时,使用者的使用更为灵活、方便。2. The present invention generates a format conversion protocol corresponding to each first data format according to the similarity between the parameter field of the source file and the parameter field of the target file when the business changes slightly, and the data conversion The similarity changes are small, so a series of operations such as requirements review, design, development, launch, and deployment also change little, with high efficiency, more convenient process, greatly reduced time-consuming, and more flexible use of users ,convenient.

当然,实施本发明的任一产品并不一定需要同时达到以上所述的所有优点。Of course, any product implementing the present invention does not necessarily need to achieve all the above-mentioned advantages at the same time.

附图说明Description of drawings

为了更清楚地说明本发明实施例的技术方案,下面将对实施例描述所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the following will briefly introduce the accompanying drawings that are required for the description of the embodiments. Obviously, the accompanying drawings in the following description are only some embodiments of the present invention. Those of ordinary skill in the art can also obtain other drawings based on these drawings without any creative effort.

图1为本发明一种任意数据混合优化的大规模处理分析系统的系统框图;Fig. 1 is a system block diagram of a large-scale processing analysis system optimized for arbitrary data mixing in the present invention;

图2为本发明一种任意数据混合优化的大规模处理分析系统的工作流程图。Fig. 2 is a work flow diagram of a large-scale processing and analysis system optimized for arbitrary data mixing in the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其它实施例,都属于本发明保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

在本发明的描述中,需要理解的是,术语“上”、“中”、“外”、“内”等指示方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的组件或元件必须具有特定的方位,以特定的方位构造和操作,因此不能理解为对本发明的限制。In the description of the present invention, it should be understood that the terms "upper", "middle", "outer", "inner" and the like indicate orientation or positional relationship, and are only for the convenience of describing the present invention and simplifying the description, rather than indicating or It should not be construed as limiting the invention by implying that a referenced component or element must have a particular orientation, be constructed and operate in a particular orientation.

如图1-2所示,本实施列一种任意数据混合优化的大规模处理分析系统,包括数据采集模块、数据转换模块、数据处理模块、学习模块、数据管理模块以及数据判读分析模块;As shown in Figure 1-2, this embodiment implements a large-scale processing and analysis system for arbitrary data mixing and optimization, including a data acquisition module, a data conversion module, a data processing module, a learning module, a data management module, and a data interpretation and analysis module;

数据采集模块用于通过利用大数据平台的计算能力,采用分布式的方式并行执行采集任务;The data acquisition module is used to execute acquisition tasks in parallel in a distributed manner by utilizing the computing power of the big data platform;

数据转换模块用于将海量异构数据转换为同构数据,并传输给数据处理模块;The data conversion module is used to convert massive heterogeneous data into isomorphic data and transmit it to the data processing module;

数据处理模块用于对数据包进行解析,按照用户配置的参数信息处理出结果,并发送至数据管理模块;The data processing module is used to analyze the data packet, process the result according to the parameter information configured by the user, and send it to the data management module;

数据管理模块用于接收并存储数据处理模块发送的原始数据,按试验项目建立存储结构,完成数据实时存储;实时接收数据处理模块通过网络发布的数据处理结果,并实时显示;The data management module is used to receive and store the original data sent by the data processing module, establish a storage structure according to the test items, and complete the real-time data storage; receive the data processing results released by the data processing module through the network in real time, and display them in real time;

学习模块用于将现有的算法模型整合,形成算法模型数据库,基于海量的样本数据训练不断优化模型正确率;The learning module is used to integrate the existing algorithm models to form an algorithm model database, and continuously optimize the accuracy of the model based on massive sample data training;

其中,算法模型数据库的底层融入开源技术组件,如Impala、YARN、Spark、Hbase、HDFS、Hive、Kafka、Flink、ElasticSearch、ZooKeeper等。针对不同应用领域,以插件的方式对功能组件进行扩展,快速响应特定的计算需求;Among them, the bottom layer of the algorithm model database is integrated with open source technology components, such as Impala, YARN, Spark, Hbase, HDFS, Hive, Kafka, Flink, ElasticSearch, ZooKeeper, etc. For different application fields, the functional components are extended in the form of plug-ins to quickly respond to specific computing needs;

且支持PB量级复杂的查询和分析,单集群部署规模超过1000个节点。并提供多种格式的数据文件高效转换及自定义格式的解析服务加载,支持数据与应用分离管理、应用无缝的数据平移。提供算法模型仓库,基于海量的样本数据训练不断优化模型正确率。And it supports PB-level complex query and analysis, and the single-cluster deployment scale exceeds 1,000 nodes. It also provides efficient conversion of data files in various formats and analysis service loading of custom formats, supports separate management of data and applications, and seamless data translation of applications. Provide an algorithm model warehouse, and continuously optimize the model accuracy rate based on massive sample data training.

数据判读分析模块用于根据不同的测试状态和测试流程,自动完成遥测参数的判读工作;The data interpretation and analysis module is used to automatically complete the interpretation of telemetry parameters according to different test states and test procedures;

数据采集模块与数据判读分析模块之间设置有能够相互传输数据的传输子模块,且数据采集模块包括字段子模块,数据采集模块采集源文件数据,并分析源文件的参数字段,提取每一参数字段的数据;字段子模块用于设置源文件的参数字段与目标文件的参数字段的对应关系,确定源文件的参数字段为第一数据格式,目标文件的参数字段为第二数据格式,根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,并将每个格式转换协议下发至对应数据转换模块,所述字段子模块根据格式转换协议,将源文件的参数字段的数据匹配至目标文件对应的参数字段,将源文件的参数字段的数据匹配至目标文件对应的参数字段的步骤包括将一个或多个源文件的参数字段的数据进行计算和判断后,生成匹配至目标文件对应的参数字段的数据,并对数据进行整合为数据包;A transmission sub-module capable of transmitting data to each other is set between the data acquisition module and the data interpretation and analysis module, and the data acquisition module includes a field sub-module, the data acquisition module collects the source file data, analyzes the parameter field of the source file, and extracts each parameter The data of the field; the field sub-module is used to set the corresponding relationship between the parameter field of the source file and the parameter field of the target file, determine that the parameter field of the source file is the first data format, and the parameter field of the target file is the second data format, according to the source file The similarity between the parameter field of the file and the parameter field of the target file generates a format conversion protocol corresponding to each first data format, and sends each format conversion protocol to the corresponding data conversion module, and the field sub-module is based on the format The conversion protocol is to match the data of the parameter field of the source file to the corresponding parameter field of the target file, and the step of matching the data of the parameter field of the source file to the corresponding parameter field of the target file includes converting the parameter field of one or more source files to After the data is calculated and judged, the data matching the parameter field corresponding to the target file is generated, and the data is integrated into a data package;

通过设置数据采集模块、数据转换模块、数据处理模块、学习模块、数据管理模块以及数据判读分析模块,并在数据采集模块中设置字段子模块,通过设置源文件的参数字段与目标文件的参数字段的对应关系,确定源文件的参数字段为第一数据格式,目标文件的参数字段为第二数据格式,根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,并将每个格式转换协议下发至对应数据转换模块,根据格式转换协议,将源文件的参数字段的数据匹配至目标文件对应的参数字段,将源文件的参数字段的数据匹配至目标文件对应的参数字段的步骤包括将一个或多个源文件的参数字段的数据进行计算和判断后,生成匹配至目标文件对应的参数字段的数据,并对数据进行整合为数据包,降低系统面对多源异构的海量数据时数据整合难度,提高分析效率;By setting the data acquisition module, data conversion module, data processing module, learning module, data management module and data interpretation and analysis module, and setting the field sub-module in the data acquisition module, by setting the parameter field of the source file and the parameter field of the target file The corresponding relationship, determine the parameter field of the source file is the first data format, the parameter field of the target file is the second data format, generate each first data according to the similarity between the parameter field of the source file and the parameter field of the target file The format conversion protocol corresponding to the format, and each format conversion protocol is sent to the corresponding data conversion module. According to the format conversion protocol, the data in the parameter field of the source file is matched to the parameter field corresponding to the target file, and the parameter field of the source file is The step of matching the data of the target file to the parameter field corresponding to the target file includes calculating and judging the data of the parameter field of one or more source files, generating the data matching the parameter field corresponding to the target file, and integrating the data into data package, which reduces the difficulty of data integration when the system is faced with multi-source heterogeneous massive data, and improves analysis efficiency;

并且,当业务发生轻微变化时,由于数据的转换是根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,数据之间的相似度变化较小,所以需求评审、设计、开发、上线、部署等一系列的操作也变化较小,效率较高,过程较为方便,大大减小了耗时,使用者的使用更为灵活、方便。And, when the business changes slightly, since the conversion of data is based on the similarity between the parameter fields of the source file and the parameter fields of the target file to generate a format conversion protocol corresponding to each first data format, the similarity between the data The changes are small, so a series of operations such as requirements review, design, development, launch, and deployment are also small, with high efficiency and convenient processes, greatly reducing time-consuming, and making users more flexible and convenient to use.

其中,系统的工作步骤如下:Among them, the working steps of the system are as follows:

S1:数据采集模块采集源文件数据,并分析源文件的参数字段,提取每一参数字段的数据;字段子模块用于设置源文件的参数字段与目标文件的参数字段的对应关系,确定源文件的参数字段为第一数据格式,目标文件的参数字段为第二数据格式;S1: The data acquisition module collects the source file data, analyzes the parameter fields of the source file, and extracts the data of each parameter field; the field sub-module is used to set the corresponding relationship between the parameter fields of the source file and the parameter fields of the target file, and determine the source file The parameter field of the target file is the first data format, and the parameter field of the target file is the second data format;

S2:根据源文件的参数字段与目标文件的参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,并将每个格式转换协议下发至对应数据转换模块;S2: Generate a format conversion protocol corresponding to each first data format according to the similarity between the parameter field of the source file and the parameter field of the target file, and deliver each format conversion protocol to the corresponding data conversion module;

S3:根据格式转换协议,将源文件的参数字段的数据匹配至目标文件对应的参数字段,将源文件的参数字段的数据匹配至目标文件对应的参数字段的步骤包括将一个或多个源文件的参数字段的数据进行计算和判断后,生成匹配至目标文件对应的参数字段的数据,并对数据进行整合为数据包;S3: According to the format conversion protocol, match the data of the parameter field of the source file to the corresponding parameter field of the target file, and the step of matching the data of the parameter field of the source file to the corresponding parameter field of the target file includes converting one or more source files to After calculating and judging the data in the parameter field of the target file, generate data that matches the parameter field corresponding to the target file, and integrate the data into a data package;

S4:接收到数据后,数据处理模块根据配置的参数处理信息完成数据处理,并将处理结果转发至数据管理软件;S4: After receiving the data, the data processing module completes the data processing according to the configured parameter processing information, and forwards the processing results to the data management software;

S5:数据管理模块根据试验项目建立存储结构,实时接收到数据处理与发布软件发送的处理结果后,对数据进行存储;用户可通过数据管理软件的实时监测模块对测试过程中的数据进行实时监测;S5: The data management module establishes a storage structure according to the test items, and stores the data after receiving the processing results sent by the data processing and publishing software in real time; the user can monitor the data in the test process in real time through the real-time monitoring module of the data management software ;

S6:试验完成后,数据判读分析模块读取数据管理模块存储在硬盘上的数据文件,并调用判据对数据进行自动判读,并生成报告;软件也可调用之前存储的历次试验数据,对数据进行不同试验的横向比对。S6: After the test is completed, the data interpretation and analysis module reads the data files stored in the hard disk by the data management module, and invokes the criteria to automatically interpret the data and generate a report; the software can also call the previously stored test data to analyze the data A horizontal comparison of different experiments was carried out.

其中,系统的数据交互涉及了从数据的生成到数据的计算处理、分发及数据的存储、调用的整个过程。整个数据传输的过程中,涉及了数据与各业务单元及上级之间的上传,需根据各级对数据的需求不同,进行系统之前的对接及数据的传输,预留各类数据接口是数据整个生命周期的必要环节。Among them, the data interaction of the system involves the whole process from data generation to data calculation, distribution, data storage and call. The entire process of data transmission involves the uploading of data with various business units and superiors. According to the different requirements for data at all levels, the connection before the system and data transmission should be carried out. Various data interfaces are reserved for the entire data. necessary part of the life cycle.

数据采集模块包括微处理器单元、发送接口控制子模块以及接收接口控制子模块;微处理器单元,用于根据使用需求实现接口控制;发送接口控制子模块,用于实现基于总线协议的数据包发送;接收接口控制子模块,用于实现基于总线协议的数据包接收。The data acquisition module includes a microprocessor unit, a sending interface control submodule and a receiving interface control submodule; the microprocessor unit is used to realize interface control according to usage requirements; the sending interface control submodule is used to realize the data packet based on the bus protocol The sending and receiving interface control submodule is used to realize the data packet reception based on the bus protocol.

数据处理模块包括总线数据处理子模块、辅助判读子模块和实时数据处理子模块;总线数据处理子模块用于实现基于总线协议进行数据解析;辅助判读子模块用于辅助数据判读分析模块对数据进行判读分析;实时数据处理子模块用于进行实时数据的解析。The data processing module includes a bus data processing sub-module, an auxiliary interpretation sub-module and a real-time data processing sub-module; the bus data processing sub-module is used to implement data analysis based on the bus protocol; the auxiliary interpretation sub-module is used to assist the data interpretation and analysis module to perform data analysis. Interpretation and analysis; the real-time data processing sub-module is used for real-time data analysis.

数据管理模块包括数据库管理子模块、实时检测参数配置子模块、数据存储和发布子模块以及数据库配置导入子模块;数据管理模块中建立实时数据库,实时数据库用于存储数据处理模块传递来的实时数据;实时检测参数配置子模块用于参数信息的装订导入,对参数信息进行正确性校验;数据存储和发布子模块用于通过网络发布数据处理结果,并实时显示;数据库配置导入子模块用于对历次试验数据进行迁移和备份,支持相同格式单元测试数据的导入。The data management module includes a database management sub-module, a real-time detection parameter configuration sub-module, a data storage and release sub-module, and a database configuration import sub-module; a real-time database is established in the data management module, and the real-time database is used to store real-time data delivered by the data processing module ; The real-time detection parameter configuration sub-module is used for the binding and import of parameter information, and the correctness of the parameter information is verified; the data storage and release sub-module is used for publishing the data processing results through the network and displaying them in real time; the database configuration import sub-module is used for Migrate and back up previous test data, and support the import of unit test data in the same format.

数据判读分析模块包括数据自动判读子模块和数据比对分析子模块;数据自动判读子模块用于根据不同的测试状态和测试流程,自动完成遥测参数的判读工作;数据比对分析子模块用于不同试验数据的横向比对,存储不同任务、不同状态下的判据,对判据进行创建、编辑、删除和复制。The data interpretation and analysis module includes the data automatic interpretation sub-module and the data comparison and analysis sub-module; the data automatic interpretation sub-module is used to automatically complete the interpretation of telemetry parameters according to different test states and test procedures; the data comparison and analysis sub-module is used for Horizontal comparison of different test data, storage of criteria under different tasks and different states, creation, editing, deletion and copying of criteria.

数据采集模块包括自动采集子模块,用于自动接收外部所传输的源文件的数据。The data collection module includes an automatic collection sub-module, which is used to automatically receive data from externally transmitted source files.

数据判读分析模块还包括用户子模块以及报告子模块;用户子模块用于用户、角色和权限的分级数据、判据分级管理;报告子模块用于判读结果报告自动生成,并基于网络完成签署、确认。The data interpretation and analysis module also includes a user sub-module and a report sub-module; the user sub-module is used for hierarchical data and criterion hierarchical management of users, roles and permissions; the report sub-module is used for automatic generation of interpretation result reports, and completes signing, confirm.

数据处理模块能够进行动态编译、打包以及动态调度,实现热部署能力;The data processing module can be dynamically compiled, packaged and dynamically scheduled to achieve hot deployment capabilities;

其中,通过灵活配置数据处理逻辑及数据处理流程及热部署的方式节省了大量研发资源及繁琐的开发流程。Among them, a large amount of R&D resources and tedious development process are saved through flexible configuration of data processing logic and data processing flow and hot deployment.

以上公开的本发明优选实施例只是用于帮助阐述本发明。优选实施例并没有详尽叙述所有的细节,也不限制该发明仅为所述的具体实施方式。显然,根据本说明书的内容,可作很多的修改和变化。本说明书选取并具体描述这些实施例,是为了更好地解释本发明的原理和实际应用,从而使所属技术领域技术人员能很好地理解和利用本发明。本发明仅受权利要求书及其全部范围和等效物的限制。The preferred embodiments of the invention disclosed above are only to help illustrate the invention. The preferred embodiments are not exhaustive in all detail, nor are the inventions limited to specific embodiments described. Obviously, many modifications and variations can be made based on the contents of this specification. This description selects and specifically describes these embodiments in order to better explain the principle and practical application of the present invention, so that those skilled in the art can well understand and utilize the present invention. The invention is to be limited only by the claims, along with their full scope and equivalents.

Claims (8)

1.一种任意数据混合优化的大规模处理分析系统,其特征在于,包括数据采集模块、数据转换模块、数据处理模块、学习模块、数据管理模块以及数据判读分析模块;1. A large-scale processing and analysis system for arbitrary data mixing optimization, characterized in that, comprising a data acquisition module, a data conversion module, a data processing module, a learning module, a data management module and a data interpretation and analysis module; 所述数据采集模块用于利用大数据平台的计算能力,采用分布式的方式并行执行采集任务;The data acquisition module is used to utilize the computing power of the big data platform to execute acquisition tasks in parallel in a distributed manner; 所述数据转换模块用于将海量异构数据转换为同构数据,并传输给数据处理模块;The data conversion module is used to convert massive heterogeneous data into isomorphic data and transmit it to the data processing module; 所述数据处理模块用于对数据包进行解析,按照用户配置的参数信息处理出结果,并发送至数据管理模块;The data processing module is used to analyze the data packet, process the result according to the parameter information configured by the user, and send it to the data management module; 所述数据管理模块用于接收并存储数据处理模块发送的原始数据,建立存储结构,完成数据实时存储;The data management module is used to receive and store the original data sent by the data processing module, establish a storage structure, and complete real-time data storage; 所述学习模块用于将现有的算法模型整合,形成算法模型数据库,基于海量的样本数据训练不断优化模型;The learning module is used to integrate existing algorithm models to form an algorithm model database, and continuously optimize the model based on massive sample data training; 所述数据判读分析模块用于根据不同的测试状态和测试流程,自动完成遥测参数的判读工作;The data interpretation and analysis module is used to automatically complete the interpretation of telemetry parameters according to different test states and test procedures; 所述数据采集模块与数据判读分析模块之间设置有能够相互传输数据的传输子模块,且数据采集模块包括字段子模块,所述数据采集模块采集源文件数据,并分析所述源文件的参数字段,提取每一参数字段的数据;所述字段子模块用于设置源文件参数字段与目标文件参数字段的对应关系,确定源文件的参数字段为第一数据格式,目标文件的参数字段为第二数据格式,所述字段子模块根据源文件参数字段与目标文件参数字段之间的相似度生成每个第一数据格式对应的格式转换协议,并将每个格式转换协议下发至对应数据转换模块,根据格式转换协议,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段,将所述源文件的参数字段的数据匹配至所述目标文件对应的参数字段的步骤包括将一个或多个所述源文件的参数字段的数据进行计算和判断后,生成匹配至所述目标文件对应的参数字段的数据,并对数据进行整合为数据包。A transmission sub-module capable of transmitting data to each other is set between the data acquisition module and the data interpretation and analysis module, and the data acquisition module includes a field sub-module, the data acquisition module collects source file data, and analyzes the parameters of the source file field, to extract the data of each parameter field; the field submodule is used to set the corresponding relationship between the source file parameter field and the target file parameter field, and determine that the parameter field of the source file is the first data format, and the parameter field of the target file is the first data format. Two data formats, the field submodule generates a format conversion protocol corresponding to each first data format according to the similarity between the source file parameter field and the target file parameter field, and sends each format conversion protocol to the corresponding data conversion A module, according to the format conversion protocol, matching the data of the parameter field of the source file to the parameter field corresponding to the target file, and matching the data of the parameter field of the source file to the parameter field corresponding to the target file After calculating and judging the data of one or more parameter fields of the source file, generating data matched to the corresponding parameter field of the target file, and integrating the data into a data package. 2.根据权利要求1所述的一种任意数据混合优化的大规模处理分析系统,其特征在于:所述数据采集模块还包括微处理器单元、发送接口控制子模块以及接收接口控制子模块;所述微处理器单元,用于根据使用需求实现接口控制;所述发送接口控制子模块,用于实现基于总线协议的数据包发送;所述接收接口控制子模块,用于实现基于总线协议的数据包接收。2. the large-scale processing analysis system of a kind of arbitrary data mixing optimization according to claim 1, is characterized in that: described data acquisition module also comprises microprocessor unit, sending interface control submodule and receiving interface control submodule; The microprocessor unit is used to realize interface control according to usage requirements; the sending interface control submodule is used to realize data packet transmission based on bus protocol; the receiving interface control submodule is used to realize data packet transmission based on bus protocol Packet received. 3.根据权利要求1所述的一种任意数据混合优化的大规模处理分析系统,其特征在于:所述数据处理模块包括总线数据处理子模块、辅助判读子模块和实时数据处理子模块;所述总线数据处理子模块用于实现基于总线协议进行数据解析;所述辅助判读子模块用于辅助数据判读分析模块对数据进行判读分析;所述实时数据处理子模块用于进行实时数据的解析。3. the large-scale processing analysis system of a kind of arbitrary data mixing optimization according to claim 1, it is characterized in that: described data processing module comprises bus data processing submodule, auxiliary interpretation submodule and real-time data processing submodule; The bus data processing sub-module is used to implement data analysis based on the bus protocol; the auxiliary interpretation sub-module is used to assist the data interpretation and analysis module to interpret and analyze data; the real-time data processing sub-module is used to analyze real-time data. 4.根据权利要求1所述的一种任意数据混合优化的大规模处理分析系统,其特征在于:所述数据管理模块包括数据库管理子模块、实时检测参数配置子模块、数据存储和发布子模块以及数据库配置导入子模块;所述数据管理模块中建立实时数据库,所述实时数据库用于存储数据处理模块传递来的实时数据;所述实时检测参数配置子模块用于参数信息的装订导入,对参数信息进行正确性校验;所述数据存储和发布子模块用于通过网络发布数据处理结果,并实时显示;所述数据库配置导入子模块用于对历次试验数据进行迁移和备份,支持相同格式单元测试数据的导入。4. A large-scale processing and analysis system for arbitrary data mixing optimization according to claim 1, wherein the data management module includes a database management submodule, a real-time detection parameter configuration submodule, and a data storage and release submodule And the database configuration import submodule; set up a real-time database in the data management module, the real-time database is used to store the real-time data delivered by the data processing module; the real-time detection parameter configuration submodule is used for the binding import of parameter information, for Verify the correctness of parameter information; the data storage and publishing sub-module is used to publish data processing results through the network and display them in real time; the database configuration import sub-module is used to migrate and backup previous test data, and supports the same format Import of unit test data. 5.根据权利要求1所述的一种任意数据混合优化的大规模处理分析系统,其特征在于:所述数据判读分析模块包括数据自动判读子模块和数据比对分析子模块;所述数据自动判读子模块用于根据不同的测试状态和测试流程,自动完成遥测参数的判读工作;所述数据比对分析子模块用于不同试验数据的横向比对,存储不同任务、不同状态下的判据,对判据进行创建、编辑、删除和复制。5. The large-scale processing and analysis system of a kind of arbitrary data mixing optimization according to claim 1, it is characterized in that: described data interpretation and analysis module comprises data automatic interpretation sub-module and data comparison analysis sub-module; The interpretation sub-module is used to automatically complete the interpretation of telemetry parameters according to different test states and test procedures; the data comparison and analysis sub-module is used for horizontal comparison of different test data and stores criteria under different tasks and states , to create, edit, delete and copy the criteria. 6.根据权利要求1所述的一种任意数据混合优化的大规模处理分析系统,其特征在于:所述数据采集模块包括自动采集子模块,用于自动接收外部所传输的所述源文件的数据。6. A kind of large-scale processing analysis system of arbitrary data mixing optimization according to claim 1, it is characterized in that: described data collection module comprises automatic collection sub-module, is used for automatically receiving the described source file of external transmission data. 7.根据权利要求1所述的一种任意数据混合优化的大规模处理分析系统,其特征在于:所述数据判读分析模块还包括用户子模块以及报告子模块;所述用户子模块用于用户、角色和权限的分级数据、判据分级管理;所述报告子模块用于判读结果报告自动生成,并基于网络完成签署、确认。7. A large-scale processing and analysis system for arbitrary data mixing optimization according to claim 1, characterized in that: the data interpretation and analysis module also includes a user sub-module and a report sub-module; the user sub-module is used for user , hierarchical data of roles and permissions, and hierarchical management of criteria; the report sub-module is used for automatic generation of interpretation result reports, and completes signing and confirmation based on the network. 8.根据权利要求1所述的一种任意数据混合优化的大规模处理分析系统,其特征在于:所述数据处理模块能够进行动态编译、打包以及动态调度,实现热部署能力。8. A large-scale processing and analysis system for arbitrary data mixing and optimization according to claim 1, characterized in that: said data processing module can perform dynamic compilation, packaging and dynamic scheduling to realize hot deployment capability.
CN202211478698.9A 2022-11-23 2022-11-23 Large-scale processing and analyzing system for arbitrary data hybrid optimization Pending CN115729998A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211478698.9A CN115729998A (en) 2022-11-23 2022-11-23 Large-scale processing and analyzing system for arbitrary data hybrid optimization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211478698.9A CN115729998A (en) 2022-11-23 2022-11-23 Large-scale processing and analyzing system for arbitrary data hybrid optimization

Publications (1)

Publication Number Publication Date
CN115729998A true CN115729998A (en) 2023-03-03

Family

ID=85297824

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211478698.9A Pending CN115729998A (en) 2022-11-23 2022-11-23 Large-scale processing and analyzing system for arbitrary data hybrid optimization

Country Status (1)

Country Link
CN (1) CN115729998A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105009078A (en) * 2013-02-12 2015-10-28 起元科技有限公司 Building applications for configuring processes
CN105786912A (en) * 2014-12-25 2016-07-20 远光软件股份有限公司 Data acquisition and transformation method and device
CN107463418A (en) * 2017-09-12 2017-12-12 北京宝兰德软件股份有限公司 The configuration file generation method and device of a kind of server middleware
CN114625371A (en) * 2022-02-18 2022-06-14 北京理工大学 Cross-operating-system multi-source fusion algorithm compiling method, compiler and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105009078A (en) * 2013-02-12 2015-10-28 起元科技有限公司 Building applications for configuring processes
CN105786912A (en) * 2014-12-25 2016-07-20 远光软件股份有限公司 Data acquisition and transformation method and device
CN107463418A (en) * 2017-09-12 2017-12-12 北京宝兰德软件股份有限公司 The configuration file generation method and device of a kind of server middleware
CN114625371A (en) * 2022-02-18 2022-06-14 北京理工大学 Cross-operating-system multi-source fusion algorithm compiling method, compiler and storage medium

Similar Documents

Publication Publication Date Title
CN105843182B (en) A kind of power scheduling accident prediction system and method based on OMS
CN109213754A (en) A kind of data processing system and data processing method
EP0483037A2 (en) Remote and batch processing in an object oriented programming system
CN108268529B (en) Data summarization method and system based on business abstraction and multi-engine scheduling
CN106022007A (en) Cloud platform system and method oriented to biological omics big data calculation
CN112307501B (en) Big data system based on block chain technology, storage method and using method
CN106651125A (en) Material distribution system and processing method thereof
CN110781180B (en) Data screening method and data screening device
CN117376346A (en) Equipment data processing method and device based on edge calculation and distributed calculation
CN115757587A (en) Heterogeneous data source integration method and device, electronic equipment and storage medium
CN107896242B (en) Service sharing method and device
CN114443293A (en) A system and method for deploying a big data platform
CN111026972B (en) Subscription data pushing method, device, equipment and storage medium in Internet of things
CN118838551A (en) Lake storehouse chain integrated high-efficiency and reliable big data storage and analysis system
CN115729998A (en) Large-scale processing and analyzing system for arbitrary data hybrid optimization
CN112905720A (en) Operation data processing method and device based on source data management model
CN118051338A (en) A computing power activation method, device, electronic device and storage medium
CN111026432A (en) Big data processing platform, platform construction method and storage medium
CN117111894A (en) Method for converting data based on ETL module of low code development
CN104484230A (en) Multiple satellite data centre workflow scheduling algorithm on basis of near data calculation principle
CN115907270A (en) Process evolution system and method based on business activities and data
CN116701500A (en) Method and device for automatically generating traceability information in ETL, and electronic equipment
CN112596710B (en) Front-end system
CN114328584B (en) Service calling method, device and readable storage medium
CN116470968B (en) Ground test method and device for communication function of aerospace science system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20230303

RJ01 Rejection of invention patent application after publication