[go: up one dir, main page]

CN118171201A - A multidimensional data modeling and analysis method and system based on subspace sequence - Google Patents

A multidimensional data modeling and analysis method and system based on subspace sequence Download PDF

Info

Publication number
CN118171201A
CN118171201A CN202410273460.5A CN202410273460A CN118171201A CN 118171201 A CN118171201 A CN 118171201A CN 202410273460 A CN202410273460 A CN 202410273460A CN 118171201 A CN118171201 A CN 118171201A
Authority
CN
China
Prior art keywords
subspace
data
sequence
analysis
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202410273460.5A
Other languages
Chinese (zh)
Other versions
CN118171201B (en
Inventor
王娟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shengtai Renhe Intelligent Technology Shenzhen Co ltd
Original Assignee
Shengtai Renhe Intelligent Technology Shenzhen Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shengtai Renhe Intelligent Technology Shenzhen Co ltd filed Critical Shengtai Renhe Intelligent Technology Shenzhen Co ltd
Priority to CN202410273460.5A priority Critical patent/CN118171201B/en
Publication of CN118171201A publication Critical patent/CN118171201A/en
Application granted granted Critical
Publication of CN118171201B publication Critical patent/CN118171201B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/2431Multiple classes
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/092Reinforcement learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a multi-dimensional data modeling and analyzing method and system based on subspace sequences, which realize multi-angle analysis, dynamic evolution prediction and correlation effect analysis of a complex system through abstract representation and dynamic modeling of subspace sequences, and are innovative in that sequence dimensions are introduced, dependency relations among subspaces based on a graph model are constructed, and sequence dependence and topology constraint are combined; thus, the modeling accuracy can be enhanced, and the accuracy of system analysis is improved; meanwhile, the method does not need to rely on training data and historical parameters marked on a large scale, so that the method is very suitable for analysis of nonlinear complex systems; in addition, the topological relation among the nodes in the map is further mined through data mining, and the subspace association network map is updated according to the topological relation; based on the method, the deep analysis of the whole structure of the subspace network can be realized, and the modeling precision can be further increased, so that the accuracy of system analysis is improved.

Description

一种基于子空间序列的多维数据建模与分析方法及系统A multidimensional data modeling and analysis method and system based on subspace sequence

技术领域Technical Field

本发明属于面向复杂系统的多源异构数据建模与分析技术领域,具体涉及基于子空间序列的多维数据建模与分析方法及系统。The present invention belongs to the technical field of multi-source heterogeneous data modeling and analysis for complex systems, and specifically relates to a multidimensional data modeling and analysis method and system based on subspace sequences.

背景技术Background technique

目前,对复杂系统的分析,均是基于系统的建模与分析方法,例如基于向量自动回归(VAR)模型的时间序列预测、基于贝叶斯网络的概率图模型等,前述方法主要是基于统计建模思想,其依赖大量历史数据进行参数训练,对数据质量和数量较敏感,因此,这些方法多局限于线性建模,对非线性复杂系统的建模能力较弱,分析准确性较低;另外,像神经网络等非线性建模方法,则依赖于大规模标注的训练数据,无法很好的利用建模中已有的结构知识,如此,则会导致建模解释性较差;基于前述阐述,总体来说,现有技术对复杂动态系统的建模与分析仍存在明显缺陷,不能对复杂动态系统进行准确的分析;由此,如何提供一种能够适用于非线性复杂系统,且分析准确率高的多维数据建模与分析方法,已成为一个亟待解决的问题。At present, the analysis of complex systems is based on system modeling and analysis methods, such as time series prediction based on vector autoregression (VAR) model, probabilistic graphical model based on Bayesian network, etc. The above methods are mainly based on statistical modeling ideas, which rely on a large amount of historical data for parameter training and are sensitive to data quality and quantity. Therefore, these methods are mostly limited to linear modeling, and have weak modeling capabilities for nonlinear complex systems and low analysis accuracy. In addition, nonlinear modeling methods such as neural networks rely on large-scale annotated training data and cannot make good use of the existing structural knowledge in modeling, which will lead to poor modeling interpretability. Based on the above explanation, in general, the existing technology for modeling and analysis of complex dynamic systems still has obvious defects and cannot accurately analyze complex dynamic systems. Therefore, how to provide a multidimensional data modeling and analysis method that can be applied to nonlinear complex systems and has high analysis accuracy has become a problem that needs to be solved urgently.

发明内容Summary of the invention

本发明的目的是提供一种基于子空间序列的多维数据建模与分析方法及系统,用以解决现有技术无法对复杂动态系统进行准确的分析的问题。The purpose of the present invention is to provide a multidimensional data modeling and analysis method and system based on subspace sequences, so as to solve the problem that the prior art cannot accurately analyze complex dynamic systems.

为了实现上述目的,本发明采用以下技术方案:In order to achieve the above object, the present invention adopts the following technical solutions:

第一方面,提供了一种基于子空间序列的多维数据建模与分析方法,包括:In a first aspect, a multidimensional data modeling and analysis method based on subspace sequences is provided, comprising:

获取待分析目标的数据集合,其中,所述数据集合中包括有待分析目标的多个第一维度的数据,其中,所述待分析目标包括目标城市,且所述第一维度用于表征数据种类;Acquire a data set of a target to be analyzed, wherein the data set includes data of a plurality of first dimensions of the target to be analyzed, wherein the target to be analyzed includes a target city, and the first dimension is used to characterize a data type;

从所述数据集合中选取出若干指定第一维度的数据,并基于各个指定第一维度的数据,构建出每个指定第一维度的数据对应的子空间;Selecting a plurality of data of a specified first dimension from the data set, and constructing a subspace corresponding to each data of the specified first dimension based on each data of the specified first dimension;

构建出所有子空间共同的子空间序列,其中,所述子空间序列用于表征数据的逻辑顺序;Constructing a subspace sequence common to all subspaces, wherein the subspace sequence is used to represent the logical order of data;

基于所述子空间序列,对所述子空间序列与各个子空间进行对偶关系映射处理,以为各个子空间中的每个子空间实例建立序列编号,其中,任一子空间中的子空间实例用于表征该任一子空间中的一个数据;Based on the subspace sequence, a dual relationship mapping process is performed on the subspace sequence and each subspace to establish a sequence number for each subspace instance in each subspace, wherein the subspace instance in any subspace is used to represent a data in the any subspace;

利用建立了序列编号的各个子空间,构建出包含有所有子空间的子空间关联网络图,其中,所述子空间关联网络图表示为G={V,E,g},V为节点集合,E为节点边集合,g为图属性集合,所述节点集合中的各节点为各子空间中的子空间实例,且任一节点使用对应子空间实例的序列编号表示;By using each subspace with established serial numbers, a subspace association network graph containing all subspaces is constructed, wherein the subspace association network graph is represented by G={V,E,g}, V is a node set, E is a node edge set, g is a graph attribute set, each node in the node set is a subspace instance in each subspace, and any node is represented by a serial number corresponding to the subspace instance;

对所述子空间关联网络图进行数据挖掘处理,得到更新后的子空间关联网络图;Performing data mining processing on the subspace association network graph to obtain an updated subspace association network graph;

利用更新后的子空间关联网络图,得出所述待分析目标的分析结果,其中,所述分析结果包括所述目标城市的城市状态预测结果,且所述城市状态预测结果包括交通流量预测结果、空气质量预测结果和/或人流分布预测结果。The updated subspace association network diagram is used to obtain the analysis results of the target to be analyzed, wherein the analysis results include the city status prediction results of the target city, and the city status prediction results include traffic flow prediction results, air quality prediction results and/or crowd distribution prediction results.

基于上述公开的内容,本发明先获取待分析目标(如目标城市)的数据集合(即多维数据空间),然后,选择出指定第一维度的数据(即选择出关键种类的数据),来构建出对应的子空间;而后,引入能够反映数据逻辑顺序的子空间序列,来对各个子空间中的每个子空间实例进行编号,以利用子空间序列来追踪每个子空间状态的动态演化轨迹;在完成各子空间中实例的编号后,则可利用建立了序列编号的各个子空间,来构建出子空间关联网络图;如此,相当于构建了各个子空间的关系图谱,明确了各个子空间节点之间的依赖与约束关系;接着,对子空间关联网络图进行数据挖掘,以进一步的挖掘图谱中各节点间的拓扑关系,从而完善子空间关联网络图;最后,利用完善后的子空间关联网络图,即可完成对目标城市的状态分析,从而得出目标城市在不同状态时的交通流量、空气质量和/或人流分布等指标的预测结果。Based on the above disclosed content, the present invention first obtains a data set (i.e., a multidimensional data space) of a target to be analyzed (such as a target city), and then selects data of a specified first dimension (i.e., selects data of a key type) to construct a corresponding subspace; then, a subspace sequence that can reflect the logical order of the data is introduced to number each subspace instance in each subspace, so as to use the subspace sequence to track the dynamic evolution trajectory of each subspace state; after completing the numbering of the instances in each subspace, each subspace with a sequence number can be used to construct a subspace association network diagram; in this way, it is equivalent to constructing a relationship map of each subspace, and clarifying the dependency and constraint relationship between each subspace node; then, data mining is performed on the subspace association network diagram to further mine the topological relationship between each node in the map, thereby improving the subspace association network diagram; finally, using the improved subspace association network diagram, the state analysis of the target city can be completed, thereby obtaining the prediction results of indicators such as traffic flow, air quality and/or passenger flow distribution of the target city in different states.

通过上述设计,本发明提供了一种通用的基于子空间序列的过程建模方法,旨在支持利用多源异构数据进行复杂动态过程的表示建模和关联分析,该方法通过子空间序列的抽象化表示和动态建模,实现对复杂系统的多角度分析、动态演化预测和关联效应分析,其创新之处在于引入了序列维度,构建了基于图模型的子空间间的依赖关系,并同时结合了序列依赖与拓扑约束;如此,可增强建模的准确性,从而提高对系统分析的准确性;同时,本方法无需依赖于大规模标注的训练数据以及历史参数,因此,非常适用于非线性复杂系统的分析;此外,通过数据挖掘来进一步的挖掘图谱中各节点间的拓扑关系,并以此来更新子空间关联网络图;基于此,可实现对子空间网络整体结构的深入分析,从而能够确定出全局性的关联模式和重要节点,可进一步的增加建模精度,从而提高对系统分析(即目标城市状态预测)的准确性;由此,本发明非常适用于在复杂系统的分析领域的大规模应用与推广。Through the above design, the present invention provides a general process modeling method based on subspace sequence, which aims to support the representation modeling and association analysis of complex dynamic processes using multi-source heterogeneous data. The method realizes multi-angle analysis, dynamic evolution prediction and association effect analysis of complex systems through abstract representation and dynamic modeling of subspace sequences. Its innovation lies in the introduction of sequence dimension, the construction of dependency relationship between subspaces based on graph model, and the combination of sequence dependency and topological constraint at the same time; in this way, the accuracy of modeling can be enhanced, thereby improving the accuracy of system analysis; at the same time, the method does not need to rely on large-scale annotated training data and historical parameters, and is therefore very suitable for the analysis of nonlinear complex systems; in addition, the topological relationship between nodes in the graph is further mined through data mining, and the subspace association network diagram is updated accordingly; based on this, an in-depth analysis of the overall structure of the subspace network can be achieved, so that the global association pattern and important nodes can be determined, and the modeling accuracy can be further increased, thereby improving the accuracy of system analysis (i.e., prediction of the target city state); therefore, the present invention is very suitable for large-scale application and promotion in the field of complex system analysis.

在一个可能的设计中,所述数据集合还包括有多个第二维度的数据,其中,所述第二维度用于表征数据逻辑维度,且数据逻辑维度包括时间维度和/或事件维度;In a possible design, the data set further includes data of multiple second dimensions, wherein the second dimension is used to represent a data logical dimension, and the data logical dimension includes a time dimension and/or an event dimension;

其中,构建出所有子空间共同的子空间序列,包括:Among them, the subspace sequence common to all subspaces is constructed, including:

从所述数据集合中选择出全部第二维度的数据或若干第二维度的数据;Selecting all or some of the data of the second dimension from the data set;

基于选择出的第二维度的数据,构建出所有子空间共同的子空间序列,其中,所述子空间序列包括时间序列和/或事件发生序列。Based on the selected data of the second dimension, a subspace sequence common to all subspaces is constructed, wherein the subspace sequence includes a time sequence and/or an event occurrence sequence.

在一个可能的设计中,对所述子空间关联网络图进行数据挖掘处理,得到更新后的子空间关联网络图,包括:In a possible design, data mining is performed on the subspace association network graph to obtain an updated subspace association network graph, including:

获取各个子空间的特征描述信息,其中,各个子空间的特征描述信息表示形式相同,且任一子空间的特征描述信息包括向量表示信息、张量表示信息、图表示信息、深度表示信息以及语义表示信息中的一种或多种;Acquire feature description information of each subspace, wherein the feature description information of each subspace has the same representation form, and the feature description information of any subspace includes one or more of vector representation information, tensor representation information, graph representation information, depth representation information, and semantic representation information;

基于各个子空间的特征描述信息,并利用数据挖掘算法,对所述子空间关联网络图进行数据挖掘处理,以在数据挖掘后,得到所述更新后的子空间关联网络图。Based on the characteristic description information of each subspace and using a data mining algorithm, data mining processing is performed on the subspace association network diagram to obtain the updated subspace association network diagram after data mining.

在一个可能的设计中,获取各个子空间的特征描述信息,包括:In a possible design, feature description information of each subspace is obtained, including:

利用特征表示算法,提取出各个子空间对应的特征描述信息,其中,特征表示算法包括向量表示算法、张量表示算法、图表示算法、深度表示算法以及语义表示算法中的一种或多种。The feature description information corresponding to each subspace is extracted by using a feature representation algorithm, wherein the feature representation algorithm includes one or more of a vector representation algorithm, a tensor representation algorithm, a graph representation algorithm, a depth representation algorithm, and a semantic representation algorithm.

在一个可能的设计中,所述数据挖掘算法包括图卷积网络算法、网络嵌入算法、网络动态建模算法、多粒度语义学习算法、强化学习算法和/或小波网络算法。In one possible design, the data mining algorithm includes a graph convolutional network algorithm, a network embedding algorithm, a network dynamic modeling algorithm, a multi-granularity semantic learning algorithm, a reinforcement learning algorithm and/or a wavelet network algorithm.

在一个可能的设计中,利用更新后的子空间关联网络图,得出所述待分析目标的分析结果,包括:In a possible design, the updated subspace association network diagram is used to obtain the analysis result of the target to be analyzed, including:

利用增量学习算法或迁移学习算法,对所述更新后的子空间关联网络图中的各个子空间进行局部状态预测处理,以得到各个子空间的局部状态预测结果;Using an incremental learning algorithm or a transfer learning algorithm, a local state prediction process is performed on each subspace in the updated subspace association network diagram to obtain a local state prediction result of each subspace;

利用各个子空间的局部状态预测结果,得出所述待分析目标的分析结果。The analysis result of the target to be analyzed is obtained by using the local state prediction results of each subspace.

在一个可能的设计中,在得出所述待分析目标的分析结果后,所述方法还包括:In a possible design, after obtaining the analysis result of the target to be analyzed, the method further includes:

基于所述待分析目标的分析结果,调整所述子空间关联网络图,以实现子空间关联网络图的反馈学习。Based on the analysis result of the target to be analyzed, the subspace association network diagram is adjusted to achieve feedback learning of the subspace association network diagram.

第二方面,提供了一种基于子空间序列的多维数据建模与分析系统,包括:In a second aspect, a multidimensional data modeling and analysis system based on subspace sequences is provided, comprising:

获取单元,用于获取待分析目标的数据集合,其中,所述数据集合中包括有待分析目标的多个第一维度的数据,其中,所述待分析目标包括目标城市,且第一维度用于表征数据种类;An acquisition unit, configured to acquire a data set of a target to be analyzed, wherein the data set includes data of a plurality of first dimensions of the target to be analyzed, wherein the target to be analyzed includes a target city, and the first dimension is used to characterize a data type;

子空间构建单元,用于从所述数据集合中选取出若干指定第一维度的数据,并基于各个指定第一维度的数据,构建出每个指定第一维度的数据对应的子空间;A subspace construction unit, configured to select a plurality of data of a specified first dimension from the data set, and construct a subspace corresponding to each data of the specified first dimension based on each data of the specified first dimension;

子空间序列单元,用于构建出所有子空间共同的子空间序列,其中,所述子空间序列用于表征数据的逻辑顺序;A subspace sequence unit, used to construct a subspace sequence common to all subspaces, wherein the subspace sequence is used to represent the logical order of data;

映射单元,用于基于所述子空间序列,对所述子空间序列与各个子空间进行对偶关系映射处理,以为各个子空间中的每个子空间实例建立序列编号,其中,任一子空间中的子空间实例用于表征该任一子空间中的一个数据;A mapping unit, configured to perform dual relationship mapping processing on the subspace sequence and each subspace based on the subspace sequence, so as to establish a sequence number for each subspace instance in each subspace, wherein the subspace instance in any subspace is used to represent a data in the any subspace;

子空间关联网络图构建单元,用于利用建立了序列编号的各个子空间,构建出包含有所有子空间的子空间关联网络图,其中,所述子空间关联网络图表示为G={V,E,g},V为节点集合,E为节点边集合,g为图属性集合,所述节点集合中的各节点为各子空间中的子空间实例,且任一节点使用对应子空间实例的序列编号表示;A subspace association network graph construction unit is used to construct a subspace association network graph containing all subspaces by using each subspace with established serial numbers, wherein the subspace association network graph is represented by G={V,E,g}, V is a node set, E is a node edge set, g is a graph attribute set, each node in the node set is a subspace instance in each subspace, and any node is represented by a serial number corresponding to the subspace instance;

数据挖掘单元,用于对所述子空间关联网络图进行数据挖掘处理,得到更新后的子空间关联网络图;A data mining unit, used for performing data mining processing on the subspace association network graph to obtain an updated subspace association network graph;

分析单元,用于利用更新后的子空间关联网络图,得出所述待分析目标的分析结果,其中,所述分析结果包括所述目标城市的城市状态预测结果,且所述城市状态预测结果包括交通流量预测结果、空气质量预测结果和/或人流分布预测结果。An analysis unit is used to use the updated subspace association network diagram to obtain analysis results of the target to be analyzed, wherein the analysis results include the city status prediction results of the target city, and the city status prediction results include traffic flow prediction results, air quality prediction results and/or crowd distribution prediction results.

第三方面,提供了一种基于子空间序列的多维数据建模与分析装置,以装置为电子设备为例,包括依次通信相连的存储器、处理器和收发器,其中,所述存储器用于存储计算机程序,所述收发器用于收发消息,所述处理器用于读取所述计算机程序,执行如第一方面或第一方面中任意一种可能设计的所述基于子空间序列的多维数据建模与分析方法。In the third aspect, a multidimensional data modeling and analysis device based on subspace sequences is provided. Taking the device as an electronic device as an example, the device includes a memory, a processor and a transceiver which are communicatively connected in sequence, wherein the memory is used to store computer programs, the transceiver is used to send and receive messages, and the processor is used to read the computer program to execute the multidimensional data modeling and analysis method based on subspace sequences as described in the first aspect or any possible design of the first aspect.

第四方面,提供了一种存储介质,存储介质上存储有指令,当所述指令在计算机上运行时,执行如第一方面或第一方面中任意一种可能设计的所述基于子空间序列的多维数据建模与分析方法。In a fourth aspect, a storage medium is provided, on which instructions are stored. When the instructions are executed on a computer, the multidimensional data modeling and analysis method based on subspace sequences as described in the first aspect or any possible design of the first aspect is executed.

第五方面,提供了一种包含指令的计算机程序产品,当指令在计算机上运行时,使计算机执行如第一方面或第一方面中任意一种可能设计的所述基于子空间序列的多维数据建模与分析方法。In a fifth aspect, a computer program product comprising instructions is provided, which, when executed on a computer, enables the computer to execute the multidimensional data modeling and analysis method based on subspace sequences as described in the first aspect or any possible design of the first aspect.

有益效果:Beneficial effects:

(1)本发明提供了一种通用的基于子空间序列的过程建模方法,旨在支持利用多源异构数据进行复杂动态过程的表示建模和关联分析,该方法通过子空间序列的抽象化表示和动态建模,实现对复杂系统的多角度分析、动态演化预测和关联效应分析,其创新之处在于引入了序列维度,构建了基于图模型的子空间间的依赖关系,并同时结合了序列依赖与拓扑约束;如此,可增强建模的准确性,从而提高对系统分析的准确性;同时,本方法无需依赖于大规模标注的训练数据以及历史参数,因此,非常适用于非线性复杂系统的分析;此外,通过数据挖掘来进一步的挖掘图谱中各节点间的拓扑关系,并以此来更新子空间关联网络图;基于此,可实现对子空间网络整体结构的深入分析,从而能够确定出全局性的关联模式和重要节点,可进一步的增加建模精度,从而提高对系统分析(即目标城市状态预测)的准确性;由此,本发明非常适用于在复杂系统的分析领域的大规模应用与推广。(1) The present invention provides a general process modeling method based on subspace sequences, which aims to support the representation modeling and correlation analysis of complex dynamic processes using multi-source heterogeneous data. The method realizes multi-angle analysis, dynamic evolution prediction and correlation effect analysis of complex systems through abstract representation and dynamic modeling of subspace sequences. Its innovation lies in the introduction of sequence dimensions, the construction of dependencies between subspaces based on graph models, and the combination of sequence dependencies and topological constraints. In this way, the accuracy of modeling can be enhanced, thereby improving the accuracy of system analysis. At the same time, the method does not need to rely on large-scale annotated training data and historical parameters, so it is very suitable for the analysis of nonlinear complex systems. In addition, the topological relationship between nodes in the graph is further mined through data mining, and the subspace correlation network diagram is updated accordingly. Based on this, an in-depth analysis of the overall structure of the subspace network can be achieved, so that the global correlation pattern and important nodes can be determined, which can further increase the modeling accuracy, thereby improving the accuracy of system analysis (i.e., the prediction of the target city state). Therefore, the present invention is very suitable for large-scale application and promotion in the field of complex system analysis.

(2)本发明在数据挖掘过程中,通过引入图神经网络,来实现子空间序列的端到端联合表示学习,以统一各子空间的特征描述,并基于此,来进行子空间关联网络图的挖掘;如此,使得本发明无需人工特征工程,提升了建模的自动化与泛化能力。(2) In the data mining process, the present invention introduces graph neural networks to achieve end-to-end joint representation learning of subspace sequences, so as to unify the feature descriptions of each subspace, and based on this, to mine the subspace association network graph; in this way, the present invention does not require manual feature engineering, and improves the automation and generalization capabilities of modeling.

(3)在整个模型构建过程中,引入了反馈学习,即将预测结果又反馈到前期的网络构建和特征学习中,形成闭环,如此,可在使用过程中,实现子空间关联网络图的调整,从而不断提升模型精度,以进一步的提高对复杂系统分析的准确性。(3) Feedback learning is introduced in the entire model construction process, that is, the prediction results are fed back to the previous network construction and feature learning to form a closed loop. In this way, the subspace association network diagram can be adjusted during use, thereby continuously improving the model accuracy and further improving the accuracy of complex system analysis.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为本发明实施例提供的基于子空间序列的多维数据建模与分析方法的步骤示意图;FIG1 is a schematic diagram of the steps of a multidimensional data modeling and analysis method based on subspace sequences provided by an embodiment of the present invention;

图2为本发明实施例提供的基于子空间序列的多维数据建模与分析方法的流程框图;FIG2 is a flowchart of a multidimensional data modeling and analysis method based on subspace sequences provided by an embodiment of the present invention;

图3为本发明实施例提供的物体模型示例图;FIG3 is an example diagram of an object model provided by an embodiment of the present invention;

图4为本发明实施例提供的场模型示意图;FIG4 is a schematic diagram of a field model provided by an embodiment of the present invention;

图5为本发明实施例提供的空间网络模型示意图;FIG5 is a schematic diagram of a spatial network model provided by an embodiment of the present invention;

图6为本发明实施例提供的具有序列编号的交通子空间的示意图;FIG6 is a schematic diagram of a traffic subspace with a sequence number provided by an embodiment of the present invention;

图7为本发明实施例提供的具有序列编号的城市热道效应子空间的示意图;FIG7 is a schematic diagram of a city thermal channel effect subspace with serial numbers provided by an embodiment of the present invention;

图8为本发明实施例提供的具有序列编号的商业区分布子空间的示意图;FIG8 is a schematic diagram of a commercial district distribution subspace with serial numbers provided by an embodiment of the present invention;

图9为本发明实施例提供的子空间表征示例图;FIG9 is a diagram showing an example of subspace representation provided by an embodiment of the present invention;

图10为本发明实施例提供的整体图力导向布局的示例图;FIG10 is an example diagram of an overall graph force-directed layout provided by an embodiment of the present invention;

图11为本发明实施例提供的分步时序图的示例图;FIG11 is an example diagram of a step-by-step timing diagram provided by an embodiment of the present invention;

图12为本发明实施例提供的状态预测与影响传播示例图;FIG12 is an example diagram of state prediction and impact propagation provided by an embodiment of the present invention;

图13为本发明实施例提供的基于子空间序列的多维数据建模与分析系统的结构示意图。FIG13 is a schematic diagram of the structure of a multidimensional data modeling and analysis system based on subspace sequences provided in an embodiment of the present invention.

具体实施方式Detailed ways

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将结合附图和实施例或现有技术的描述对本发明作简单地介绍,显而易见地,下面关于附图结构的描述仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。在此需要说明的是,对于这些实施例方式的说明用于帮助理解本发明,但并不构成对本发明的限定。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the present invention will be briefly introduced below in combination with the drawings and the description of the embodiments or the prior art. Obviously, the following description of the structure of the drawings is only some embodiments of the present invention. For ordinary technicians in this field, other drawings can be obtained based on these drawings without creative work. It should be noted that the description of these embodiments is used to help understand the present invention, but does not constitute a limitation of the present invention.

实施例:Example:

参见图1所示,本实施例所提供的基于子空间序列的多维数据建模与分析方法,通过子空间序列的抽象化表示和动态建模,来实现对复杂系统的多角度分析、动态演化预测和关联效应分析;其中,举例本方法可以但不限于在分析端侧运行,可选的,分析端可以但不限于为个人电脑(personal computer,PC)、平板电脑或智能手机,可以理解的,前述执行主体并不构成对本申请实施例的限定,相应的,本方法的运行步骤可以但不限于如下述步骤S1~S7所示。As shown in Figure 1, the multidimensional data modeling and analysis method based on subspace sequences provided in this embodiment realizes multi-angle analysis, dynamic evolution prediction and correlation effect analysis of complex systems through abstract representation and dynamic modeling of subspace sequences; among them, for example, this method can be but not limited to running on the analysis end side, optionally, the analysis end can be but not limited to a personal computer (personal computer, PC), a tablet computer or a smart phone. It can be understood that the aforementioned execution subject does not constitute a limitation on the embodiments of the present application. Accordingly, the operation steps of this method can be but not limited to the following steps S1 to S7.

S1.获取待分析目标的数据集合,其中,所述数据集合中包括有待分析目标的多个第一维度的数据,其中,所述待分析目标包括目标城市,且所述第一维度用于表征数据种类;在本实施例中,待分析目标可以但不限于为目标城市,即对目标城市进行智慧城市多维协同分析;当然,还可根据其他使用场景进行具体设定,如进行工业产品的生命周期分析等等,在此不限定于前述举例;同时,待分析目标的数据集合,实质则是其对应的多维数据空间,该多维数据空间中包含有第一维度的数据和第二维度的数据;具体的,第一维度用于表征数据种类,如目标城市的交通数据(包括车辆流量、道路状况等数据)、空气质量数据、人口分布数据等等,而第二维度则用于表征数据逻辑维度,且数据逻辑维度则可以但不限于包括时间维度和/或事件维度(该维度可根据不同应用场景和数据种类而具体设置,如还可以包括阶段维度(多阶段过程中数据的变化)和行为维度(不同行为引起的数据变化)等);如此,即可基于该数据集合,来进行待分析目标的数据建模,从而便于后续对其进行数据分析;可选的,建模过程可以但限于如下述步骤S2~S5所示。S1. Obtain a data set of a target to be analyzed, wherein the data set includes data of multiple first dimensions of the target to be analyzed, wherein the target to be analyzed includes a target city, and the first dimension is used to characterize the data type; in this embodiment, the target to be analyzed may be but is not limited to the target city, that is, a multi-dimensional collaborative analysis of a smart city is performed on the target city; of course, specific settings may also be made according to other usage scenarios, such as life cycle analysis of industrial products, etc., which are not limited to the aforementioned examples; at the same time, the data set of the target to be analyzed is essentially its corresponding multi-dimensional data space, which contains data of the first dimension and data of the second dimension; specifically, the first dimension The first dimension is used to characterize the data type, such as the traffic data of the target city (including vehicle flow, road conditions and other data), air quality data, population distribution data, etc., while the second dimension is used to characterize the data logic dimension, and the data logic dimension may include but is not limited to the time dimension and/or event dimension (the dimension may be specifically set according to different application scenarios and data types, such as the stage dimension (data changes in a multi-stage process) and the behavior dimension (data changes caused by different behaviors)); in this way, data modeling of the target to be analyzed may be performed based on the data set, so as to facilitate subsequent data analysis; optionally, the modeling process may be but is not limited to the following steps S2 to S5.

S2.从所述数据集合中选取出若干指定第一维度的数据,并基于各个指定第一维度的数据,构建出每个指定第一维度的数据对应的子空间;在本实施例中,选取出若干指定第一维度的数据,相当于是提取出关键种类的数据,如以目标城市为例,可提取出交通数据、空气质量数据、人口分布数据等等,当然,对于不同的分析目标,其对应的指定第一维度的数据不同,可根据实际使用的具体确定;同时,在构建子空间时,可使用域知识或先验确定子空间的构成维度,而后再基于此来构建出不同指定第一维度的数据对应的子空间。S2. Select a number of data of the specified first dimension from the data set, and construct a subspace corresponding to each data of the specified first dimension based on the data of each specified first dimension; in this embodiment, selecting a number of data of the specified first dimension is equivalent to extracting key types of data. For example, taking the target city as an example, traffic data, air quality data, population distribution data, etc. can be extracted. Of course, for different analysis targets, the corresponding data of the specified first dimension are different, which can be determined according to actual use; at the same time, when constructing the subspace, domain knowledge or a priori can be used to determine the constituent dimensions of the subspace, and then based on this, subspaces corresponding to data of different specified first dimensions are constructed.

如此通过前述设计,通过构建子空间的抽象化表示,可以从复杂系统中提取出多个关键反映维度(比如交通、空气质量、人口分布等维度),同时,对每一个关键子空间聚焦进行建模,可屏蔽系统的其他复杂成分,从而在降低建模难度的同时,获得多角度的分析视图,以提高建模的准确性。In this way, through the above design, by constructing an abstract representation of the subspace, multiple key reflection dimensions (such as traffic, air quality, population distribution, etc.) can be extracted from the complex system. At the same time, focusing on modeling each key subspace can shield other complex components of the system, thereby reducing the difficulty of modeling and obtaining a multi-angle analysis view to improve the accuracy of modeling.

在具体实施时,子空间创建为多维数据建模的常用技术,其原理不再赘述;而在得出各个指定第一维度的数据对应的子空间后,本实施例则构建了用于表征数据逻辑顺序的子空间序列,来对各个子空间中的每个数据进行编号,以便描述子空间实例之间的拓扑关系;其中,子空间序列的构建构成可以但不限于如下述步骤S3所示。In specific implementation, subspace creation is a common technology for multidimensional data modeling, and its principle is not repeated here. After obtaining the subspace corresponding to each specified first dimension of data, this embodiment constructs a subspace sequence for characterizing the logical order of the data to number each data in each subspace in order to describe the topological relationship between subspace instances. The construction structure of the subspace sequence can be but is not limited to that shown in the following step S3.

S3.构建出所有子空间共同的子空间序列,其中,所述子空间序列用于表征数据的逻辑顺序;在具体实施时,举例可以但不限于从所述数据集合中选择出全部第二维度的数据或若干第二维度的数据;然后基于选择出的第二维度的数据,构建出所有子空间共同的子空间序列;在本实施例中,举例所述子空间序列可以但不限于包括时间序列和/或事件发生序列。S3. Construct a subspace sequence common to all subspaces, wherein the subspace sequence is used to characterize the logical order of data; in specific implementation, the example may include but is not limited to selecting all second dimensional data or several second dimensional data from the data set; then, based on the selected second dimensional data, construct a subspace sequence common to all subspaces; in this embodiment, the subspace sequence may include but is not limited to time series and/or event occurrence sequence.

具体的,前述步骤S3实质是利用选择出的第二维度的数据,来形成一个有序的序列,这个序列可以表示时序、事件序列或其他序列关系,可通过时间戳或其他指标进行排序;如按时间顺序和/或按事件发生顺序进行排序;如以时间顺序进行排序为例,该子空间序列则为时间序列,序列中的各个元素则为时刻;又如,若按照事件发生顺序进行排序,前述子空间序列则为事件发生序列,序列则包含有各个事件;如此,基于将前述子空间序列与各个子空间结合,可在子空间建模基础上增加时间等序列维度,从而持续追踪每个子空间状态的动态演化轨迹,揭示子空间之间的协同动态演化规律。Specifically, the aforementioned step S3 actually uses the data of the selected second dimension to form an ordered sequence. This sequence can represent a time series, an event sequence or other sequence relationships, and can be sorted by timestamps or other indicators; such as sorting in chronological order and/or in the order of event occurrence; for example, taking chronological order as an example, the subspace sequence is a time series, and each element in the sequence is a moment; for example, if sorting is performed in the order of event occurrence, the aforementioned subspace sequence is an event occurrence sequence, and the sequence contains each event; in this way, based on combining the aforementioned subspace sequence with each subspace, sequence dimensions such as time can be added on the basis of subspace modeling, so as to continuously track the dynamic evolution trajectory of each subspace state and reveal the laws of collaborative dynamic evolution between subspaces.

可选的,利用子空间序列,来对各个子空间进行序列编号的具体过程可以但不限于如下述步骤S4所示。Optionally, the specific process of using the subspace sequence to serially number each subspace may be, but is not limited to, as shown in the following step S4.

S4.基于所述子空间序列,对所述子空间序列与各个子空间进行对偶关系映射处理,以为各个子空间中的每个子空间实例建立序列编号,其中,任一子空间中的子空间实例用于表征该任一子空间中的一个数据;在具体应用时,相当于是利用子空间序列构成的对偶空间,来对各个子空间中的子空间实例进行序列编号;在本实施例中,对偶空间是由步骤S3生成的序列维度构成的空间,与所选子空间建立对偶关系的映射,具体来说,对偶空间提供了一个外部、独立的坐标系统,描述子空间实例之间的拓扑关系;其中,子空间实例的序列编号由子空间和序列空间组合而成,如可记为{Sn,(Dij)},解释为第n个子空间中的实例可以由子空间种类Sn、序列维度i和序列号j唯一确定;如子空间序列为时间序列,{S1,(Di1)},可表示第一个子空间中,在第一时刻时的数据;当然,前述举例仅是示意,不仅限于此,其子空间序列维度可根据实际使用而具体设定。S4. Based on the subspace sequence, a dual relationship mapping process is performed on the subspace sequence and each subspace to establish a serial number for each subspace instance in each subspace, wherein the subspace instance in any subspace is used to represent a data in the any subspace; in specific applications, it is equivalent to using the dual space composed of the subspace sequence to serially number the subspace instances in each subspace; in this embodiment, the dual space is a space composed of the sequence dimension generated in step S3, and a dual relationship mapping is established with the selected subspace. Specifically, the dual space provides an external, independent coordinate system to describe the topological relationship between subspace instances; wherein the serial number of the subspace instance is composed of the subspace and the sequence space, such as {Sn, (Dij)}, which is interpreted as the instance in the nth subspace can be uniquely determined by the subspace type Sn, the sequence dimension i and the sequence number j; if the subspace sequence is a time series, {S1, (Di1)}, can represent the data in the first subspace at the first moment; of course, the above example is only for illustration and is not limited to this, and its subspace sequence dimension can be specifically set according to actual use.

在基于步骤S4完成各个子空间中每个子空间实例的序列编号后,即可利用建立了序列编号的各个子空间,来建立各个子空间的关系图谱(即下述子空间关联网络图),从而明确各个子空间节点之间的依赖与约束关系;其中,关系图谱的构建过程可以但不限于如下述步骤S5所示。After completing the serial numbering of each subspace instance in each subspace based on step S4, the subspaces with established serial numbers can be used to establish a relationship graph of each subspace (i.e., the subspace association network graph below), so as to clarify the dependency and constraint relationship between each subspace node; wherein, the process of constructing the relationship graph can be but is not limited to as shown in the following step S5.

S5.利用建立了序列编号的各个子空间,构建出包含有所有子空间的子空间关联网络图,其中,所述子空间关联网络图表示为G={V,E,g},V为节点集合,E为节点边集合,g为图属性集合,所述节点集合中的各节点为各子空间中的子空间实例,且任一节点使用对应子空间实例的序列编号表示;在本实施例中,相当于是将不同子空间中的序列编号作为模型中的节点,然后将具有关联关系之间的节点采用边进行连接,如此,即可构建出包含有所有子空间的子空间关联网络图。S5. Using the subspaces with established serial numbers, a subspace association network diagram including all subspaces is constructed, wherein the subspace association network diagram is represented as G={V,E,g}, V is a node set, E is a node edge set, g is a graph attribute set, each node in the node set is a subspace instance in each subspace, and any node is represented by the serial number of the corresponding subspace instance; in this embodiment, it is equivalent to using the serial numbers in different subspaces as nodes in the model, and then connecting the nodes with association relationships with edges, so that a subspace association network diagram including all subspaces can be constructed.

可选的,所述节点集合中的每个节点记为v{Sn,(Dij)},所述节点边集合中的每条边可记为:e{(Sm,(Dpq)),(Sn,(Dij))},其中,该边则表示序列编号为(Sm,(Dpq))的节点,与序列编号为(Sn,(Dij))之间的连接边;如此,则可实现所有子空间中各个节点之间的关联,从而形成节点间的关系图谱;更进一步的,前述图属性集合表示图的性质或结构的集合,可以但不限于包括图的拓扑结构以及权重信息;在初始时,该集合可能为空,但随着深度图神经网络技术的学习,这个集合会逐渐丰富;在不断学习过程中,图的性质和结构会动态调整和优化,以更准确地反映子空间之间的关联和拓扑结构。Optionally, each node in the node set is recorded as v{S n ,(D ij )}, and each edge in the node edge set can be recorded as: e{(S m ,(D pq )),(S n ,(D ij ))}, wherein the edge represents the connection edge between the node with sequence number (S m ,(D pq )) and the node with sequence number (S n ,(D ij )); in this way, the association between the nodes in all subspaces can be realized, thereby forming a relationship graph between the nodes; further, the aforementioned graph attribute set represents a set of properties or structures of the graph, which may include but is not limited to the topological structure and weight information of the graph; initially, the set may be empty, but as the deep graph neural network technology learns, this set will gradually become richer; in the continuous learning process, the properties and structure of the graph will be dynamically adjusted and optimized to more accurately reflect the association and topological structure between the subspaces.

在完成子空间关联网络图的构建后,则可进行数据挖掘处理,以挖掘节点之间的拓扑关系,发现全局性的模式,从而完善子空间关联网络图;其中,数据挖掘过程可以但不限于如下述步骤S6所示。After completing the construction of the subspace association network diagram, data mining processing can be performed to mine the topological relationship between nodes and discover global patterns, thereby improving the subspace association network diagram; wherein the data mining process can be but is not limited to as shown in the following step S6.

S6.对所述子空间关联网络图进行数据挖掘处理,得到更新后的子空间关联网络图;在具体应用时,举例可以但不限于先进行子空间特征学习,然后,再利用学习到的子空间特征,来进行数据挖掘;其中,前述过程可以但不限于如下述步骤S61和步骤S62所示。S6. Perform data mining on the subspace association network diagram to obtain an updated subspace association network diagram; in specific applications, for example, it can be but not limited to first performing subspace feature learning, and then using the learned subspace features to perform data mining; wherein the aforementioned process can be but not limited to as shown in the following steps S61 and S62.

S61.获取各个子空间的特征描述信息,其中,各个子空间的特征描述信息表示形式相同,且任一子空间的特征描述信息包括向量表示信息、张量表示信息、图表示信息、深度表示信息以及语义表示信息中的一种或多种;在本实施例中,步骤S61则是利用各种特征表达技术,学习子空间的特征表示,以捕获子空间的内在属性,该表示综合了全部子空间的信息,支持后续的全局模式发现与局部状态预测。S61. Obtain feature description information of each subspace, wherein the feature description information of each subspace has the same representation form, and the feature description information of any subspace includes one or more of vector representation information, tensor representation information, graph representation information, depth representation information and semantic representation information; in this embodiment, step S61 utilizes various feature expression techniques to learn the feature representation of the subspace to capture the intrinsic properties of the subspace, and the representation integrates the information of all subspaces to support subsequent global pattern discovery and local state prediction.

可选的,举例可以但不限于利用特征表示算法,来提取出各个子空间对应的特征描述信息,其中,特征表示算法包括向量表示算法(该算法为每个子空间学习一个低维向量,表示其数值特征)、张量表示算法(其使用张量表达子空间的多维特征,进行张量分解等操作,从而来得到张量表达)、图表示算法(实质是使用图结构表示子空间内在的拓扑结构,从而得到对应的拓扑表示信息)、深度表示算法(使用自动编码器、流形学习等方法学习子空间的深层非线性特征)以及语义表示算法(使用NLP技术学习表达子空间属性的词向量或句向量)中的一种或多种;具体的,还可综合不同技术学习到的子空间的集成特征,来作为子空间的特征描述信息。Optionally, for example, but not limited to, feature representation algorithms can be used to extract feature description information corresponding to each subspace, where the feature representation algorithms include vector representation algorithms (the algorithm learns a low-dimensional vector for each subspace to represent its numerical features), tensor representation algorithms (which use tensors to express the multi-dimensional features of the subspace, perform tensor decomposition and other operations to obtain tensor expressions), graph representation algorithms (essentially using graph structures to represent the intrinsic topological structure of the subspace, thereby obtaining corresponding topological representation information), deep representation algorithms (using autoencoders, manifold learning and other methods to learn the deep nonlinear features of the subspace) and semantic representation algorithms (using NLP technology to learn word vectors or sentence vectors that express subspace attributes) One or more; specifically, the integrated features of subspaces learned by different technologies can also be combined to serve as feature description information of the subspace.

在本实施例中,向量表示算法可采用:Word2Vec或GloVe(一种词向量表示算法);张量表示算法可采用:TensorRNN或CP分解(Canonical Polyadic Decomposition)等算法模型,图表示算法可采用:图卷积网络(Graph Convolutional Networks,GCNs)或图注意力网络(Graph Attention Networks,GATs)等;深度表示算法则可采用变分自动编码器(Variational Autoencoders,VAEs)或生成对抗网络(Generative AdversarialNetworks,GANs);语义表示算法可以但不限于采用:BERT(Bidirectional EncoderRepresentations from Transformers,语言描述模型)或GPT(Generative Pre-TrainingTransformer,生成式预训练Transformer模型);而集成特征的集成表示算法则可以但不限于采用:基于图神经网络的多模态融合或对抗多视图自编码器(Adversarial Multi-viewAutoencoder);当然,前述举例仅是示意,各种表示算法的具体选用不仅限定于此。In this embodiment, the vector representation algorithm may adopt: Word2Vec or GloVe (a word vector representation algorithm); the tensor representation algorithm may adopt: TensorRNN or CP decomposition (Canonical Polyadic Decomposition) and other algorithm models; the graph representation algorithm may adopt: Graph Convolutional Networks (GCNs) or Graph Attention Networks (GATs), etc.; the depth representation algorithm may adopt Variational Autoencoders (VAEs) or Generative Adversarial Networks (GANs); the semantic representation algorithm may adopt, but is not limited to: BERT (Bidirectional Encoder Representations from Transformers, language description model) or GPT (Generative Pre-Training Transformer, generative pre-training Transformer model); and the integrated representation algorithm of integrated features may adopt, but is not limited to: multimodal fusion based on graph neural network or adversarial multi-view autoencoder (Adversarial Multi-view Autoencoder); of course, the above examples are only for illustration, and the specific selection of various representation algorithms is not limited to this.

举例可以但不限于采用如下步骤S61a和步骤S61b,来得出前述特征描述信息。For example, but not limited to, the following steps S61a and S61b may be used to obtain the aforementioned feature description information.

在本实施例中,举例任一子空间的特征描述信息可以但不限于表示为:其中,F(·)则表示前述各特征表示算法中的特征学习函数,Sn,(Dxi,Dyj,Dzk)则表示任一子空间中的节点,当然,在该式中,其子空间序列则为三维。In this embodiment, the feature description information of any subspace may be, but is not limited to, represented as: Among them, F(·) represents the feature learning function in the aforementioned feature representation algorithms, Sn , ( Dxi , Dyj , Dzk ) represents the nodes in any subspace. Of course, in this formula, the subspace sequence is three-dimensional.

如此基于前述步骤S61,通过特征表达的学习,可以获得对子空间更具代表性的特征描述,从而可减少对原始数据的依赖,提升模型的泛化能力和适应性。In this way, based on the aforementioned step S61, a more representative feature description of the subspace can be obtained through learning of feature expression, thereby reducing dependence on original data and improving the generalization ability and adaptability of the model.

在得到各个子空间的特征描述信息后,即可基于此,来进行子空间关联网络图的拓扑关系与全局模式挖掘;其中,数据挖掘过程可以但不限于如下述步骤S62所示。After obtaining the feature description information of each subspace, the topological relationship and global pattern mining of the subspace association network graph can be performed based on this; wherein, the data mining process can be but is not limited to the step S62 shown below.

S62.基于各个子空间的特征描述信息以及所述子空间关联网络图,并利用数据挖掘算法,对所述子空间关联网络图进行数据挖掘处理,以在数据挖掘后,得到所述更新后的子空间关联网络图;在本实施例中,相当于是利用前述步骤S61所学习得到的子空间特征,以及前述步骤S5所构建的子空间关联网络上,运用图神经网络等技术,来挖掘节点之间的内在拓扑关系,并发现网络的全局链接模式,如此,可揭示子空间之间更深层的内在依赖关系,从而对网络拓扑结构的进一步优化和提升;可选的,举例所述数据挖掘算法包括可以但不限于图卷积网络算法(用卷积操作直接在图结构上进行特征提取和聚合,如GCN、GAT等,可以用于节点分类和节点发现)、网络嵌入算法(通过网络随机游走和语言模型嵌入等技术学习子空间的向量化表示,如Node2Vec、DeepWalk(其是将随机游走(random walk)和word2vec两种算法相结合的图结构数据挖掘算法)等算法)、网络动态建模算法(即采用连续时间动态网络模型,学习网络拓扑结构的动态演化,如EvolveGCN(动态图的参数演化图卷积网络))、多粒度语义学习算法(利用语义嵌入与知识图谱技术,学习子空间的语义表示,构建抽象级别更高的概念关联,而不仅是统计关联)、强化学习算法(其用于构建子空间关联的马尔可夫决策过程,并采用强化学习方式发现更复杂的非线性依赖关系)和/或小波网络算法(如GWNN,可以表示网络的全局拓扑结构)。S62. Based on the feature description information of each subspace and the subspace association network diagram, the subspace association network diagram is subjected to data mining processing by using a data mining algorithm, so as to obtain the updated subspace association network diagram after data mining. In this embodiment, it is equivalent to using the subspace features learned in the aforementioned step S61 and the subspace association network constructed in the aforementioned step S5, using graph neural network and other technologies to mine the intrinsic topological relationship between nodes and discover the global link pattern of the network, so as to reveal the deeper intrinsic dependency relationship between subspaces, thereby further optimizing and improving the network topological structure. Optionally, the data mining algorithm includes but is not limited to a graph convolutional network algorithm (using convolution operations to directly extract and aggregate features on the graph structure, such as GCN, GAT, etc., which can be used for node classification and node discovery), a network embedding algorithm (learning the vectorized representation of the subspace through network random walks and language model embedding and other technologies, such as Node2Vec, DeepWalk (which is a random walk (random walk) and word2vec), network dynamic modeling algorithms (i.e., using continuous-time dynamic network models to learn the dynamic evolution of network topology, such as EvolveGCN (parameter evolution graph convolutional network of dynamic graphs)), multi-granularity semantic learning algorithms (using semantic embedding and knowledge graph technology to learn the semantic representation of subspaces and build conceptual associations at a higher level of abstraction, rather than just statistical associations), reinforcement learning algorithms (which are used to construct Markov decision processes for subspace associations and use reinforcement learning to discover more complex nonlinear dependencies) and/or wavelet network algorithms (such as GWNN, which can represent the global topology of the network).

在本实施例中,本步骤所得到的更新后的子空间关联网络图,相比于前述步骤S5中初始子空间关联网络图的主要联系和区别如下:In this embodiment, the main connections and differences between the updated subspace association network diagram obtained in this step and the initial subspace association network diagram in the aforementioned step S5 are as follows:

(1)步骤S5中构建的是初始的子空间网络,它基于子空间之间的基本关联来连接;而本步骤构建的是具有更深拓扑关系的全局网络图,它在步骤S5的基础上,利用深度图神经网络技术学习子空间之间的内在关联和拓扑结构。(1) The initial subspace network constructed in step S5 is connected based on the basic associations between subspaces; this step constructs a global network graph with deeper topological relationships. Based on step S5, it uses deep graph neural network technology to learn the intrinsic associations and topological structures between subspaces.

(2)步骤S5中的模型更基础,而本步骤中的模型能反映更深层的内在依赖关系。(2) The model in step S5 is more basic, while the model in this step can reflect deeper internal dependencies.

(3)步骤S5提供了原始结构,本步骤在此基础上进行了抽象和推理,得到了更高层次的网络拓扑表示。(3) Step S5 provides the original structure, and this step abstracts and infers on this basis to obtain a higher-level network topology representation.

(4)本步骤中的模型图对初始网络进行了优化和加强,两者存在递进的关系。(4) The model diagram in this step optimizes and strengthens the initial network, and there is a progressive relationship between the two.

可选的,挖掘的节点间的拓扑关系可以包括:连接的紧密程度(是直接还是多步关联);连接的方向性(单向还是双向影响);连接的权重(影响的强度大小);连接的类型(物理、信息、语义等不同类型连接),如此,综合这些拓扑信息可以更深入理解子空间之间的内在联系。Optionally, the topological relationship between the mined nodes may include: the closeness of the connection (whether it is direct or multi-step association); the directionality of the connection (one-way or two-way influence); the weight of the connection (the strength of the influence); the type of connection (different types of connections such as physical, information, semantic, etc.). In this way, the integration of these topological information can provide a deeper understanding of the intrinsic connections between subspaces.

由此通过前述步骤S61和步骤S62,即可挖掘节点之间的拓扑关系,发现全局性的模式;这一步骤更注重对子空间网络整体结构的深入分析,强调全局性的关联模式和重要节点的发现;如此,在得到更新后的子空间关联网络图后,即可进行待分析目标的分析,其分析过程可以但不限于如下述步骤S7所示。Therefore, through the aforementioned steps S61 and S62, the topological relationship between nodes can be mined and global patterns can be discovered; this step focuses more on the in-depth analysis of the overall structure of the subspace network, emphasizing the discovery of global association patterns and important nodes; in this way, after obtaining the updated subspace association network diagram, the analysis of the target to be analyzed can be carried out, and the analysis process can be but is not limited to as shown in the following step S7.

S7.利用更新后的子空间关联网络图,得出所述待分析目标的分析结果,其中,所述分析结果包括所述目标城市的城市状态预测结果,且所述城市状态预测结果包括交通流量预测结果、空气质量预测结果和/或人流分布预测结果;在本实施例中,举例可以但不限于利用增量学习算法或迁移学习算法,对所述更新后的子空间关联网络图中的各个子空间进行局部状态预测处理,以得到各个子空间的局部状态预测结果;而后,则可利用各个子空间的局部状态预测结果,得出所述待分析目标的分析结果;如在前述交通数据对应的子空间基础上,进行局部状态预测,从而得到其在未来不同时刻时的交通流量数据;在空气质量对应子空间的基础上,进行局部状态预测,从而得到其在未来不同时刻的是空气质量等等。S7. Utilize the updated subspace association network diagram to obtain the analysis result of the target to be analyzed, wherein the analysis result includes the city state prediction result of the target city, and the city state prediction result includes the traffic flow prediction result, the air quality prediction result and/or the crowd distribution prediction result; in this embodiment, for example, it can be but not limited to utilizing an incremental learning algorithm or a transfer learning algorithm to perform local state prediction processing on each subspace in the updated subspace association network diagram to obtain the local state prediction result of each subspace; then, the local state prediction result of each subspace can be utilized to obtain the analysis result of the target to be analyzed; for example, based on the subspace corresponding to the aforementioned traffic data, a local state prediction is performed to obtain the traffic flow data at different times in the future; based on the subspace corresponding to the air quality, a local state prediction is performed to obtain the air quality at different times in the future, and so on.

在具体实施时,增量学习方式使模型能够持续学习和吸收新出现的子空间状态数据,从而逐步精炼对状态演化规律的理解;其次,通过识别新子空间与已有子空间的关联,进行跨子空间的知识迁移,加速对新子空间状态规律的学习过程;最后,增强学习技术使整个预测系统能够在线优化和调整,随着新知识的不断获取,模型的预测能力得以持续提升。In specific implementation, the incremental learning method enables the model to continuously learn and absorb newly emerging subspace state data, thereby gradually refining the understanding of the state evolution laws; secondly, by identifying the relationship between new subspaces and existing subspaces, cross-subspace knowledge transfer is carried out to accelerate the learning process of new subspace state laws; finally, reinforcement learning technology enables the entire prediction system to be optimized and adjusted online. With the continuous acquisition of new knowledge, the prediction ability of the model can be continuously improved.

可选的,可以但不限于使用下述方法,来实现前述状态预测:(1)构建基于记忆增强网络结构的元学习器,持续吸收子空间新状态样本,保存到关联知识库,并通过内部注意力机制选择相关记忆,增量调整模型参数,实现自适应学习;(2)预训练源任务的状态预测模型,新子空间利用参数迁移和微调,快速获得目标预测能力,且可识别子空间语义和拓扑相似度,从而实现更好的迁移学习效果;(3)利用诸如DDPG、A3C等算法构建增强学习智能体,对环境状态建模预测,并通过奖励机制不断改进策略,并在代理过程自我模拟产生新样本,以辅助模型训练;(4)考虑局部时序结构,分析子空间状态时间演化序列,建立像LSTM等动态时间关联模型,用于状态间时序联系的局部建模;(5)新状态数据到达后增量调整时间关联模型参数,以保证对状态演化规律的持续捕捉。Optionally, the state prediction can be achieved by using, but not limited to, the following methods: (1) constructing a meta-learner based on a memory-enhanced network structure, continuously absorbing new state samples of the subspace, saving them to the associated knowledge base, and selecting related memories through an internal attention mechanism, incrementally adjusting model parameters, and achieving adaptive learning; (2) pre-training the state prediction model of the source task, and using parameter migration and fine-tuning for the new subspace to quickly obtain the target prediction capability, and being able to identify the semantic and topological similarities of the subspace, thereby achieving better transfer learning effects; (3) using algorithms such as DDPG and A3C to build an enhanced learning agent, modeling and predicting the environment state, and continuously improving the strategy through a reward mechanism, and generating new samples through self-simulation in the agent process to assist model training; (4) considering the local temporal structure, analyzing the subspace state time evolution sequence, and establishing a dynamic time association model such as LSTM for local modeling of the temporal connection between states; (5) incrementally adjusting the time association model parameters after the arrival of new state data to ensure continuous capture of the state evolution law.

如此,实现局部状态预测实质是:利用之前步骤构建的子空间关联网络和全局模式挖掘的知识,采用合适的模型或算法对每个子空间的局部状态进行预测,这一预测过程可以基于历史数据、全局状态演化趋势以及其他相关信息;当然,局部预测所使用的方法为常用方法,其原理不再赘述。In this way, the essence of realizing local state prediction is: using the knowledge of subspace association network and global pattern mining constructed in the previous steps, and using appropriate models or algorithms to predict the local state of each subspace. This prediction process can be based on historical data, global state evolution trends and other relevant information; of course, the method used for local prediction is a common method, and its principle will not be repeated.

另外,为了进一步的提高模型精度,从而增加分析的准确性,本实施例还设置有反馈学习步骤,即得出所述待分析目标的分析结果后,基于所述待分析目标的分析结果,调整所述子空间关联网络图,以实现子空间关联网络图的反馈学习;如此,参见图2所示,其整个闭环过程为:首先,构建了子空间的关联网络,将子空间之间的拓扑联系表达了出来;然后,运用特征学习,得到了对各子空间状态的特征化表达,并在此基础上,进一步的挖掘子空间之间的关联模式与全局的演化规律;而后,指导对局部子空间状态的预测和分析;最后,预测结果又反馈到前期的网络构建和特征学习中,形成闭环,由此,则可在使用过程中,实现子空间关联网络图的调整,从而不断提升模型精度,以进一步的提高对复杂系统分析的准确性。In addition, in order to further improve the model accuracy and thus increase the accuracy of the analysis, the present embodiment is further provided with a feedback learning step, that is, after obtaining the analysis result of the target to be analyzed, the subspace association network diagram is adjusted based on the analysis result of the target to be analyzed to realize feedback learning of the subspace association network diagram; thus, referring to FIG2 , the entire closed-loop process is as follows: first, the association network of the subspace is constructed to express the topological relationship between the subspaces; then, feature learning is used to obtain a characteristic expression of the state of each subspace, and on this basis, the association pattern between the subspaces and the global evolution law are further explored; then, the prediction and analysis of the local subspace state are guided; finally, the prediction result is fed back to the previous network construction and feature learning to form a closed loop, thereby, the adjustment of the subspace association network diagram can be realized during use, thereby continuously improving the model accuracy to further improve the accuracy of the analysis of complex systems.

综上所述,本发明具有以下有益效果:In summary, the present invention has the following beneficial effects:

(1)复杂系统多角度子空间抽象化表示;通过构建子空间的抽象化表示,可以从复杂系统中提取出多个关键反映维度;同时,对每一个关键子空间聚焦进行建模,可屏蔽系统的其他复杂成分,从而在降低建模难度的同时,获得多角度的分析视图。(1) Abstract representation of complex systems from multiple perspectives in subspaces. By constructing an abstract representation of subspaces, multiple key reflection dimensions can be extracted from complex systems. At the same time, focusing on modeling each key subspace can shield other complex components of the system, thereby reducing the difficulty of modeling and obtaining a multi-angle analysis view.

(2)引入了序列维度进行动态子空间建模;通过在子空间建模基础上增加时间等序列维度,可以持续追踪每个子空间状态的动态演化轨迹,揭示子空间之间的协同动态演化规律,支持过程优化。(2) The sequence dimension is introduced for dynamic subspace modeling. By adding sequence dimensions such as time to subspace modeling, the dynamic evolution trajectory of each subspace state can be continuously tracked, revealing the laws of coordinated dynamic evolution between subspaces and supporting process optimization.

(3)基于图模型定义子空间间的依赖关系;通过构建基于子空间的关系图谱,明确表示了子空间节点之间的依赖与约束关系,并在此基础上,进行联合建模与推理,从而可分析系统中不同组成部分的相互作用机制。(3) Define the dependency relationship between subspaces based on the graphical model; by constructing a relationship graph based on subspaces, the dependency and constraint relationship between subspace nodes is clearly expressed, and on this basis, joint modeling and reasoning are carried out to analyze the interaction mechanism of different components in the system.

(4)基于构建的子空间依赖图,可以进行因果效应的传播分析,评估某一子空间状态变化将在系统内部引发的连锁反应,实现状态演化的预测与风险评估。(4) Based on the constructed subspace dependency graph, the propagation analysis of causal effects can be carried out to evaluate the chain reaction that a change in a subspace state will cause within the system, thereby achieving the prediction of state evolution and risk assessment.

(5)融合序列建模中顺序依赖性约束,与图模型中拓扑结构依赖性约束,构成一个序依赖与空间共同约束的子空间建模框架,如此,能够增强状态演化预测的准确性。(5) The sequential dependency constraints in sequence modeling and the topological dependency constraints in the graph model are integrated to form a subspace modeling framework with both sequential dependency and spatial constraints. This can enhance the accuracy of state evolution prediction.

(6)本发明支持通过引入更多区域性子空间进行集群组合,如此,可建立跨区域的大尺度联合建模,能够应用于需要局部性细粒度控制的大系统分析与决策优化。(6) The present invention supports cluster combination by introducing more regional subspaces, so that cross-regional large-scale joint modeling can be established, which can be applied to large system analysis and decision optimization that require local fine-grained control.

(7)基于图神经网络实现了子空间的联合表示;本发明利用图神经网络实现子空间序列的联合表示学习,使得模型可以自主提取状态演化的复杂特征表示,无需人工特征工程,扩展了建模的自动化与泛化能力。(7) The joint representation of subspaces is realized based on graph neural networks. The present invention uses graph neural networks to realize the joint representation learning of subspace sequences, so that the model can autonomously extract the complex feature representation of state evolution without the need for manual feature engineering, thereby expanding the automation and generalization capabilities of modeling.

(8)支持从微观单一子空间建模,扩展到宏观对整个子空间集合的统一建模,可实现从局部到全局的系统动态建模与情景预测,如此,可提供理论支撑复杂系统的策略规划与决策优化。(8) It supports the expansion from microscopic single subspace modeling to macroscopic unified modeling of the entire subspace set, and can realize system dynamic modeling and scenario prediction from local to global. In this way, it can provide theoretical support for the strategic planning and decision optimization of complex systems.

在一个可能的设计中,本实施例第二方面提供一种应用实例,即利用实施例第一方面所提供的方法,来进行智慧城市多维协同演化,其过程如下所示。In a possible design, the second aspect of this embodiment provides an application example, that is, using the method provided in the first aspect of the embodiment to perform multi-dimensional collaborative evolution of a smart city, and the process is as follows.

步骤一:子空间构建;从城市的多维异构数据中,选取交通网络、城市热岛、商业区分布、空气质量、人口分布、能源利用等6个关键子空间,以聚焦城市协同可持续发展的分析视角;前述6个关键子空间分别为:交通网络(空间网络模型),对城市交通网络进行建模,包括车辆流量、道路状况、公共交通系统等;城市热岛效应模拟(场模型),使用场模型对城市热岛效应进行建模,考虑气象数据、城市建筑密度等因素;商业区分布(物体模型)利用物体模型表示商业区的位置、规模、类型等信息;空气质量(场模型),利用场模型对城市不同地区的空气质量进行建模,包括颗粒物浓度、氮氧化物等;人口分布(物体模型),利用物体模型表示城市不同区域的人口分布,包括人口密度、居住区域等;能源利用(场模型),建立能源利用模型,考虑城市的能源消耗情况、可再生能源利用等。Step 1: Subspace construction; from the multi-dimensional heterogeneous data of the city, six key subspaces are selected, including transportation network, urban heat island, commercial area distribution, air quality, population distribution, and energy utilization, to focus on the analytical perspective of coordinated and sustainable development of the city; the above six key subspaces are: transportation network (spatial network model), modeling the urban transportation network, including vehicle flow, road conditions, public transportation system, etc.; urban heat island effect simulation (field model), using the field model to model the urban heat island effect, considering factors such as meteorological data and urban building density; commercial area distribution (object model), using the object model to represent the location, scale, type and other information of the commercial area; air quality (field model), using the field model to model the air quality of different areas of the city, including particulate matter concentration, nitrogen oxides, etc.; population distribution (object model), using the object model to represent the population distribution of different areas of the city, including population density, residential area, etc.; energy utilization (field model), establishing an energy utilization model, considering the city’s energy consumption, renewable energy utilization, etc.

在本实施例中,虽然看成6种子空间,但是形式都是空间数据,基本可以用三种空间模型建立物体模型,场模型和空间网络模型表达;详细描述如下:In this embodiment, although it is considered as 6 sub-spaces, the form is spatial data, which can basically be expressed by three spatial models to establish object models, field models and spatial network models; the detailed description is as follows:

物体模型(Object Model):其用于表示空间中的具体对象,包括点、线和面等;这种模型采用几何表达方式,通过描述对象的几何形状、位置和其他相关属性来呈现空间中的实体,例如,在智慧城市建模中,物体模型可以用于表示建筑物、道路或其他地理要素;参见图3所示,图3给出了物体模型的示意图,其中,图3中的左图为右图的抽象化表达。Object Model: It is used to represent specific objects in space, including points, lines, and surfaces. This model uses geometric expression to present entities in space by describing the geometric shape, position, and other related attributes of the object. For example, in smart city modeling, the object model can be used to represent buildings, roads, or other geographic features. See Figure 3, which shows a schematic diagram of the object model, where the left picture in Figure 3 is an abstract expression of the right picture.

场模型(Field Model):其用于表示与位置相关的连续变量或属性,这些属性在空间中呈现出连续的分布;场模型采用函数表达方式,描述空间中各点上的属性值,如温度、湿度、或其他环境因素;在智慧城市的例子中,场模型可用于模拟城市热岛效应,考虑气象数据、建筑密度等因素,其示例图可参见图4所示。Field Model: It is used to represent continuous variables or attributes related to location, which are continuously distributed in space. The field model uses functional expression to describe the attribute values at each point in space, such as temperature, humidity, or other environmental factors. In the example of smart cities, the field model can be used to simulate the urban heat island effect, taking into account factors such as meteorological data and building density. An example diagram can be seen in Figure 4.

空间网络模型(Spatial Network Model):空间网络模型用于表示地理空间中实体间的连接关系及其属性;这种模型采用网络图的形式,描绘实体之间的关联和相互作用;在智慧城市多维数据中,空间网络模型可以用于建模交通网络,包括车辆流量、道路状况等,节点表示实体,边表示它们之间的连接关系,同时可以携带一定的属性信息,其示意图可参见图5所示。Spatial Network Model: The spatial network model is used to represent the connection relationship and attributes between entities in geographic space. This model uses a network diagram to depict the association and interaction between entities. In the multidimensional data of smart cities, the spatial network model can be used to model traffic networks, including vehicle flow, road conditions, etc. Nodes represent entities, and edges represent the connection relationship between them. They can also carry certain attribute information. The schematic diagram can be seen in Figure 5.

在完成子空间的创建后,则可进行子空间序列的构建;如下述步骤二所示。After the subspace is created, the subspace sequence can be constructed, as shown in the following step 2.

步骤二:子空间序列生成;利用时间序列作为序列维度,对上述6个子空间建立时间序列模型,构成子空间的序列表达;选择匹配子空间特性的时间序列模型,比如场模型-时间变化,其表示连续场随时间的演化,意味着可以通过场模型捕捉空间状态随时间的增量变化;又如,物体模型-事件/过程来表示实体时间属性的稳定性,其观察到实体在时序中的事件和过程;再如,选择空间网络模型-时间快照来表达时空网络如时间扩展图和时间聚合图,捕捉到网络连接关系的时空变化。Step 2: Subspace sequence generation: Use time series as the sequence dimension to establish a time series model for the above six subspaces to form a sequence expression of the subspace; select a time series model that matches the characteristics of the subspace, such as the field model-time change, which represents the evolution of the continuous field over time, which means that the incremental changes of the spatial state over time can be captured through the field model; for example, the object model-event/process is used to represent the stability of the entity's temporal attributes, which observes the events and processes of the entity in the time series; for example, the spatial network model-time snapshot is selected to express the spatiotemporal network such as the time expansion graph and the time aggregation graph, which captures the spatiotemporal changes of the network connection relationship.

这里,以六个子空间为例,说明如何将它们与不同的时序模型相匹配,以更好地捕捉多维数据的时空动态变化:(1)交通网络(空间网络模型):使用时间快照模型,对城市交通网络进行建模,这能够在不同时间点采样交通网络的状态,包括车辆流量、道路状况、公共交通系统等的变化;(2)利用时间变化模型,对城市热岛效应进行建模,场模型考虑气象数据、城市建筑密度等因素,而时间变化模型捕捉城市热岛效应随时间的演化;(3)使用事件/过程模型,利用物体模型表示商业区的位置、规模、类型等信息。通过这种方式,我们可以观察到商业区在时序中的事件和过程,如商业区的扩张或变化;(4)利用时间快照模型,对城市不同地区的空气质量进行建模。场模型可以考虑颗粒物浓度、氮氧化物等的分布,而时间快照模型则捕捉空气质量在不同时间点的变化;(5)使用时间变化模型和事件/过程模型,通过物体模型表示城市不同区域的人口分布。这有助于观察人口变化的稳定性和相关事件,如人口密度、居住区域的演变;(6)利用场模型和时间变化模型,建立能源利用模型。考虑城市的能源消耗情况、可再生能源利用等,以捕捉能源利用随时间的演化过程。Here, we take six subspaces as examples to illustrate how to match them with different time series models to better capture the spatiotemporal dynamic changes of multidimensional data: (1) Traffic network (spatial network model): Use the time snapshot model to model the urban traffic network, which can sample the state of the traffic network at different time points, including changes in vehicle flow, road conditions, public transportation systems, etc.; (2) Use the time variation model to model the urban heat island effect. The field model considers factors such as meteorological data and urban building density, while the time variation model captures the evolution of the urban heat island effect over time; (3) Use the event/process model and use the object model to represent the location, scale, type and other information of the commercial district. In this way, we can observe the events and processes of the commercial district in the time series, such as the expansion or change of the commercial district; (4) Use the time snapshot model to model the air quality in different areas of the city. The field model can consider the distribution of particulate matter concentration, nitrogen oxides, etc., while the time snapshot model captures the changes in air quality at different time points; (5) Use the time variation model and the event/process model to represent the population distribution in different areas of the city through the object model. This helps to observe the stability of population changes and related events, such as population density and the evolution of residential areas; (6) Using field models and time-varying models, an energy utilization model is established. Considering the city's energy consumption, renewable energy utilization, etc., to capture the evolution of energy utilization over time.

在完成子空间序列构建后,即可基于此,来进行子空间的序列编号,如下述步骤三所示。After the subspace sequence is constructed, the subspace sequence can be numbered based on it, as shown in the following step three.

步骤三:对偶空间与序列编号;在序列生成过程中,同时构建一维时间序列空间作为上述子空间序列的对偶结构。在该对偶序列空间上,定义子空间状态的编号方法:{ID,时间},为每个时刻的子空间状态赋予唯一标识,以建立子空间与特定时刻的映射关系。例如,{S1,T1}表示时刻T1下ID为S1的交通网络子空间;该编号体系为后续分析子空间间在时间轴上的协同关联提供了基础;可选的,前述交通、热岛、商业分布三个关键子空间进行序列编号后的示意图可依次参见图6-8所示,由于剩余三个关键子空间类型与前述相同,在此不多加赘述。Step 3: Dual space and sequence numbering; During the sequence generation process, a one-dimensional time series space is simultaneously constructed as the dual structure of the above subspace sequence. On this dual sequence space, a subspace state numbering method is defined: {ID, time}, and a unique identifier is given to the subspace state at each moment to establish a mapping relationship between the subspace and a specific moment. For example, {S1, T1} represents the traffic network subspace with ID S1 at moment T1; this numbering system provides a basis for the subsequent analysis of the synergistic association between subspaces on the time axis; optionally, the schematic diagrams of the three key subspaces of traffic, heat island, and commercial distribution after serial numbering can be seen in Figures 6-8 respectively. Since the remaining three key subspace types are the same as those mentioned above, they will not be elaborated here.

在完成各个子空间中子空间实例的序列编号后,则可进行子空间关联网络图的构建,如下述步骤四所示。After completing the serial numbering of the subspace instances in each subspace, the subspace association network diagram can be constructed, as shown in the following step 4.

步骤四:子空间关联网络构建;根据子空间的时序序列号,将其作为网络节点,节点之间的连接关系表示不同子空间间的关联与相互制约。构建反映城市系统复杂关联的网状拓扑结构。Step 4: Subspace association network construction: According to the time sequence number of the subspace, it is used as a network node, and the connection relationship between the nodes represents the association and mutual restriction between different subspaces. Construct a mesh topological structure that reflects the complex association of the urban system.

步骤五:子空间特征学习;即为了处理不同类型的子空间,如空间网络、场和物体模型等,本实施例可以使用一种统一的表征方式,例如使用图神经网络(Graph NeuralNetwork,GNN),GNN是一类专门用于处理图结构数据的深度学习模型,能够有效捕捉空间关系、动态演化和多维特征。Step 5: Subspace feature learning; that is, in order to process different types of subspaces, such as spatial networks, fields and object models, this embodiment can use a unified representation method, such as using a graph neural network (GNN). GNN is a type of deep learning model specifically used to process graph structured data, which can effectively capture spatial relationships, dynamic evolution and multi-dimensional features.

对于每个子空间,我们可以将其表示为一个图;在空间网络模型中,每个交通节点可以表示为一个节点,其中包含节点的几何信息(坐标)以及与该节点相关的其他属性,如交通流量等;路连接可以用图中的边表示,每条边连接两个节点,表示两个交叉路口或交通节点之间的道路;在场模型中,每个栅格(raster)可以表示为一个节点,包含栅格的位置信息和场值(例如,温度、空气质量等),而边可以表示栅格之间的空间关系,例如相邻栅格之间的连接;如此,这样的边可以反映场在空间上的变化;同理,在物体模型中,每个空间对象(如商业区、人口密度高的区域等)可以表示为一个节点,包含对象的几何信息,而边可以表示对象之间的空间关系,例如相邻区域之间的连接;如此,这种统一的表征方式,可便于网络的建模,其对应的子空间表征示例图可以但不限于如图9所示。For each subspace, we can represent it as a graph; in the spatial network model, each traffic node can be represented as a node, which contains the node's geometric information (coordinates) and other attributes related to the node, such as traffic flow, etc.; road connections can be represented by edges in the graph, each edge connects two nodes, representing the road between two intersections or traffic nodes; in the field model, each raster can be represented as a node, containing the raster's location information and field values (for example, temperature, air quality, etc.), and the edges can represent the spatial relationship between rasters, such as the connection between adjacent rasters; in this way, such edges can reflect the changes in the field in space; similarly, in the object model, each spatial object (such as a commercial area, an area with high population density, etc.) can be represented as a node, containing the object's geometric information, and the edges can represent the spatial relationship between objects, such as the connection between adjacent areas; in this way, this unified representation method can facilitate network modeling, and its corresponding subspace representation example diagram can be, but is not limited to, as shown in Figure 9.

在完成特征学习后,即可进行数据挖掘,如下述步骤六所示。After feature learning is completed, data mining can be performed, as shown in step six below.

步骤六:拓扑关系与全局模式挖掘;对于整个网络的构建,在第五步已经将所有子空间都表征为图,那么在这一步可以构建一个大图(即全局图表示Global GraphRepresentation),其中18个子空间是节点,边代表某种关联;构建这个大图的方式可以考虑以下两种:Step 6: Topological relations and global pattern mining: For the construction of the entire network, all subspaces have been represented as graphs in the fifth step. In this step, a large graph (i.e., global graph representation) can be constructed, in which 18 subspaces are nodes and edges represent certain relationships. There are two ways to construct this large graph:

整体图跨时间跨子空间:The overall picture spans time and subspace:

全连接图:最简单的方式是构建一个全连接图,其中每个子空间都与其他所有子空间有边相连。这样的图表示所有子空间之间都存在关联,但可能过于密集。Fully connected graph: The simplest way is to construct a fully connected graph, where each subspace is connected to all other subspaces by edges. Such a graph indicates that all subspaces are connected, but it may be too dense.

基于关联强度/子空间相似性的连接:使用图神经网络,如GraphSAGE、GCN、GAT等,对子空间进行表示学习。在这个过程中,模型会学到子空间之间的关联和相似性。为图中的每条边计算权重。这些权重可以综合考虑关联强度和相似性;可以基于关联强度/相似性设置一个阈值,将高于阈值的关联/相似性保留,低于阈值的关联/相似性舍弃;由此通过前述设计,就可以构建一个更稀疏的网络,更强调关键的关联/相似性。Connection based on association strength/subspace similarity: Use graph neural networks, such as GraphSAGE, GCN, GAT, etc., to learn representations of subspaces. In this process, the model learns the associations and similarities between subspaces. Calculate weights for each edge in the graph. These weights can take into account association strength and similarity; a threshold can be set based on association strength/similarity, and associations/similarity above the threshold will be retained, and associations/similarity below the threshold will be discarded; thus, through the above design, a sparser network can be constructed, which emphasizes key associations/similarity.

力导向布局:在节点链接图中应用力导向布局,以更清晰地展示拓扑结构。力导向布局考虑节点之间的吸引和排斥力,使得高度相互关联的节点彼此靠近,不相关的节点分开;其中,整体图力导向布局的示例图可参见图10所示。Force-directed layout: Force-directed layout is applied in the node-link graph to show the topological structure more clearly. Force-directed layout considers the attraction and repulsion between nodes, so that highly interconnected nodes are close to each other and unrelated nodes are separated; an example diagram of the overall graph force-directed layout can be seen in Figure 10.

分步时序图:Step-by-step timing diagram:

时序建模:其中每个时刻的6个子空间集合构成一个切片(可以是向量或图表征数据),输入到序列模型中,例如循环神经网络(RNN)或长短时记忆网络(LSTM),以学习全局状态的时序演化。Time series modeling: The set of 6 subspaces at each moment constitutes a slice (which can be a vector or graph representation data), which is input into a sequence model, such as a recurrent neural network (RNN) or a long short-term memory network (LSTM), to learn the temporal evolution of the global state.

全局状态的演化:通过序列模型学到的全局状态表征可以用于理解整体系统在时序上的变化和演化;其中,分步时序图的示例图可以参见图11所示。Evolution of global state: The global state representation learned by the sequence model can be used to understand the changes and evolution of the overall system in time sequence. An example of a step-by-step timing diagram can be seen in Figure 11.

在本实施例中,可以对两者进行结合,即:先构建分布时序图,理解基本规律;然后再构建全局动态图,发现深层拓扑知识;如此,挖掘到的拓扑关系和全局模式可以为局部状态预测提供上下文信息,而通过了解整个系统的全局演化规律,可以更准确地预测特定子空间在未来时刻的状态;同时,挖掘到的拓扑关系可能暗示了一些关键的节点或子空间,它们在演化中可能发挥重要作用。In this embodiment, the two can be combined, that is, first construct a distribution time series diagram to understand the basic rules; then construct a global dynamic diagram to discover deep topological knowledge; in this way, the mined topological relationships and global patterns can provide contextual information for local state prediction, and by understanding the global evolution laws of the entire system, the state of a specific subspace at a future moment can be more accurately predicted; at the same time, the mined topological relationships may imply some key nodes or subspaces, which may play an important role in the evolution.

在完成子空间关联网络图的挖掘后,则可进行系统分析,如下述步骤七所示。After completing the mining of the subspace association network graph, system analysis can be performed, as shown in step seven below.

步骤七:局部状态预测与演化分析旨在通过局部状态的预测和演化分析,深入理解系统中各个子空间在不同时刻的演化规律,并评估系统在局部动态变化方面的稳健性,其具体描述如下:Step 7: Local state prediction and evolution analysis aims to deeply understand the evolution laws of each subspace in the system at different times through the prediction and evolution analysis of local states, and evaluate the robustness of the system in terms of local dynamic changes. The specific description is as follows:

局部状态预测:利用之前步骤构建的子空间关联网络和全局模式挖掘的知识,采用合适的模型或算法对每个子空间的局部状态进行预测;这一预测过程可以基于历史数据、全局状态演化趋势以及其他相关信息。Local state prediction: Using the knowledge of subspace association network and global pattern mining constructed in the previous steps, a suitable model or algorithm is used to predict the local state of each subspace; this prediction process can be based on historical data, global state evolution trends, and other relevant information.

网络拓扑结构的约束下:在预测过程中,考虑到子空间关联网络的拓扑结构对系统动态的影响。即在网络拓扑结构的约束下,预测每个子空间的局部状态变化情况。Under the constraints of network topology: In the prediction process, the influence of the topological structure of the subspace associated network on the system dynamics is taken into account. That is, under the constraints of the network topology, the local state changes of each subspace are predicted.

不同情形下的演化关系分析:分析在不同情形下,每个子空间的局部状态是如何演化的。这可以包括在系统扰动或变化条件下的局部动态响应。通过这一分析,可以识别系统中的关键子空间,以及它们在不同情境下的变化规律。Evolutionary relationship analysis under different circumstances: Analyze how the local state of each subspace evolves under different circumstances. This can include local dynamic responses under system disturbances or changes. Through this analysis, key subspaces in the system can be identified, as well as their changing patterns under different circumstances.

网络扰动对系统的影响传播:评估网络扰动对系统局部状态的影响传播。通过局部状态的预测和演化分析,可以揭示网络扰动如何在系统中传播,影响相关子空间的演化过程。Propagation of the impact of network disturbances on the system: Evaluate the propagation of the impact of network disturbances on the local state of the system. Through the prediction and evolution analysis of local states, it can reveal how network disturbances propagate in the system and affect the evolution of related subspaces.

迭代预测与传播路径动态性更新:在整个闭环分析框架下(见步骤八),六个子空间的状态可以通过迭代预测相互影响;每次进行预测时,更新子空间的状态,并将这些更新的状态信息传播到关联的子空间中。动态性更新传播路径,使得路径的权重和关联强度能够随着时间和预测的变化而调整,更好地反映子空间之间的动态关系;其中,状态预测与影响传播的示例图可参见图12所示。Iterative prediction and dynamic update of propagation paths: In the entire closed-loop analysis framework (see step 8), the states of the six subspaces can influence each other through iterative prediction; each time a prediction is made, the state of the subspace is updated, and the updated state information is propagated to the associated subspace. The propagation path is dynamically updated so that the weight and association strength of the path can be adjusted with time and changes in prediction, better reflecting the dynamic relationship between subspaces; an example diagram of state prediction and influence propagation can be shown in Figure 12.

实际场景中,在进行局部状态预测时,不仅考虑了子空间关联网络的拓扑约束,还结合了更加复杂的交通流量、商业活动、气候环境等因素,分析它们与交通网络子空间的协同演化。这使得交通流预测更加准确。In actual scenarios, when making local state predictions, not only the topological constraints of the subspace association network are considered, but also more complex factors such as traffic flow, commercial activities, and climate environment are combined to analyze their co-evolution with the traffic network subspace. This makes traffic flow predictions more accurate.

在不同情形下的演化分析中,详细考量了高峰时段的交通压力如何影响城市的空气质量和能源利用,以及引发更严重的热岛效应;这种复杂关联的建模,可以帮助我们评估交通网络异常对整个城市系统的层层传播影响。In the evolutionary analysis under different scenarios, we considered in detail how traffic pressure during peak hours affects the city's air quality and energy utilization, and how it causes a more serious urban heat island effect; this complex correlation modeling can help us evaluate the layered impact of traffic network anomalies on the entire urban system.

而在评估网络扰动传播时,比如道路修缮或交通事故造成的交通中断,我们不仅分析了其对交通网络自身的影响,还显式地挖掘了这种扰动如何通过交通-空气质量-商业活动多个关联子空间发生传播,对更大范围的城市系统造成冲击。When evaluating the propagation of network disturbances, such as traffic disruptions caused by road repairs or traffic accidents, we not only analyze their impact on the transportation network itself, but also explicitly explore how such disturbances propagate through multiple related subspaces of traffic-air quality-commercial activities, causing impacts on the larger urban system.

在迭代预测和更新中,我们也考虑了更多城市规划和人口分布等社会经济因素,与其他子空间形成的深层次动态关联。这样有助于我们从整体上提出更加智慧的城市调控建议,实现子空间的协同优化。In iterative prediction and updating, we also consider more socio-economic factors such as urban planning and population distribution, and the deep dynamic connections with other subspaces. This helps us to make more intelligent urban regulation suggestions as a whole and achieve coordinated optimization of subspaces.

步骤八:反馈学习在整个过程中,反馈学习是一个关键步骤,通过实时理解系统的动态演化,不断优化模型和预测。以下是反馈学习的示例:Step 8: Feedback Learning Feedback learning is a key step in the whole process, which continuously optimizes the model and prediction by understanding the dynamic evolution of the system in real time. The following is an example of feedback learning:

在前面步骤中,基于子空间之间的关联网络,对交通网络子空间的未来状态进行了预测;假设预测结果与实际出现的交通状况有一定偏差,则在反馈学习中,可以采取以下措施:In the previous step, the future state of the traffic network subspace was predicted based on the association network between the subspaces; assuming that the prediction results have a certain deviation from the actual traffic conditions, the following measures can be taken in feedback learning:

分析交通网络预测的偏差来源,是否其他子空间对其有重要影响;例如城市热岛效应导致交通状况异常恶化;然后根据分析,调整子空间之间的关联网络结构;增加热岛效应和交通网络之间的关联程度;在调整后的网络上,重新进行子空间特征学习,利用热岛数据增强交通网络的状态表达;而后,基于更新的子空间表达式,重新进行交通网络的未来状态预测;最后,验证改进的预测结果,并再重复这个反馈优化的过程,以逐步提升关联网络的表示能力,以及各维子空间的状态预测准确度。Analyze the source of deviation in traffic network prediction and whether other subspaces have an important impact on it; for example, the urban heat island effect causes abnormal deterioration of traffic conditions; then adjust the association network structure between subspaces based on the analysis; increase the degree of association between the heat island effect and the traffic network; re-learn subspace features on the adjusted network and use heat island data to enhance the state expression of the traffic network; then, based on the updated subspace expression, re-predict the future state of the traffic network; finally, verify the improved prediction results and repeat this feedback optimization process to gradually improve the representation ability of the association network and the state prediction accuracy of each dimensional subspace.

在一个可能的设计中,本实施例第三方面提供另一种应用实例,其构建了一个多维动态生命周期评估关联分析模型(Multidimensional Dynamic LCA CorrelationAnalysis Model,MDLCCAM),实现了对产品和过程的整体生命周期绩效进行动态、非线性和关联性建模。In a possible design, the third aspect of this embodiment provides another application example, which constructs a multidimensional dynamic life cycle assessment correlation analysis model (Multidimensional Dynamic LCA Correlation Analysis Model, MDLCCAM), which realizes dynamic, nonlinear and correlation modeling of the overall life cycle performance of products and processes.

MDLCCAM的技术核心是高维子空间抽象表示和关联网络建模;首先,通过定义反映系统多样性的子空间,构建高维状态空间;然后,子空间之间依托丰富的关联,形成复杂的网络拓扑结构;子空间序列中内置时间和事件标识,可对状态演化过程进行动态跟踪。The technical core of MDLCCAM is high-dimensional subspace abstract representation and association network modeling. First, a high-dimensional state space is constructed by defining subspaces that reflect the diversity of the system. Then, the subspaces form a complex network topology based on rich associations. The subspace sequence has built-in time and event identifiers, which can dynamically track the state evolution process.

基于子空间网络,可使用机器学习技术建模其中蕴含的非线性特征关系,并通过关联规则挖掘发现潜在影响机制;在此高维动态网络基础上,MDLCCAM可支持环境管理的在线决策,实现异常检测、路径构建、系统预测、过程优化等功能。通过动态、非线性和关联性建模,它拓展和增强了传统LCA分析,为可持续发展提供了有力的技术支撑。Based on the subspace network, machine learning technology can be used to model the nonlinear characteristic relationships contained therein, and potential influencing mechanisms can be discovered through association rule mining; based on this high-dimensional dynamic network, MDLCCAM can support online decision-making in environmental management and realize functions such as anomaly detection, path construction, system prediction, and process optimization. Through dynamic, nonlinear, and correlation modeling, it expands and enhances traditional LCA analysis and provides strong technical support for sustainable development.

在本实施例中,其构建步骤与前述实施例第二方面一致;其中,在关键维度选择上,可选择环境影响指标(反映对环境的潜在影响);资源利用指标(反映资源使用情况);排放指标(评估污染程度);经济指标(反映经济性影响);社会影响指标(反映社会效应);子空间的每个具体维度对应一个LCA计算指标;即子空间是一个包含所有LCA核心指标的综合空间。In this embodiment, the construction steps are consistent with the second aspect of the aforementioned embodiment; among them, in terms of the selection of key dimensions, environmental impact indicators (reflecting the potential impact on the environment); resource utilization indicators (reflecting resource usage); emission indicators (assessing the degree of pollution); economic indicators (reflecting economic impact); social impact indicators (reflecting social effects); each specific dimension of the subspace corresponds to an LCA calculation indicator; that is, the subspace is a comprehensive space that includes all LCA core indicators.

在子空间序列构建步骤中:本模型通过引入时间序列和事件序列两个维度构建二维动态序列结构,反映子空间状态的演化网络;时间序列以统一基准描述子空间状态随时间的变迁;事件序列代表子空间在不同条件驱动下的演化路径;如此,二维序列的每个点对应子空间在该时刻该事件下的一个状态,最后可构成一个二维动态序列网络。通过分析这个网络,可以解读多条件下子空间状态变化的动态规律,即产品或过程的环境影响评估结果在不同情形下的演化机制。In the subspace sequence construction step: this model constructs a two-dimensional dynamic sequence structure by introducing two dimensions, time series and event series, to reflect the evolution network of subspace states; the time series describes the changes of subspace states over time with a unified benchmark; the event sequence represents the evolution path of the subspace driven by different conditions; in this way, each point of the two-dimensional sequence corresponds to a state of the subspace under the event at that moment, and finally a two-dimensional dynamic sequence network can be formed. By analyzing this network, the dynamic laws of subspace state changes under multiple conditions can be interpreted, that is, the evolution mechanism of the environmental impact assessment results of products or processes under different circumstances.

在本模型构建的二维动态序列网络中,每个节点对应的子空间状态需要进行唯一标识和编号,为此,引入子空间序列的对偶空间概念;其中,序列编号的一般表达式为:{ID,(时间,事件)},例如:{S1,(T0,E1)}表示子空间S1在时间T0和事件条件E1下的对应的网络节点;该编号与序列网络节点形成一一映射关系;序列编号使得网络构建完成后,可通过编号快速搜索和索引节点,读取所承载的子空间及其状态值信息。这为开展网络分析和状态演化规律挖掘提供了重要支撑。In the two-dimensional dynamic sequence network constructed by this model, the subspace state corresponding to each node needs to be uniquely identified and numbered. For this purpose, the concept of dual space of subspace sequence is introduced; the general expression of sequence number is: {ID, (time, event)}, for example: {S1, (T0, E1)} represents the corresponding network node of subspace S1 under time T0 and event condition E1; this number forms a one-to-one mapping relationship with the sequence network node; the sequence numbering enables the node to be quickly searched and indexed by number after the network is built, and the subspace and its state value information carried can be read. This provides important support for network analysis and state evolution law mining.

在子空间关联网络图构建过程中,本实施例将多条事件驱动情景的子空间演化路径集成在一张图上,从而实现对子空间指标随时间和事件演化的全面分析;每个节点都具有以下编号格式:{Sn,(Tx,Ey)},其中Sn表示子空间,Tx表示时间点,Ey表示事件驱动条件;其中,横向集成不同情景下的子空间演化路径(如X轴表示时间,可将同一情景下的子空间演化路径沿时间轴集成,以揭示同一情景下子空间随时间的演化规律;这有助于识别在相同条件下子空间性能的动态趋势,因为子空间中的维度反映的指标具有时间演化性需求,即需要观察某一过程在时间流上指标值的动态变化情况);而纵向比较不同情景下同一时间点的子空间演化差异(如Y轴表示不同情景/事件驱动条件,将同一时间点的子空间演化路径在纵向上进行比较。这有助于识别在特定时间点不同情景下子空间之间的性能差异);由此,这样构建的二维网络综合模拟了这个双重演化机制,既包含子空间自身随时间的状态流变,也反映了时间切面上在各类事件下的相对差异比较。In the process of constructing the subspace association network graph, this embodiment integrates the subspace evolution paths of multiple event-driven scenarios into one graph, thereby realizing a comprehensive analysis of the evolution of subspace indicators over time and events; each node has the following numbering format: {Sn, (Tx, Ey)}, where Sn represents the subspace, Tx represents the time point, and Ey represents the event-driven condition; wherein, the subspace evolution paths under different scenarios are horizontally integrated (e.g., the X-axis represents time, and the subspace evolution paths under the same scenario can be integrated along the time axis to reveal the evolution law of the subspace under the same scenario over time; this helps to identify the differences in subspace performance under the same conditions. Dynamic trend, because the indicators reflected by the dimensions in the subspace have time evolution requirements, that is, it is necessary to observe the dynamic changes of the indicator values of a certain process in the time flow); and longitudinally compare the evolution differences of the subspace at the same time point under different scenarios (such as the Y-axis represents different scenarios/event driving conditions, and the subspace evolution paths at the same time point are compared longitudinally. This helps to identify the performance differences between subspaces under different scenarios at a specific time point); thus, the two-dimensional network constructed in this way comprehensively simulates this dual evolution mechanism, which includes both the state changes of the subspace itself over time and the relative differences in various events on the time section.

对于子空间特征学习:由于步骤一构建的子空间是一个集成反映产品或过程整体生命周期表现的单一实体,其涵盖了环境、资源、排放等多个维度的LCA指标;在步骤四构建的动态子空间网络中,每个节点都承载了对这些多维LCA指标的映射,从而形成了某一状态下子空间的特征;如此,当子空间数据充足时,首选采用深度学习等方式,以自动学习子空间内在特征的表达方式,从而建模其复杂的多维指标结构和非线性关系。在面临数据不足限制时,可以使用统计学习等替代方法进行特征工程,作为图神经网络方法的补充,使模型对子空间表示的学习更加可靠稳定。For subspace feature learning: Since the subspace constructed in step 1 is a single entity that integrates the overall life cycle performance of a product or process, it covers LCA indicators in multiple dimensions such as environment, resources, and emissions; in the dynamic subspace network constructed in step 4, each node carries the mapping of these multidimensional LCA indicators, thus forming the characteristics of the subspace under a certain state; thus, when the subspace data is sufficient, deep learning and other methods are preferred to automatically learn the expression of the intrinsic characteristics of the subspace, so as to model its complex multidimensional indicator structure and nonlinear relationship. When faced with insufficient data constraints, alternative methods such as statistical learning can be used for feature engineering as a supplement to the graph neural network method, making the model's learning of subspace representation more reliable and stable.

在完成子空间特征学习后,则可进行步骤六(即拓扑关系与全局模式挖掘);其中,可在构建的二维动态子空间网络基础上,可以通过关联分析、关系推理等技术挖掘节点间的内在拓扑关系,并在挖掘过程中,引入边的权重指标,以量化节点之间关联的强度;具体而言,关联分析可以根据子空间之间的相似性或各评估指标的关联程度,为边赋予权重,反映节点间耦合强度;更相似或相关性更强的子空间会获得更高的权重。如果网络表达指标的协同演化,边权重表示演化一致性,且子空间跨多个维度,可以综合每个维度的权重贡献;而对于时序网络,还可以根据子空间在时间轴上变化趋势的一致性确定权重。当关联难以量化时,也可以引入专家评估对边权重进行赋值。After completing the subspace feature learning, step six (i.e., topological relationship and global pattern mining) can be carried out; among them, on the basis of the constructed two-dimensional dynamic subspace network, the intrinsic topological relationship between nodes can be mined through association analysis, relationship reasoning and other technologies, and the weight index of the edge can be introduced in the mining process to quantify the strength of the association between nodes; specifically, the association analysis can assign weights to the edges according to the similarity between the subspaces or the degree of association of each evaluation indicator, reflecting the coupling strength between nodes; subspaces with more similarity or stronger correlation will obtain higher weights. If the network expresses the co-evolution of indicators, the edge weight represents the evolution consistency, and the subspace spans multiple dimensions, the weight contribution of each dimension can be integrated; and for time series networks, the weight can also be determined according to the consistency of the subspace change trend on the time axis. When the association is difficult to quantify, expert evaluation can also be introduced to assign edge weights.

设网络中子空间状态节点{Sn,(Tx,Ey)}简化记为Si,边为e(Si,Sj)。则边权重可以表示为:相似性权重:w_s im(e(Si,Sj))=s im(Si,Sj);关联权重:w_corr(e(Si,Sj))=corr(Si,Sj);协同演化权重:w_coe(e(Si,Sj))=corr(de lta(Si),de lta(Sj))(de lta为时间差分);维度综合权重:w(e(Si,Sj))=αw1+βw2+...(α,β为维度权重系数);专家评估权重:w_exp(e(Si,Sj))=E(re l(Si,Sj))(E为专家评估函数,re l是定性的关联类型,可能得取值包括:强正相关,正相关,独立,负相关,强负相关)。Assume that the subspace state node {Sn, (Tx, Ey)} in the network is simplified to Si, and the edge is e(Si, Sj). Then the edge weight can be expressed as: similarity weight: w_sim(e(Si, Sj)) = sim(Si, Sj); association weight: w_corr(e(Si, Sj)) = corr(Si, Sj); co-evolution weight: w_coe(e(Si, Sj)) = corr(delta(Si), delta(Sj))(delta is the time difference); dimension comprehensive weight: w(e(Si, Sj)) = αw1+βw2+...(α, β are dimension weight coefficients); expert evaluation weight: w_exp(e(Si, Sj)) = E(re l(Si, Sj))(E is the expert evaluation function, re l is the qualitative association type, and the possible values include: strong positive correlation, positive correlation, independence, negative correlation, and strong negative correlation).

通过加权的关联网络,可以构建反映节点真实绑定关系的网络拓扑结构;节点之间的关联强度、网络密度以及网络内外部结构都可能发生变化,这样,初始的网格状拓扑结构逐渐演化为具有复杂不规则连接的网络;而引入权重增强了对指标内在关联的量化刻画,在这样的加权网络上,可依托边权重采用模式识别和集群分析技术,发现网络中的全局模式结构;因此,所识别出的网络模式能准确反映产品工艺的评估指标之间的耦合机制及其协调演化特征;相比传统二值网络,加权网络使得模式和机制抽象更加精细和解释性,它提供了立体视角去深入理解和优化产品工艺的生命周期绩效。复杂性网络的生成为状态预测与过程优化决策奠定了信息基础。这为后续网络的状态预测与过程优化决策奠定了信息基础。Through the weighted association network, a network topology that reflects the real binding relationship of nodes can be constructed; the association strength between nodes, network density, and internal and external structures of the network may change, so that the initial grid-like topology gradually evolves into a network with complex irregular connections; the introduction of weights enhances the quantitative characterization of the intrinsic association of indicators. On such a weighted network, pattern recognition and cluster analysis techniques can be used based on edge weights to discover the global pattern structure in the network; therefore, the identified network pattern can accurately reflect the coupling mechanism between the evaluation indicators of the product process and its coordinated evolution characteristics; compared with the traditional binary network, the weighted network makes the pattern and mechanism abstraction more refined and explanatory, and it provides a three-dimensional perspective to deeply understand and optimize the life cycle performance of the product process. The generation of complex networks lays an information foundation for state prediction and process optimization decisions. This lays an information foundation for the state prediction and process optimization decisions of the subsequent network.

在完成数据挖掘后,即可进行局部状态预测与演化分析,即全局模式反映了整个产品或工艺的多指标协同演化的主要特征类型及其动态演化机制;这为局部状态的定义和分析提供了更全局的背景,有助于更有针对性地进行局部状态预测;其中,网络拓扑结构的改变引入了新的路径或关系,这可能揭示了原先未考虑到的子空间之间的联系。这些新路径可能会对局部状态的预测产生影响,因为新的关联可能导致新的局部状态的形成或影响现有状态的演化;具体来说,可分为节点距离调整以及新路径的出现。After completing data mining, local state prediction and evolution analysis can be performed, that is, the global pattern reflects the main characteristic types and dynamic evolution mechanisms of the multi-index co-evolution of the entire product or process; this provides a more global background for the definition and analysis of local states, and helps to make more targeted local state predictions; among them, the change in network topology introduces new paths or relationships, which may reveal the connection between subspaces that were not originally considered. These new paths may have an impact on the prediction of local states, because new associations may lead to the formation of new local states or affect the evolution of existing states; specifically, it can be divided into node distance adjustment and the emergence of new paths.

在实际场景中,以钢铁产业生命周期为例,有两个相关事件序列:“市场钢铁产量调整”和“企业间碳排放量交易”;在“市场钢铁产量调整”事件序列中,预测钢铁行业的总体“温室气体排放量”评估节点会出现一个未来的新状态X;为解释状态X的形成机理,我们构建了一条新路径,将其与第二个事件序列“企业间碳排放量交易”下的一个相关历史状态节点Y关联起来;这反映了外生的碳交易政策将通过内生的钢铁企业之间的排放配额调整机制,影响行业未来的温室气体总排放。这种跨事件序列的新路径识别,使得模型可以更准确地预测碳交易政策对钢铁业排放趋势的影响。In the actual scenario, taking the life cycle of the steel industry as an example, there are two related event sequences: "market steel production adjustment" and "inter-enterprise carbon emissions trading"; in the "market steel production adjustment" event sequence, it is predicted that a new future state X will appear in the overall "greenhouse gas emissions" assessment node of the steel industry; to explain the formation mechanism of state X, we constructed a new path to associate it with a related historical state node Y under the second event sequence "inter-enterprise carbon emissions trading"; this reflects that the exogenous carbon trading policy will affect the industry's future total greenhouse gas emissions through the endogenous emission quota adjustment mechanism between steel companies. This new path identification across event sequences allows the model to more accurately predict the impact of carbon trading policies on the emission trend of the steel industry.

在完成局部状态预测与演化分析后,则可进行反馈学习,其原理与前述举例相同,于此不再赘述。After completing the local state prediction and evolution analysis, feedback learning can be performed. The principle is the same as the above example and will not be repeated here.

如图13所示,本实施例第四方面提供了一种实现实施例第一方面中所述的基于子空间序列的多维数据建模与分析方法的硬件系统,包括:As shown in FIG. 13 , the fourth aspect of this embodiment provides a hardware system for implementing the multidimensional data modeling and analysis method based on subspace sequence described in the first aspect of the embodiment, including:

获取单元,用于获取待分析目标的数据集合,其中,所述数据集合中包括有待分析目标的多个第一维度的数据,其中,所述待分析目标包括目标城市,且第一维度用于表征数据种类;An acquisition unit, configured to acquire a data set of a target to be analyzed, wherein the data set includes data of a plurality of first dimensions of the target to be analyzed, wherein the target to be analyzed includes a target city, and the first dimension is used to characterize a data type;

子空间构建单元,用于从所述数据集合中选取出若干指定第一维度的数据,并基于各个指定第一维度的数据,构建出每个指定第一维度的数据对应的子空间;A subspace construction unit, configured to select a plurality of data of a specified first dimension from the data set, and construct a subspace corresponding to each data of the specified first dimension based on each data of the specified first dimension;

子空间序列单元,用于构建出所有子空间共同的子空间序列,其中,所述子空间序列用于表征数据的逻辑顺序;A subspace sequence unit, used to construct a subspace sequence common to all subspaces, wherein the subspace sequence is used to represent the logical order of data;

映射单元,用于基于所述子空间序列,对所述子空间序列与各个子空间进行对偶关系映射处理,以为各个子空间中的每个子空间实例建立序列编号,其中,任一子空间中的子空间实例用于表征该任一子空间中的一个数据;A mapping unit, configured to perform dual relationship mapping processing on the subspace sequence and each subspace based on the subspace sequence, so as to establish a sequence number for each subspace instance in each subspace, wherein the subspace instance in any subspace is used to represent a data in the any subspace;

子空间关联网络图构建单元,用于利用建立了序列编号的各个子空间,构建出包含有所有子空间的子空间关联网络图,其中,所述子空间关联网络图表示为G={V,E,g},V为节点集合,E为节点边集合,g为图属性集合,所述节点集合中的各节点为各子空间中的子空间实例,且任一节点使用对应子空间实例的序列编号表示;A subspace association network graph construction unit is used to construct a subspace association network graph containing all subspaces by using each subspace with established serial numbers, wherein the subspace association network graph is represented by G={V,E,g}, V is a node set, E is a node edge set, g is a graph attribute set, each node in the node set is a subspace instance in each subspace, and any node is represented by a serial number corresponding to the subspace instance;

数据挖掘单元,用于对所述子空间关联网络图进行数据挖掘处理,得到更新后的子空间关联网络图;A data mining unit, used for performing data mining processing on the subspace association network graph to obtain an updated subspace association network graph;

分析单元,用于利用更新后的子空间关联网络图,得出所述待分析目标的分析结果,其中,所述分析结果包括所述目标城市的城市状态预测结果,且所述城市状态预测结果包括交通流量预测结果、空气质量预测结果和/或人流分布预测结果。An analysis unit is used to use the updated subspace association network diagram to obtain analysis results of the target to be analyzed, wherein the analysis results include the city status prediction results of the target city, and the city status prediction results include traffic flow prediction results, air quality prediction results and/or crowd distribution prediction results.

本实施例提供的装置的工作过程、工作细节和技术效果,可以参见实施例第一方面,于此不再赘述。The working process, working details and technical effects of the device provided in this embodiment can be found in the first aspect of the embodiment and will not be described in detail here.

本实施例第五方面提供了一种基于子空间序列的多维数据建模与分析装置,以装置为电子设备为例,包括:依次通信相连的存储器、处理器和收发器,其中,所述存储器用于存储计算机程序,所述收发器用于收发消息,所述处理器用于读取所述计算机程序,执行如实施例第一方面所述的基于子空间序列的多维数据建模与分析方法。The fifth aspect of this embodiment provides a multidimensional data modeling and analysis device based on subspace sequences. Taking the device as an electronic device as an example, it includes: a memory, a processor and a transceiver that are communicatively connected in sequence, wherein the memory is used to store computer programs, the transceiver is used to send and receive messages, and the processor is used to read the computer program to execute the multidimensional data modeling and analysis method based on subspace sequences as described in the first aspect of the embodiment.

本实施例提供的电子设备的工作过程、工作细节和技术效果,可以参见实施例第一方面,于此不再赘述。The working process, working details and technical effects of the electronic device provided in this embodiment can be found in the first aspect of the embodiment and will not be described in detail here.

本实施例第六方面提供了一种存储包含有实施例第一方面所述的基于子空间序列的多维数据建模与分析方法的指令的存储介质,即所述存储介质上存储有指令,当所述指令在计算机上运行时,执行如实施例第一方面所述的基于子空间序列的多维数据建模与分析方法。The sixth aspect of this embodiment provides a storage medium that stores instructions for the multidimensional data modeling and analysis method based on subspace sequences as described in the first aspect of the embodiment, that is, the storage medium stores instructions, and when the instructions are run on a computer, the multidimensional data modeling and analysis method based on subspace sequences as described in the first aspect of the embodiment is executed.

本实施例提供的存储介质的工作过程、工作细节和技术效果,可以参见实施例第一方面,于此不再赘述。The working process, working details and technical effects of the storage medium provided in this embodiment can be found in the first aspect of the embodiment and will not be described in detail here.

本实施例第七方面提供了一种包含指令的计算机程序产品,当所述指令在计算机上运行时,使所述计算机执行如实施例第一方面所述的基于子空间序列的多维数据建模与分析方法。A seventh aspect of the present embodiment provides a computer program product comprising instructions, which, when executed on a computer, enables the computer to execute the multidimensional data modeling and analysis method based on subspace sequences as described in the first aspect of the embodiment.

最后应说明的是:以上所述仅为本发明的优选实施例而已,并不用于限制本发明的保护范围。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。Finally, it should be noted that the above description is only a preferred embodiment of the present invention and is not intended to limit the protection scope of the present invention. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included in the protection scope of the present invention.

Claims (10)

1. A multi-dimensional data modeling and analysis method based on subspace sequences, comprising:
acquiring a data set of an object to be analyzed, wherein the data set comprises a plurality of first dimension data of the object to be analyzed, the object to be analyzed comprises a target city, and the first dimension is used for representing a data type;
Selecting a plurality of pieces of data with specified first dimensions from the data set, and constructing subspaces corresponding to the pieces of data with the specified first dimensions based on the pieces of data with the specified first dimensions;
Constructing a subspace sequence common to all subspaces, wherein the subspace sequence is used for representing the logic sequence of data;
performing dual relation mapping processing on the subspace sequence and each subspace based on the subspace sequence to establish a sequence number for each subspace instance in each subspace, wherein the subspace instance in any subspace is used for representing one piece of data in the any subspace;
Constructing a subspace association network diagram containing all subspaces by using each subspace with established sequence numbers, wherein the subspace association network diagram is expressed as G= { V, E, G }, V is a node set, E is a node edge set, G is a diagram attribute set, each node in the node set is a subspace instance in each subspace, and any node is expressed by using the sequence number of the corresponding subspace instance;
performing data mining processing on the subspace association network map to obtain an updated subspace association network map;
And obtaining an analysis result of the target to be analyzed by using the updated subspace association network diagram, wherein the analysis result comprises a city state prediction result of the target city, and the city state prediction result comprises a traffic flow prediction result, an air quality prediction result and/or a people flow distribution prediction result.
2. The method of claim 1, wherein the data set further comprises data having a plurality of second dimensions, wherein the second dimensions are used to characterize data logical dimensions, and wherein the data logical dimensions comprise a time dimension and/or an event dimension;
wherein, constructing a subspace sequence common to all subspaces comprises:
selecting all the data of the second dimension or a plurality of data of the second dimension from the data set;
And constructing a subspace sequence common to all subspaces based on the selected data of the second dimension, wherein the subspace sequence comprises a time sequence and/or an event occurrence sequence.
3. The method of claim 1, wherein performing data mining on the subspace association network map to obtain an updated subspace association network map, comprises:
The method comprises the steps of obtaining feature description information of each subspace, wherein the feature description information of each subspace has the same representation form, and the feature description information of any subspace comprises one or more of vector representation information, tensor representation information, graph representation information, depth representation information and semantic representation information;
and carrying out data mining processing on the subspace association network map by utilizing a data mining algorithm based on the characteristic description information of each subspace so as to obtain the updated subspace association network map after data mining.
4. A method according to claim 3, wherein obtaining the feature description information for each subspace comprises:
and extracting feature description information corresponding to each subspace by utilizing a feature representation algorithm, wherein the feature representation algorithm comprises one or more of a vector representation algorithm, a tensor representation algorithm, a graph representation algorithm, a depth representation algorithm and a semantic representation algorithm.
5. The method of claim 4, wherein the data mining algorithm comprises a graph rolling network algorithm, a network embedding algorithm, a network dynamic modeling algorithm, a multi-granularity semantic learning algorithm, a reinforcement learning algorithm, and/or a wavelet network algorithm.
6. The method of claim 1, wherein using the updated subspace association network map to derive the analysis result of the target to be analyzed comprises:
Performing local state prediction processing on each subspace in the updated subspace association network map by using an incremental learning algorithm or a transfer learning algorithm to obtain a local state prediction result of each subspace;
And obtaining an analysis result of the target to be analyzed by using the local state prediction result of each subspace.
7. The method according to claim 1, wherein after deriving the analysis result of the object to be analyzed, the method further comprises:
And adjusting the subspace correlation network diagram based on the analysis result of the target to be analyzed so as to realize feedback learning of the subspace correlation network diagram.
8. A subspace sequence-based multidimensional data modeling and analysis system, comprising:
the system comprises an acquisition unit, a data analysis unit and a data analysis unit, wherein the acquisition unit is used for acquiring a data set of an object to be analyzed, the data set comprises a plurality of first dimension data of the object to be analyzed, the object to be analyzed comprises an object city, and the first dimension data is used for representing data types;
The subspace construction unit is used for selecting a plurality of pieces of data with specified first dimensions from the data set, and constructing subspaces corresponding to the pieces of data with the specified first dimensions based on the pieces of data with the specified first dimensions;
A subspace sequence unit, configured to construct a subspace sequence common to all subspaces, where the subspace sequence is used to characterize a logical sequence of data;
A mapping unit, configured to perform dual mapping processing on the subspace sequence and each subspace based on the subspace sequence, so as to establish a sequence number for each subspace instance in each subspace, where a subspace instance in any subspace is used to characterize one data in the any subspace;
A subspace association network diagram construction unit, configured to construct a subspace association network diagram including all subspaces by using each subspace with established sequence numbers, where the subspace association network diagram is expressed as g= { V, E, G }, V is a node set, E is a node edge set, G is a diagram attribute set, each node in the node set is a subspace instance in each subspace, and any node uses the sequence number representation of the corresponding subspace instance;
The data mining unit is used for performing data mining processing on the subspace association network map to obtain an updated subspace association network map;
And the analysis unit is used for obtaining an analysis result of the target to be analyzed by utilizing the updated subspace association network diagram, wherein the analysis result comprises a city state prediction result of the target city, and the city state prediction result comprises a traffic flow prediction result, an air quality prediction result and/or a people flow distribution prediction result.
9. An electronic device, comprising: the system comprises a memory, a processor and a transceiver which are connected in sequence in communication, wherein the memory is used for storing a computer program, the transceiver is used for receiving and transmitting messages, and the processor is used for reading the computer program and executing the multidimensional data modeling and analysis method based on the subspace sequence according to any one of claims 1-7.
10. A computer program product comprising instructions which, when run on a computer, cause the computer to perform the multi-dimensional data modeling and analysis method based on subspace sequences as claimed in any one of claims 1 to 7.
CN202410273460.5A 2024-03-11 2024-03-11 A multidimensional data modeling and analysis method and system based on subspace sequence Active CN118171201B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410273460.5A CN118171201B (en) 2024-03-11 2024-03-11 A multidimensional data modeling and analysis method and system based on subspace sequence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410273460.5A CN118171201B (en) 2024-03-11 2024-03-11 A multidimensional data modeling and analysis method and system based on subspace sequence

Publications (2)

Publication Number Publication Date
CN118171201A true CN118171201A (en) 2024-06-11
CN118171201B CN118171201B (en) 2025-01-28

Family

ID=91354008

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410273460.5A Active CN118171201B (en) 2024-03-11 2024-03-11 A multidimensional data modeling and analysis method and system based on subspace sequence

Country Status (1)

Country Link
CN (1) CN118171201B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118569981A (en) * 2024-07-29 2024-08-30 四川世纪银通科技有限公司 Customer repayment risk prediction method and system based on consumption portraits
CN119254595A (en) * 2024-12-05 2025-01-03 国鲸科技(广东横琴粤澳深度合作区)有限公司 An interconnected communication system based on multiple mesh circuits
CN119449452A (en) * 2024-11-25 2025-02-14 中国人民解放军92493部队信息技术中心 A network threat deduction system and method based on Transformer and graph attention network model
CN119516782A (en) * 2024-11-19 2025-02-25 昆山市运输事业发展中心 A network-level traffic operation status inference method for unobservable road sections
CN121037259A (en) * 2025-10-27 2025-11-28 中云汇(成都)科技集团有限公司 Environmentally friendly public toilet equipment status monitoring methods, devices and equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014051824A2 (en) * 2012-07-09 2014-04-03 University Of Rochester Use of social interactions to predict complex phenomena
US20140149273A1 (en) * 2012-11-29 2014-05-29 Rick Angell Market Microstructure Data Method and Appliance
CN111782817A (en) * 2020-05-30 2020-10-16 国网福建省电力有限公司信息通信分公司 An information system-oriented knowledge graph construction method, device and electronic device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014051824A2 (en) * 2012-07-09 2014-04-03 University Of Rochester Use of social interactions to predict complex phenomena
US20140149273A1 (en) * 2012-11-29 2014-05-29 Rick Angell Market Microstructure Data Method and Appliance
CN111782817A (en) * 2020-05-30 2020-10-16 国网福建省电力有限公司信息通信分公司 An information system-oriented knowledge graph construction method, device and electronic device

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118569981A (en) * 2024-07-29 2024-08-30 四川世纪银通科技有限公司 Customer repayment risk prediction method and system based on consumption portraits
CN119516782A (en) * 2024-11-19 2025-02-25 昆山市运输事业发展中心 A network-level traffic operation status inference method for unobservable road sections
CN119449452A (en) * 2024-11-25 2025-02-14 中国人民解放军92493部队信息技术中心 A network threat deduction system and method based on Transformer and graph attention network model
CN119254595A (en) * 2024-12-05 2025-01-03 国鲸科技(广东横琴粤澳深度合作区)有限公司 An interconnected communication system based on multiple mesh circuits
CN121037259A (en) * 2025-10-27 2025-11-28 中云汇(成都)科技集团有限公司 Environmentally friendly public toilet equipment status monitoring methods, devices and equipment

Also Published As

Publication number Publication date
CN118171201B (en) 2025-01-28

Similar Documents

Publication Publication Date Title
CN118171201A (en) A multidimensional data modeling and analysis method and system based on subspace sequence
CN101093559B (en) Method for constructing expert system based on knowledge discovery
CN117952009A (en) Intelligent production line testable digital twin modeling method
Kong et al. Big data‐driven machine learning‐enabled traffic flow prediction
CN118627759A (en) Management method, device, equipment and storage medium of smart space system
CN101344902B (en) A Protein Secondary Structure Prediction Method Based on Association Analysis and Association Classification
Castelli et al. An evolutionary system for ozone concentration forecasting
CN117829712A (en) AGV cooperative scheduling path planning method based on ecosystem optimization algorithm
CN120782079A (en) Territorial space planning method based on big data
CN114936307A (en) A Normalized Graph Model Construction Method
Liu Influencing Factors for an Integrated Model of Green Building Energy Consumption Using BIM Dynamic Simulation and Multiobjective Decision‐Making
CN120298105B (en) A big data analysis system and method for financial guarantee circles
Williams et al. Deriving metamodels to relate machine learning quality to design repository characteristics in the context of additive manufacturing
CN120408607A (en) Cybersecurity compliance risk assessment methods, devices, equipment, and storage media
Wu et al. Evaluation of particle swarm optimization, genetic algorithms, and ant colony optimization in autonomous mobile robots scheduling: A comparative study
CN119886870A (en) Intelligent carbon system based on water treatment
CN118737489A (en) A node virus correlation prediction method based on SEAL concept
Alba et al. Theoretical models of selection pressure for dEAs: topology influence
Cheng et al. Optimization of design change propagation path for mechanical product considering component complexity
Zhou et al. A Knowledge Graph Based Factor Screening Approach for Wireless Communication Networks
Li Multidimensional Information Network Big Data Mining Algorithm Relying on Finite Element Analysis
CN120197915B (en) Power outage planning method and system based on multi-agent interpretable reinforcement learning
Ding et al. Forecasts and analysis of economic outputs for Chinese high-tech industries: insights from spatial–temporal information fusion
Gu Investigation on Multi-objective Optimization of Construction Engineering Based on Data Mining Association Rule Algorithm
Guido et al. Exploring Civil Engineering Research in the Philippines: A Scientometric Investigation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant