CN102800128A - Method for establishing geomorphic description precision model by utilizing gplotmatrix and regression analysis - Google Patents
Method for establishing geomorphic description precision model by utilizing gplotmatrix and regression analysis Download PDFInfo
- Publication number
- CN102800128A CN102800128A CN2012102828722A CN201210282872A CN102800128A CN 102800128 A CN102800128 A CN 102800128A CN 2012102828722 A CN2012102828722 A CN 2012102828722A CN 201210282872 A CN201210282872 A CN 201210282872A CN 102800128 A CN102800128 A CN 102800128A
- Authority
- CN
- China
- Prior art keywords
- formula
- error
- topograph
- centerdot
- influence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 32
- 238000000611 regression analysis Methods 0.000 title abstract description 13
- 238000010586 diagram Methods 0.000 claims abstract description 35
- 239000011159 matrix material Substances 0.000 claims abstract description 34
- 238000004458 analytical method Methods 0.000 claims abstract description 6
- 108010014173 Factor X Proteins 0.000 claims description 5
- 230000000739 chaotic effect Effects 0.000 claims description 4
- 230000014509 gene expression Effects 0.000 claims 4
- 230000002596 correlated effect Effects 0.000 claims 1
- 238000011160 research Methods 0.000 abstract description 6
- 230000000007 visual effect Effects 0.000 abstract description 6
- 238000012800 visualization Methods 0.000 abstract description 6
- 238000013499 data model Methods 0.000 abstract description 4
- 238000005516 engineering process Methods 0.000 abstract description 4
- 238000007405 data analysis Methods 0.000 abstract description 2
- 230000003993 interaction Effects 0.000 abstract description 2
- 230000000694 effects Effects 0.000 description 7
- 238000009826 distribution Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 3
- NAWXUBYGYWOOIX-SFHVURJKSA-N (2s)-2-[[4-[2-(2,4-diaminoquinazolin-6-yl)ethyl]benzoyl]amino]-4-methylidenepentanedioic acid Chemical compound C1=CC2=NC(N)=NC(N)=C2C=C1CCC1=CC=C(C(=O)N[C@@H](CC(=C)C(O)=O)C(O)=O)C=C1 NAWXUBYGYWOOIX-SFHVURJKSA-N 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000011439 discrete element method Methods 0.000 description 1
- 230000003628 erosive effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000012732 spatial analysis Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000009827 uniform distribution Methods 0.000 description 1
- 238000007794 visualization technique Methods 0.000 description 1
Images
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本发明涉及利用散点图矩阵和回归分析建立地形描述精度模型的方法,可有效解决数字高程模型地形描述误差与其影响因子之间DEM地形描述精度模型的问题,解决的技术方案是,针对DEM地形描述误差与其影响因子之间互相作用,但又难以区分究竟哪几种因子对DEM地形描述误差的影响最大、它们之间的影响规律如何,并存在怎样的数学关系的问题,充分利用可视化形象、直观的特点,将数学建模与可视化技术结合起来,通过函数拟合的方式构建一种建立多元非线性数据模型的方法,最终建立新的DEM地形描述精度模型,本发明不仅增加了数学建模直观、形象的特征,而且增加了可视化数据分析的科学性和准确性,是数字高程模型误差分析与精度研究上的创新。
The invention relates to a method for establishing a terrain description accuracy model by using a scatter diagram matrix and regression analysis, which can effectively solve the problem of the DEM terrain description accuracy model between the digital elevation model terrain description error and its influencing factors. The technical solution is to aim at the DEM terrain The interaction between the description error and its influencing factors, but it is difficult to distinguish which factors have the greatest impact on the DEM terrain description error, what is the influence law between them, and what kind of mathematical relationship exists. Make full use of the visual image, Intuitive feature, combining mathematical modeling and visualization technology, constructing a method of establishing multivariate nonlinear data model through function fitting, and finally establishing a new DEM terrain description accuracy model, the invention not only increases the mathematical modeling Intuitive and visual features, and increase the scientificity and accuracy of visual data analysis, is an innovation in the research of digital elevation model error analysis and precision.
Description
技术领域 technical field
本发明涉及数字高程模型,特别是一种利用散点图矩阵和回归分析建立地形描述精度模型的方法。The invention relates to a digital elevation model, in particular to a method for establishing a terrain description accuracy model by using a scatter diagram matrix and regression analysis.
背景技术 Background technique
数字高程模型(Digital Elevation Model,简称DEM)误差分析与精度研究一直是人们关注的热点,也是贯穿于DEM建模与应用过程中的重要组成部分。库比克(Kubik,1998)在第十六届国际摄影测量与遥感学会(International Society for Photogrammetry and Remote Sensing,简称ISPRS)上提交了“数字高程模型回顾与展望”的工作报告。他指出虽然DEM精度问题在理论上已基本完成,但随着空间数据质量及不确定性研究的深入,关于对DEM地形描述误差的影响因子、传播机理及相关分析仍需要进一步研究。Digital Elevation Model (DEM for short) error analysis and accuracy research has always been a hot spot of concern, and it is also an important part throughout the process of DEM modeling and application. Kubik (1998) submitted a work report on "Review and Prospect of Digital Elevation Model" at the 16th International Society for Photogrammetry and Remote Sensing (ISPRS). He pointed out that although the problem of DEM accuracy has been basically completed in theory, with the deepening of research on spatial data quality and uncertainty, the impact factors, propagation mechanism and related analysis of DEM terrain description errors still need further research.
目前,人们大多采用数理统计与实验相结合的方法分析DEM精度。霍姆斯(Holmes,2000)在研究美国地质勘探局30米空间分辨率的DEM误差对地形特征的影响时,利用柱状图、离散点图分析了DEM误差的空间分布;陈楠、王钦敏(2006)在分析黄土高原不同空间分辨率的DEM对提取坡度精度的影响时,以折线图的形式研究了DEM地形描述误差与空间分辨率的关系;刘敏、汤国安(2007)在基于人工降雨实验获取的黄土小流域不同发育阶段地形指数变异特征分析中,绘制了地形指数随降雨侵蚀的频率分布图;李江(2007)在研究地形简化与DEM地形描述误差关系时,首先采用频率分布图分析了地形简化对坡度的影响,然后在此基础上,利用散点图剖析了DEM地形描述误差与平均坡度的相关性大小,取得了很好的效果。但随着近些年对DEM地形描述精度模型研究的深入,人们发现仅用传统的数理统计与实验相结合的方法只能描述单一影响因子对DEM地形描述精度的影响,关于DEM地形描述误差与其多个影响因子(指两个以上)之间相互关系的研究还需要换一种解决思路。At present, most people use the method of combining mathematical statistics and experiments to analyze the accuracy of DEM. Holmes (2000) analyzed the spatial distribution of DEM errors using histograms and discrete point maps when studying the impact of DEM errors with a 30-meter spatial resolution of the US Geological Survey on terrain features; Chen Nan and Wang Qinmin (2006 ) When analyzing the influence of different spatial resolution DEMs on the Loess Plateau on the accuracy of slope extraction, the relationship between the DEM terrain description error and spatial resolution was studied in the form of a line graph; Liu Min and Tang Guoan (2007) based on artificial rainfall experiments In the analysis of the variation characteristics of the topographic index at different development stages in the loess small watershed, the frequency distribution map of the topographic index along with rainfall erosion was drawn; when Li Jiang (2007) studied the relationship between terrain simplification and DEM topographic description error, he first used the frequency distribution map to analyze the The impact of terrain simplification on the slope, and then on this basis, using the scatter diagram to analyze the correlation between the DEM terrain description error and the average slope, and achieved good results. However, with the in-depth research on the DEM terrain description accuracy model in recent years, it is found that only the traditional mathematical statistics combined with experiments can only describe the influence of a single influencing factor on the DEM terrain description accuracy. The research on the relationship between multiple influencing factors (referring to more than two) still needs another solution.
可视化作为一种视觉比较技术,它将数据误差的不确定性以可见的方式展示在用户面前,将数据误差不确定性与地图制图、GIS空间分析紧密地联系在一起,使用户更直观、更形象、更正确地认识所描述的对象。因此,DEM地形描述误差可视化的意义不仅是能将DEM地形描述误差与其影响因子的关系描述出来,更重要的是它能够与回归分析结合起来,建立更加合理的DEM地形描述误差函数模型:汤国安(2001)建立了DEM地形描述误差与空间分辨率、剖面曲率的数学模型;王光霞(2004)对该模型进行修正,并研究DEM地形描述误差与空间分辨率、平均坡度的函数关系,进一步提高了模型的精度。但由于我国地域辽阔,地貌类型复杂多变。使得当分析如冰川、中山地貌时,上述精度模型的拟合效果并不好。检验其精度模型的建立过程,原因可能包括两个方面:一方面是影响DEM地形描述误差的地形因子,体现在构建DEM地形描述精度模型时,地形因子选取的不合理(指其与DEM地形描述误差相关性太小)或地形因子选取的不够(指DEM地形描述精度模型复杂,三个变量并不能揭示该模型的实质);另一方面是能够满足当前所有地貌类型的DEM地形描述精度模型很难法用一个统一的数学公式去表达。As a visual comparison technology, visualization displays the uncertainty of data errors in front of users in a visible way, and closely links the uncertainty of data errors with map drawing and GIS spatial analysis, making users more intuitive and more intuitive. Image, more correctly understand the described object. Therefore, the significance of DEM terrain description error visualization is not only to describe the relationship between DEM terrain description error and its influencing factors, but more importantly, it can be combined with regression analysis to establish a more reasonable DEM terrain description error function model: Tang Guoan (2001) established a mathematical model of DEM terrain description error, spatial resolution, and section curvature; Wang Guangxia (2004) revised the model, and studied the functional relationship between DEM terrain description error, spatial resolution, and average slope, and further improved the The accuracy of the model. However, due to the vast territory of our country, the types of landforms are complex and changeable. As a result, when analyzing landforms such as glaciers and mountains, the fitting effect of the above precision model is not good. To test the establishment process of its accuracy model, the reasons may include two aspects: one is the terrain factor that affects the error of the DEM terrain description, which is reflected in the unreasonable selection of the terrain factor when constructing the DEM terrain description accuracy model (referring to its relationship with the DEM terrain description The error correlation is too small) or the selection of terrain factors is not enough (referring to the complexity of the DEM terrain description accuracy model, the three variables cannot reveal the essence of the model); on the other hand, the DEM terrain description accuracy model that can meet all current landform types Difficult to use a unified mathematical formula to express.
因此,本文利用可视化与回归分析构建多元数据模型的重点就是从DEM地形描述误差与其影响因子中挖掘它们之间的关联特征,并根据相关性确定能否建立统一的DEM地形描述精度模型。而DEM地形描述误差的影响因子众多,如何从中选取最适宜的影响因子就成为解决这个问题的关键。散点图矩阵是描述多维数据中两两变量之间相互关系的可视化方法。它能在一幅图形中对比分析不同变量的相关性大小,同时能够可视化两个变量的关联信息,便于发现和选取两个变量之间的拟合函数。因此,可以利用散点图矩阵与回归分析的思想解决上述存在的问题。Therefore, the focus of this paper on building a multivariate data model using visualization and regression analysis is to mine the correlation features between DEM terrain description errors and their influencing factors, and to determine whether a unified DEM terrain description accuracy model can be established according to the correlation. However, there are many influencing factors on the error of DEM terrain description, how to select the most suitable influencing factors becomes the key to solve this problem. A scatterplot matrix is a visualization method that describes the relationship between two variables in multidimensional data. It can compare and analyze the correlation of different variables in a graph, and at the same time, it can visualize the correlation information of two variables, so as to facilitate the discovery and selection of the fitting function between the two variables. Therefore, the idea of scatter plot matrix and regression analysis can be used to solve the above existing problems.
发明内容 Contents of the invention
针对上述情况,为解决现有技术之缺陷,本发明之目的就是提供一种利用散点图矩阵和回归分析建立地形描述精度模型的方法,可有效解决数字高程模型(Digital Elevation Model,简称DEM)地形描述误差与其影响因子之间DEM地形描述精度模型的问题。In view of the above situation, in order to solve the defects of the prior art, the purpose of the present invention is to provide a method of using scatter plot matrix and regression analysis to establish a terrain description accuracy model, which can effectively solve the problem of Digital Elevation Model (Digital Elevation Model, referred to as DEM) The problem of DEM terrain description accuracy model between terrain description error and its influencing factors.
本发明要解决的技术问题是:针对DEM地形描述误差与其影响因子之间互相作用,但又难以区分究竟哪几种因子对DEM地形描述误差的影响最大、它们之间的影响规律如何,并存在怎样的数学关系的问题,提出一种利用散点图矩阵与回归分析相结合建立地形描述精度模型的方法,该方法充分利用可视化形象、直观的特点,将数学建模与可视化技术结合起来,通过函数拟合的方式构建一种建立多元非线性数据模型的方法,最终建立新的DEM地形描述精度模型。The technical problem to be solved by the present invention is: aiming at the interaction between the DEM terrain description error and its influencing factors, it is difficult to distinguish which factors have the greatest influence on the DEM terrain description error, what are the influence rules between them, and there are In order to solve the problem of mathematical relationship, a method of combining scatter plot matrix and regression analysis to establish a terrain description accuracy model is proposed. This method makes full use of the visual image and intuitive features, and combines mathematical modeling with visualization technology. A method of establishing a multivariate nonlinear data model is constructed by means of function fitting, and finally a new DEM terrain description accuracy model is established.
本发明的技术解决方案为:利用散点图矩阵与回归分析建立地形描述精度模型方法,步骤如下:The technical solution of the present invention is: utilize scatter diagram matrix and regression analysis to establish the terrain description accuracy model method, the steps are as follows:
第1步,绘制数字高程模型地形描述误差与其影响因子(包括空间分辨率、坡度、剖面曲率、相对高差、凹凸系数、坡向和粗糙度)的散点图矩阵,判断并利用一个影响因子拟合地形描述精度模型;The first step is to draw a scatter matrix of digital elevation model terrain description error and its influencing factors (including spatial resolution, slope, profile curvature, relative height difference, concave-convex coefficient, slope aspect and roughness), and determine and use an influencing factor Fitting terrain description accuracy model;
(1.1)绘制数字高程模型地形描述误差与其影响因子的散点图矩阵,设地形描述误差为Y,与其相关的影响因子为X,由于影响因子不止一个,这里分别设为X1,X2,…,Xk,其中,k表示影响因子的个数,且k>1,绘制地形描述误差Y与其影响因子X1,X2,…,Xk的散点图矩阵,并在每一个散点图上绘制置信椭圆,椭圆的长、短半径反映两个变量的相关性,即椭圆越扁,相关性越强;(1.1) Draw the scatter diagram matrix of the digital elevation model terrain description error and its influencing factors. Let the terrain description error be Y, and the related influencing factor be X. Since there are more than one influencing factors, here we set them as X 1 , X 2 , …, X k , where k represents the number of influencing factors, and k>1, draw the scatter diagram matrix of the terrain description error Y and its influencing factors X 1 , X 2 ,…, X k , and at each scatter point A confidence ellipse is drawn on the graph, and the long and short radii of the ellipse reflect the correlation between the two variables, that is, the flatter the ellipse, the stronger the correlation;
(1.2)通过观察上一步得到的散点图矩阵判断能否利用本方法建立地形描述精度模型,即根据地形描述误差Y与每一个影响因子Xi的散点图特征,其中,i表示第i个,且i=1,2,…,k,判断它们之间是否存在一定的函数关系。若地形描述误差Y与任意一个Xi的散点图特征都表现的非常混乱,无法用一个或多个多项式建立它们之间的相互联系,即Y与任意一个Xi不存在明显的关系特征,则不能利用本方法建立地形描述误差Y与影响因子X之间的函数模型,建模结束。若地形描述误差Y与任意一个Xi存在函数关系,则进一步确定能否利用一个多项式拟合Y与Xi的图形特征;(1.2) By observing the scatter diagram matrix obtained in the previous step, it is judged whether this method can be used to establish a terrain description accuracy model, that is, according to the scatter diagram characteristics of the terrain description error Y and each influencing factor X i , where i represents the ith , and i=1, 2, ..., k, to judge whether there is a certain functional relationship between them. If the terrain description error Y and the scatter diagram features of any X i are very chaotic, one or more polynomials cannot be used to establish the relationship between them, that is, there is no obvious relationship between Y and any X i , Then this method cannot be used to establish a functional model between the terrain description error Y and the influencing factor X, and the modeling ends. If there is a functional relationship between the terrain description error Y and any X i , it is further determined whether a polynomial can be used to fit the graphic features of Y and X i ;
(1.3)若能,则判断地形描述误差Y与满足当前条件所有影响因子Xi的相关性大小,并假设选择相关性最好的X1拟合公式(1),其中,n为拟合多项式的次数,a0,a1,…,an-1,an,建立地形描述精度模型,建模结束,(1.3) If yes, judge the correlation between terrain description error Y and all influencing factors X i that meet the current conditions, and assume that the best correlation X 1 fitting formula (1) is selected, where n is the fitting polynomial The number of times, a 0 , a 1 ,..., a n-1 , a n , to establish a terrain description accuracy model, and the modeling ends,
(1.4)若不能,则利用多组多项式分别拟合地形描述精度模型。首先,将满足当前条件的Y与Xi散点图划分成几个不同的区域,划分的依据为划分后每一个区域的Y与Xi都可以利用一个多项式表示,然后,寻找变量Xi,为了便于下一步拟合,使其与Y在每一个区域内拟合的多项式次数越低越好,即Xi与Y在每一个区域的散点图特征最好表现为线形相关,假设其为X1,最后,依据其图形特征拟合公式(2),其中,n为拟合多项式的次数,m为拟合多项式的个数,即划分的区域数,apq为拟合多项式的系数,p表示第p行,q表示第q列,且p=1,2,…,m,q=0,1,…,n,(1.4) If not, multiple sets of polynomials are used to fit the terrain description accuracy model respectively. First, divide the Y and Xi scatter diagrams that meet the current conditions into several different areas. The basis for the division is that Y and Xi in each area can be represented by a polynomial. Then, find the variable Xi , In order to facilitate the next step of fitting, the lower the polynomial degree of fitting Y in each area, the better, that is, the scatter plot characteristics of Xi and Y in each area are best shown as linear correlations, assuming it is X 1 , finally, fit the formula (2) according to its graphic features, where n is the number of fitted polynomials, m is the number of fitted polynomials, that is, the number of divided areas, a pq is the coefficient of the fitted polynomials, p represents row p, q represents column q, and p=1, 2,..., m, q=0, 1,..., n,
公式(2); Formula (2);
(1.5)公式(2)的系数矩阵可以提取新变量A,设Aq=(a1q,a2q,…,amq)T,其中,q表示第q个,且q=0,1,…,n,则公式(2)可写为公式(3),再进行第2步,(1.5) A new variable A can be extracted from the coefficient matrix of formula (2), let A q = (a 1q , a 2q ,…, a mq ) T , where q represents the qth, and q=0, 1,… , n, then formula (2) can be written as formula (3), and then proceed to the second step,
第2步,利用(1.5)获得的新变量A0,A1…An-1,An代替数字高程模型地形描述误差并与剩余变量,即除去第1步中已经拟合过的X1进行拟合,重复本步骤并直到可以利用一个函数建立地形描述精度模型;In the second step, use the new variables A 0 , A 1 ... A n-1 obtained in (1.5), A n to replace the digital elevation model terrain description error and combine it with the remaining variables, that is, remove the X 1 that has been fitted in the first step Perform fitting, repeat this step until a function can be used to establish a terrain description accuracy model;
(2.1)考虑到地形描述误差Y已经与X1拟合过函数,即可认为X1变量对地形描述误差的影响已被消除。因此,这里为了排除X1对Aq的作用,在Aq与Xi的拟合过程中,其中,i的取值为2,3…,k,并且Xi中基的选取应基于X1为固定值,即尽量选取X1变化范围最小的m个基。最后,绘制Aq,X2,X3,…,Xk的散点图矩阵,其中,q表示第q个,且q=0,1,…,n;(2.1) Considering that the terrain description error Y has been fitted with X1 , it can be considered that the influence of the X1 variable on the terrain description error has been eliminated. Therefore, in order to exclude the effect of X 1 on A q , in the fitting process of A q and X i , the value of i is 2, 3...,k, and the selection of the base in X i should be based on X 1 is a fixed value, that is, try to select m bases with the smallest variation range of X1 . Finally, draw the scatter plot matrix of A q , X 2 , X 3 , ..., X k , where q represents the qth, and q=0, 1, ..., n;
(2.2)观察(2.1)得到的散点图矩阵判断能否利用本方法建立统一的地形描述精度模型。即根据Aq与每一个影响因子Xi的散点图特征,其中,i表示第i个,且i=2,3,…,k,判断它们之间是否存在一定的函数关系。若Aq与任意一个Xi的散点图特征都表现的非常混乱,无法用一个或多个多项式建立它们之间的相互联系,即Aq与任意一个Xi不存在明显的关系特征,则不能利用本方法建立统一的地形描述误差Y与影响因子X之间的函数模型,建模结束,若Aq与任意一个Xi存在函数关系,则进一步确定能否利用一个多项式拟合Aq与Xi图形特征;(2.2) Observe the scatter diagram matrix obtained in (2.1) to judge whether this method can be used to establish a unified terrain description accuracy model. That is, according to the scatter diagram characteristics of A q and each influencing factor Xi , where i represents the i-th one, and i=2, 3,..., k, judge whether there is a certain functional relationship between them. If the scatter diagram features of A q and any X i are very chaotic, and one or more polynomials cannot be used to establish the relationship between them, that is, there is no obvious relationship between A q and any X i , then This method cannot be used to establish a unified functional model between the terrain description error Y and the influencing factor X. After the modeling is completed, if there is a functional relationship between A q and any X i , it is further determined whether a polynomial can be used to fit A q and X i graphic features;
(2.3)若能,则判断Aq与满足当前条件的所有影响因子Xi的相关性大小,并选择相关性最好的X2拟合公式(4),其中,n2为Aq拟合多项式的次数,b0,b1,…,,为拟合多项式的系数,继续进行第3步,(2.3) If yes, judge the correlation between A q and all influencing factors X i that meet the current conditions, and choose the X 2 fitting formula (4) with the best correlation, where n 2 is A q fitting degree of polynomial, b 0 , b 1 ,…, , To fit the coefficients of the polynomial, proceed to step 3,
(2.4)若不能,则利用多组多项式分别拟合Aq与Xi的关系模型,方法参考1.4,首先,将满足当前条件Aq与Xi的散点图划分成几个不同的区域,划分的依据为划分后每一个区域的Aq与Xi都可以利用一个多项式表示。然后,寻找变量Xi,为了便于下一步拟合,使其与Aq在每一个区域内拟合的多项式次数越低越好,即Xi与Aq在每一个区域的散点图特征最好表现为线形相关,假设其为X2,并依据其图形特征利用多个函数拟合公式(5),其中,n2为Aq拟合多项式的次数,m2为Aq拟合多项式的个数,为Aq拟合多项式的系数,p2表示第p2行,q2表示第q2列,且p2=1,2,…,m2,q2=0,1,…,n2,(2.4) If not, use multiple sets of polynomials to fit the relationship model between A q and Xi respectively. For the method, refer to 1.4. First, divide the scatter diagram that meets the current conditions A q and Xi into several different areas. The division is based on the fact that A q and Xi of each area after division can be represented by a polynomial. Then, look for the variable X i , in order to facilitate the next step of fitting, the lower the polynomial degree of fitting with A q in each area, the better, that is, the scatter plot characteristics of Xi and A q in each area are the best If it is linear correlation, assume it is X 2 , and use multiple functions to fit the formula (5) according to its graphic characteristics, where n 2 is the degree of A q fitting polynomial, m 2 is the degree of A q fitting polynomial number, is the coefficient of the polynomial fitting for A q , p 2 represents the p 2th row, q 2 represents the q 2th column, and p 2 =1, 2,..., m 2 , q 2 =0, 1,..., n 2 ,
公式(5); Formula (5);
(2.5)公式(5)的系数矩阵可再提取一系列新变量B,设=(,,…,)T,其中,q2=0,1,…,n2,则公式(5)可写为公式(6),然后利用新获取的变量重复进行第2步,直到可以用一个函数拟合,(2.5) A series of new variables B can be extracted from the coefficient matrix of formula (5), let =( , ,..., ) T , where, q 2 =0, 1, ..., n 2 , then formula (5) can be written as formula (6), and then use the newly acquired variables to repeat step 2 until a function can be used to fit,
第3步:整理数字高程模型地形描述误差Y与影响因子X的拟合过程,将第2步所得公式代入到第1步所得公式,得公式(7),Step 3: sort out the fitting process of digital elevation model terrain description error Y and influencing factor X, and substitute the formula obtained in step 2 into the formula obtained in
其中,k表示影响因子的个数,且k>1,描述精度模型完成。Among them, k represents the number of influencing factors, and k>1, the description accuracy model is completed.
附图说明 Description of drawings
图1为本发明的利用散点图矩阵与回归分析建立DEM地形描述精度模型的过程流程图。Fig. 1 is a flow chart of the process of establishing a DEM terrain description accuracy model using a scatter diagram matrix and regression analysis in the present invention.
图2为本发明的丘陵、黄土、冰川与中山四种地貌类型实验数据的DEM地形描述误差与空间分辨率、坡度、剖面曲率、相对高差、凹凸系数、坡向、粗糙度七种影响因子的散点图矩阵,用于判断并选取拟合变量和拟合函数。Fig. 2 is the DEM terrain description error and spatial resolution, slope, profile curvature, relative height difference, concave-convex coefficient, slope aspect, and seven influencing factors of the experimental data of the four types of landforms of hills, loess, glaciers and Zhongshan in the present invention The scatterplot matrix of is used to judge and select fitting variables and fitting functions.
图3为DEM地形描述误差的与空间分辨率的散点图矩阵,用于更清楚的辨别图形特征。Figure 3 is a scatter plot matrix of DEM terrain description error and spatial resolution, which is used to more clearly identify graphic features.
图4为本发明系数A1、A0与10米分辨率的DEM六种地形因子(坡度、剖面曲率、相对高差、凹凸系数、坡向和粗糙度)的散点图矩阵。Fig. 4 is a scatter diagram matrix of coefficients A 1 , A 0 and six topographic factors (slope, section curvature, relative height difference, concave-convex coefficient, aspect and roughness) of the DEM of the present invention with a resolution of 10 meters.
具体实施方式Detailed ways
以下结合附图对本发明的具体实施方式作进一步详细说明。The specific implementation manners of the present invention will be described in further detail below in conjunction with the accompanying drawings.
如图1所示,本发明的具体实施步骤如下:As shown in Figure 1, the specific implementation steps of the present invention are as follows:
数据准备,首先,选取比例尺为1:5万的丘陵、黄土、中山与冰川四种地貌类型等高线数据四幅,并将这四幅数据分别内插成分辨率为10米的规则格网DEM数据,并认为其为真值;然后,进一步利用新获取的四幅规则格网DEM数据等间隔分别抽取空间分辨率为20m、30m、40m、50m、60m、70m的六幅DEM数据(抽取的标准参考《1:5万数字高程模型(DEM)生产技术规定》);最后,计算24幅图的DEM地形描述误差。Data preparation, first, select four pieces of contour data of hills, loess, Zhongshan and glacier with a scale of 1:50,000, and interpolate these four pieces of data into regular grid DEM data with a resolution of 10 meters , and consider it to be the true value; then, further use the newly acquired four pieces of regular grid DEM data to extract six pieces of DEM data with spatial resolutions of 20m, 30m, 40m, 50m, 60m, and 70m at equal intervals (the extracted standard reference "1:50,000 Digital Elevation Model (DEM) Production Technical Regulations"); finally, calculate the DEM terrain description error of 24 maps.
第一步,绘制DEM地形描述误差与其影响因子的散点图矩阵,判断并利用一个影响因子拟合地形描述精度模型。The first step is to draw the scatter plot matrix of the DEM terrain description error and its influencing factors, and judge and use an influencing factor to fit the terrain description accuracy model.
(1.1)首先,绘制DEM地形描述误差与空间分辨率、坡度、剖面曲率、相对高差、凹凸系数、坡向、粗糙度七种影响因子的散点图矩阵,如图2所示。(1.1) First, draw the scatter diagram matrix of the seven influencing factors of DEM terrain description error and spatial resolution, slope, profile curvature, relative height difference, concave-convex coefficient, slope aspect, and roughness, as shown in Figure 2.
(1.2)可以看出,DEM地形描述误差与七种影响因子之间并没有一种统一分布的趋势,因此,难以利用一个函数拟合DEM地形描述误差与单一影响因子的函数关系,所以跳过步骤(1.3)。(1.2) It can be seen that there is no uniform distribution trend between the DEM terrain description error and the seven influencing factors. Therefore, it is difficult to use a function to fit the functional relationship between the DEM terrain description error and a single influencing factor, so skip Step (1.3).
(1.4)另外,发现DEM地形描述误差与空间分辨率、地形因子之间在某一阶段存在特定的分布特征,基本上可以把DEM地形描述误差与影响因子的分布按照地貌类型划分成四个独立的区域,即可以借助多组函数拟合DEM地形描述误差的图形特征,且DEM地形描述误差与分辨率的线性关系更加明显。因此,放大四种地貌类型的DEM地形描述误差与空间分辨率的散点图,如图3所示,可以分别线性拟合四种地貌类型的DEM地形描述误差与空间分辨率关系。设Et为DEM地形描述误差,R为空间分辨率,利用公式(2)则得到公式(8)。(1.4) In addition, it is found that there are specific distribution characteristics between the DEM terrain description error, spatial resolution, and terrain factors at a certain stage. Basically, the distribution of DEM terrain description errors and influencing factors can be divided into four independent categories according to the terrain type. In the area of , that is, the graphical characteristics of the DEM terrain description error can be fitted with the help of multiple sets of functions, and the linear relationship between the DEM terrain description error and the resolution is more obvious. Therefore, zooming in on the scatter plots of the DEM terrain description error and spatial resolution of the four types of landforms, as shown in Figure 3, can linearly fit the relationship between the DEM terrain description error and the spatial resolution of the four types of landforms. Let E t be the error of DEM terrain description, R be the spatial resolution, and formula (8) can be obtained by using formula (2).
(1.5)提取公式(8)的方程系数,设为A0、A1,则A1=(0.1010,0.0991,0.1912,0.5733)T,A0=(-0.9941,-0.7364,0.0985,-2.0085)T,即公式(8)利用公式(3)可写为公式(9),继续进行下一步。(1.5) Extract the equation coefficients of formula (8), set A 0 and A 1 , then A 1 = (0.1010, 0.0991, 0.1912, 0.5733) T , A 0 = (-0.9941, -0.7364, 0.0985, -2.0085) T , that is, formula (8) can be written as formula (9) using formula (3), and proceed to the next step.
第二步,利用获得的新变量A1、A0代替DEM地形描述误差并与剩余变量(指除去空间分辨率以外的六种影响因子)继续拟合。In the second step, use the obtained new variables A 1 and A 0 to replace the DEM terrain description error and continue fitting with the remaining variables (referring to the six influencing factors except spatial resolution).
(2.1)由于空间分辨率对DEM地形描述误差的影响已经在第一步中被考虑进去,且当空间分辨率固定时,其他任意一种影响因子的样本都恰为4个(与新变量A1、A0中基的大小相等),因此这里直接绘制系数A1、A0与10米分辨率的DEM六种地形因子(坡度、剖面曲率、相对高差、凹凸系数、坡向和粗糙度)的散点图矩阵,如图4所示。(2.1) Since the influence of spatial resolution on the error of DEM terrain description has been taken into account in the first step, and when the spatial resolution is fixed, the number of samples of any other influencing factor is exactly 4 (with the new variable A 1 , the size of the base in A 0 is equal), so here directly draw coefficients A 1 , A 0 and six topographic factors (slope, profile curvature, relative height difference, concave-convex coefficient, aspect and roughness of DEM with 10-meter resolution ) scatterplot matrix, as shown in Figure 4.
(2.2)可以看出,坡度、相对高差与方程系数A1的散点图特征明显,而凹凸系数与系数A0的散点图特征明显,并且,对系数A1、A0都分别可以只用一个多项式进行回归分析。(2.2) It can be seen that the scatter diagram features of slope, relative height difference and equation coefficient A 1 are obvious, while the scatter diagram characteristics of concave-convex coefficient and coefficient A 0 are obvious, and the coefficients A 1 and A 0 can be respectively Regression analysis with only one polynomial.
(2.3)对于系数A1,尽管坡度、相对高差都与其存在一定的函数关系,但相对高差与系数A1的相关程度更高、二次拟合效果更好,所以设相对高差为H,利用公式(4)则得到公式(10)。(2.3) For the coefficient A 1 , although the slope and the relative height difference have a certain functional relationship with it, the correlation between the relative height difference and the coefficient A 1 is higher, and the quadratic fitting effect is better, so the relative height difference is set as H, formula (10) is obtained by using formula (4).
对于系数A0,凹凸系数与系数A0的线性拟合效果更好。设凹凸系数为C,利用公式(4)则得到公式(11)。For the coefficient A 0 , the linear fitting effect of the concave-convex coefficient and the coefficient A 0 is better. Assuming that the concave-convex coefficient is C, formula (11) can be obtained by using formula (4).
第三步,利用公式(10)、(11)整理公式(8),得到DEM地形描述误差Et与空间分辨率R和相对高差H、凹凸系数C的关系公式(12):In the third step, use formulas (10) and (11) to sort out formula (8), and obtain the relationship formula (12) between DEM terrain description error E t and spatial resolution R, relative height difference H, and concave-convex coefficient C:
精度分析,另选取分辨率为10米的12幅规则格网DEM数据(其中,丘陵、黄土、冰川与中山地貌各三幅),对比已有的DEM地形描述精度模型旧公式(13)(王光霞(2004)改进的DEM地形描述精度模型,其中Et为DEM地形描述误差,R为空间分辨率,V为剖面曲率),本发明利用新的DEM地形描述误差与空间分辨率、相对高差、凹凸系数的关系公式(12),将这12幅数据的DEM地形描述误差重新进行了计算,结果如表1所示。Accuracy analysis, another 12 pieces of regular grid DEM data with a resolution of 10 meters (including three pieces of hills, loess, glaciers, and Zhongshan landforms) were selected, and compared with the old formula (13) of the existing DEM terrain description accuracy model (Wang Guangxia (2004) improved DEM terrain description accuracy model, where E t is the DEM terrain description error, R is the spatial resolution, and V is the profile curvature), the present invention uses the new DEM terrain description error and spatial resolution, relative height difference, The relationship formula (12) of the concave-convex coefficient recalculates the DEM terrain description errors of these 12 pieces of data, and the results are shown in Table 1.
表1 12幅DEM数据的DEM地形描述误差的计算结果(单位:米)Table 1 Calculation results of DEM terrain description error of 12 pieces of DEM data (unit: meter)
从表1看出,本发明新公式计算的DEM地形描述误差与真实误差在冰川地貌和中山地貌的吻合程度更高,其中,中山地貌的DEM地形描述误差提高了一个数量级,但在丘陵地貌和黄土地貌,两组公式的拟合效果互有好坏,为了进一步对比两组公式的优劣,本文分别计算了DEM真实误差与旧公式、新公式在丘陵与黄土地貌、中山与冰川地貌的相关系数及标准差,如表2所示。As can be seen from Table 1, the DEM terrain description error calculated by the new formula of the present invention is more consistent with the real error in glacial landforms and Zhongshan landforms. Wherein, the DEM terrain description error of Zhongshan landforms has improved by an order of magnitude, but in hilly landforms and Zhongshan landforms. For the loess landform, the fitting effects of the two sets of formulas are different. In order to further compare the advantages and disadvantages of the two sets of formulas, this paper calculates the correlation between the real error of DEM and the old formula and the new formula in hills and loess landforms, and Zhongshan and glacier landforms. Coefficients and standard deviations are shown in Table 2.
表2 真实误差与旧公式、新公式计算的DEM地形描述误差的相关系数及标准差Table 2 Correlation coefficient and standard deviation between real error and DEM terrain description error calculated by old formula and new formula
从表2中可以看出,新公式尽管在丘陵与黄土地貌的拟合效果与原公式相当,但其在冰川地貌与中山地貌拟合效果大幅度提升,达到了拓展地形描述精度模型适用范围的目的。所以,利用散点图矩阵和回归分析建立地形描述精度模型的方法是科学有效的。It can be seen from Table 2 that although the fitting effect of the new formula on hills and loess landforms is comparable to that of the original formula, its fitting effect on glacial landforms and Zhongshan landforms has been greatly improved, reaching the goal of expanding the application range of terrain description accuracy models. Purpose. Therefore, it is scientific and effective to use the scatter plot matrix and regression analysis to establish the terrain description accuracy model.
本发明的优点在于:The advantages of the present invention are:
(1)本发明将可视化技术与数理统计的方法结合起来,利用散点图矩阵解决了多元数据回归分析中选取拟合变量及拟合函数的方法,不仅增加了数学建模直观、形象的特征,而且增加了可视化数据分析的科学性和准确性。(1) The present invention combines the visualization technology with the method of mathematical statistics, and uses the scatter plot matrix to solve the method of selecting fitting variables and fitting functions in multivariate data regression analysis, which not only increases the intuitive and vivid features of mathematical modeling , and increase the scientificity and accuracy of visual data analysis.
(2)本发明利用多元数据的散点图矩阵特征与多次重复曲线拟合的思想,提出了一种多元数据之间非线性函数模型的建立方法,为今后构建其它领域的多元数据模型提供一种参考和借鉴。(2) The present invention utilizes the scatter plot matrix characteristics of multivariate data and the idea of repeated curve fitting, and proposes a method for establishing a nonlinear function model between multivariate data, which provides a basis for building multivariate data models in other fields in the future. A kind of reference and reference.
(3)本发明建立了DEM地形描述精度模型,揭示了DEM地形描述误差与其影响因子的变化规律,有利于指导人们理解与掌握DEM精度的实质。(3) The present invention establishes a DEM terrain description accuracy model, which reveals the variation rule of the DEM terrain description error and its influencing factors, which is beneficial to guide people to understand and grasp the essence of DEM accuracy.
Claims (1)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2012102828722A CN102800128A (en) | 2012-08-09 | 2012-08-09 | Method for establishing geomorphic description precision model by utilizing gplotmatrix and regression analysis |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2012102828722A CN102800128A (en) | 2012-08-09 | 2012-08-09 | Method for establishing geomorphic description precision model by utilizing gplotmatrix and regression analysis |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN102800128A true CN102800128A (en) | 2012-11-28 |
Family
ID=47199224
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2012102828722A Pending CN102800128A (en) | 2012-08-09 | 2012-08-09 | Method for establishing geomorphic description precision model by utilizing gplotmatrix and regression analysis |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN102800128A (en) |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN108427741A (en) * | 2018-03-06 | 2018-08-21 | 太原理工大学 | A kind of DEM relative error evaluation methods based on a large amount of high-precision control points |
| CN108628806A (en) * | 2017-03-22 | 2018-10-09 | 卡西欧计算机株式会社 | Electronic equipment, electronic apparatus system, curve plotting method and recording medium |
| CN108700852A (en) * | 2017-01-27 | 2018-10-23 | 三菱日立电力系统株式会社 | Model parameter value estimating device and presumption method, program, the recording medium having program recorded thereon, model parameter value deduction system |
| CN112180914A (en) * | 2020-09-14 | 2021-01-05 | 北京石头世纪科技股份有限公司 | Map processing method, device, storage medium and robot |
| CN117453805A (en) * | 2023-12-22 | 2024-01-26 | 石家庄学院 | A visual analysis method for uncertainty data |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101644595A (en) * | 2009-09-01 | 2010-02-10 | 南京大学 | Fitting method of complex water level process |
| CN101900546A (en) * | 2009-05-27 | 2010-12-01 | 中国科学院地理科学与资源研究所 | A Method for Constructing a Digital Elevation Model for Discrete Expression of Earth's Surface Topography |
-
2012
- 2012-08-09 CN CN2012102828722A patent/CN102800128A/en active Pending
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101900546A (en) * | 2009-05-27 | 2010-12-01 | 中国科学院地理科学与资源研究所 | A Method for Constructing a Digital Elevation Model for Discrete Expression of Earth's Surface Topography |
| CN101644595A (en) * | 2009-09-01 | 2010-02-10 | 南京大学 | Fitting method of complex water level process |
Non-Patent Citations (7)
| Title |
|---|
| 张云端 等: "数字高程模型DEM精度研究", 《测绘与空间地理信息》, vol. 30, no. 3, 30 June 2007 (2007-06-30), pages 120 - 123 * |
| 汤国安 等: "数字高程模型地形描述精度量化模拟研究", 《测绘学报》, vol. 30, no. 4, 30 November 2001 (2001-11-30), pages 361 - 365 * |
| 王光霞 等: "数字高程模型地形描述精度的研究", 《测绘学报》, vol. 33, no. 2, 31 May 2004 (2004-05-31), pages 168 - 173 * |
| 胡文英 等: "基于DEM的遥感数据复原方法研究", 《国土资源遥感》, no. 1, 15 March 2007 (2007-03-15), pages 41 - 43 * |
| 贾施旎 等: "高程内插方法对DEM所提取坡度、坡向精度的影响", 《地球信息科学学报》, vol. 11, no. 1, 28 February 2009 (2009-02-28), pages 36 - 42 * |
| 齐晓飞 等: "DEM误差可视化方法分析与研究", 《测绘科学》, vol. 36, no. 3, 31 May 2011 (2011-05-31), pages 169 - 171 * |
| 齐晓飞: "DEM误差可视化方法的研究与实践", 《中国优秀硕士学位论文全文数据库》, no. 7, 15 July 2012 (2012-07-15) * |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN108700852A (en) * | 2017-01-27 | 2018-10-23 | 三菱日立电力系统株式会社 | Model parameter value estimating device and presumption method, program, the recording medium having program recorded thereon, model parameter value deduction system |
| CN108628806A (en) * | 2017-03-22 | 2018-10-09 | 卡西欧计算机株式会社 | Electronic equipment, electronic apparatus system, curve plotting method and recording medium |
| CN108427741A (en) * | 2018-03-06 | 2018-08-21 | 太原理工大学 | A kind of DEM relative error evaluation methods based on a large amount of high-precision control points |
| CN112180914A (en) * | 2020-09-14 | 2021-01-05 | 北京石头世纪科技股份有限公司 | Map processing method, device, storage medium and robot |
| CN112180914B (en) * | 2020-09-14 | 2024-04-16 | 北京石头创新科技有限公司 | Map processing method, device, storage medium and robot |
| CN117453805A (en) * | 2023-12-22 | 2024-01-26 | 石家庄学院 | A visual analysis method for uncertainty data |
| CN117453805B (en) * | 2023-12-22 | 2024-03-15 | 石家庄学院 | Visual analysis method for uncertainty data |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN112765808B (en) | Ecological drought monitoring and evaluating method | |
| Yu et al. | Spatial and temporal dynamics of urban sprawl along two urban–rural transects: A case study of Guangzhou, China | |
| CN112819319A (en) | Method for measuring correlation between city vitality and spatial social characteristics and application | |
| CN106372277B (en) | Method for optimizing variation function model in forest land index space-time estimation | |
| CN102520464B (en) | Regional waterlogging forecasting system and forecasting method thereof | |
| CN101900546A (en) | A Method for Constructing a Digital Elevation Model for Discrete Expression of Earth's Surface Topography | |
| CN105701615A (en) | Crop suitability evaluation method based on environment information | |
| CN111489092B (en) | A method and system for evaluating the suitable habitat area for plant cultivation and planting | |
| CN105893770A (en) | Method for quantifying influence on basin water resources by climate change and human activities | |
| CN107860889A (en) | The Forecasting Methodology and equipment of the soil organism | |
| CN102800128A (en) | Method for establishing geomorphic description precision model by utilizing gplotmatrix and regression analysis | |
| CN103413036A (en) | Continuous forest fire weather level forecasting model and application thereof | |
| Liu et al. | Constraining land surface and atmospheric parameters of a locally coupled model using observational data | |
| CN117333781B (en) | Intelligent extraction method, device, equipment and medium for black soil erosion trench satellite remote sensing | |
| CN117787081A (en) | An uncertainty analysis method for hydrological model parameters based on the Morris and Sobol method | |
| CN109710889B (en) | Sampling method for accurately estimating forest productivity based on tree ring | |
| Wang et al. | Exploring the interdependencies of ecosystem services and social-ecological factors on the Loess Plateau through network analysis | |
| CN118212520B (en) | Soil erosion evaluation method and related equipment | |
| CN108205718A (en) | Production method and system are surveyed in a kind of cereal crops sampling | |
| CN113935658A (en) | A method for evaluating land use change in mountainous areas based on terrain gradient | |
| CN111914339B (en) | Landscape design method and system based on landscape performance evaluation | |
| Zhang et al. | Simulating the impact of climate change on the suitable area for cotton in Xinjiang based on SDMs model | |
| CN104424373B (en) | A kind of fine expression of space variable correlation | |
| Renton et al. | Functional–structural plant modelling using a combination of architectural analysis, L-systems and a canonical model of function | |
| CN103824325A (en) | Fruit tree branch interactive three-dimensional reconstruction method and system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
| WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20121128 |