Funding: Supported by the National Natural Science Foundation of China (No. 62002144) and the Ministry of Education Chunhui Plan Research Project (Nos. 202200345, HZKY20220125).
Abstract: Accurate geospatial data are essential for geographic information systems (GIS), environmental monitoring, and urban planning. The deep integration of the open Internet with geographic information technology has made the integrity and security of spatial data increasingly difficult to guarantee. In this paper, we treat abnormal spatial data as missing data and focus on recovering them. Existing geospatial data recovery methods require complete datasets for training, which makes recovery time-consuming and limits generalization. To address these issues, we propose a GAIN-LSTM-based geospatial data recovery method (TGAIN) with two main components: (1) a long short-term memory (LSTM) recurrent neural network serves as the generator, analyzing geospatial temporal data and capturing its temporal correlation; (2) a cue-masked fusion matrix mechanism is used to construct the complete TGAIN network, so that the recovered data match the original distribution of the input data. Experimental results on two publicly accessible datasets show that TGAIN outperforms four contemporary and traditional models in mean absolute error (MAE), root mean square error (RMSE), mean square error (MSE), mean absolute percentage error (MAPE), coefficient of determination (R²), and average computational time across various missing-data rates. TGAIN also recovers data more accurately and robustly than existing models, especially at high missing rates. The model helps improve the integrity of geospatial data and provides data support for practical applications such as urban traffic optimization prediction and personal mobility analysis.
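The evaluation described in this abstract can be illustrated with a minimal sketch. It assumes the usual GAIN-style convention that a binary mask marks observed entries with 1 and missing entries with 0, and that observed values are kept while missing ones are filled from the generator output. The function names `combine_with_mask` and `recovery_metrics`, the toy arrays, and the choice to score only the missing entries are assumptions for illustration, not details taken from the TGAIN paper.

```python
import numpy as np


def combine_with_mask(x, generated, mask):
    """Keep observed entries (mask == 1) and fill missing ones (mask == 0)."""
    return mask * x + (1 - mask) * generated


def recovery_metrics(truth, imputed, mask):
    """Compute error metrics over the missing entries only (mask == 0)."""
    missing = mask == 0
    y, y_hat = truth[missing], imputed[missing]
    err = y - y_hat
    mse = float(np.mean(err ** 2))
    return {
        "MAE": float(np.mean(np.abs(err))),
        "MSE": mse,
        "RMSE": float(np.sqrt(mse)),
        # MAPE assumes no ground-truth value is zero.
        "MAPE": float(np.mean(np.abs(err / y)) * 100),
        "R2": float(1 - np.sum(err ** 2) / np.sum((y - np.mean(y)) ** 2)),
    }


# Toy data (hypothetical): two of the four entries are missing.
truth = np.array([[1.0, 2.0], [4.0, 8.0]])
mask = np.array([[1, 0], [0, 1]])
observed = truth * mask
generated = np.array([[9.0, 3.0], [2.0, 9.0]])  # stand-in for a generator output

imputed = combine_with_mask(observed, generated, mask)
metrics = recovery_metrics(truth, imputed, mask)
print(imputed)
print(metrics)
```

Scoring only the masked-out positions keeps the metrics from being inflated by entries the model never had to recover; whether the paper does the same is not stated in the abstract.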
Funding: Supported in part by the National Natural Science Foundation of China (11&zd167, 51507084, 61572262); the NSF of Jiangsu Province (BK20141427); NUPT (NY214097); the Open Research Fund of the Key Lab of Broadband Wireless Communication and Sensor Network Technology (NUPT), Ministry of Education (NYKL201507); the Qinlan Project of Jiangsu Province; and the General Project of the National Natural Science Foundation of China under Grant 41471300.
Abstract: This paper addresses reflectance estimation modeling with the aim of improving estimation accuracy. We propose a model with two core procedures: dimensionality reduction and model mining. First, a dimensionality reduction algorithm for hyperspectral data based on dependence degree (DRND-DD) is proposed to remove redundant hyperspectral bands. DRND-DD selects suitable hyperspectral bands using rough set theory. Second, to improve the computation speed and accuracy of the model, we build on DRND-DD and propose reflectance estimation model mining of leaf nitrogen concentration (LNC) for hyperspectral data using hybrid gene expression programming (REMLNC-HGEP). Experimental results on three datasets show that DRND-DD obtains good results with a very short running time compared with principal component analysis (PCA), singular value decomposition (SVD), a dimensionality reduction algorithm based on the positive region (AR-PR), and a dimensionality reduction algorithm based on a discernibility matrix (AR-DM), and that REMLNC-HGEP has low average time consumption, a high model mining success ratio, and high estimation accuracy. We conclude that REMLNC-HGEP performs better than the regression methods.
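The dependence degree that DRND-DD builds on can be sketched with the textbook rough-set definition: the fraction of objects whose condition-attribute values determine the decision unambiguously (the size of the positive region divided by the size of the universe). The sketch below is not the DRND-DD algorithm itself. The function name `dependence_degree`, the toy decision table, and the assumption that band values have already been discretized are all hypothetical.

```python
from collections import defaultdict


def dependence_degree(rows, condition_idx, decision_idx):
    """Return |POS_C(D)| / |U| for a discretized decision table.

    rows: list of tuples, one per object.
    condition_idx: column indices used as condition attributes C.
    decision_idx: column index of the decision attribute D.
    """
    decisions = defaultdict(set)  # equivalence class -> decision values seen
    sizes = defaultdict(int)      # equivalence class -> number of objects
    for row in rows:
        key = tuple(row[i] for i in condition_idx)
        decisions[key].add(row[decision_idx])
        sizes[key] += 1
    # An equivalence class belongs to the positive region only if all of
    # its objects share a single decision value.
    positive = sum(sizes[k] for k, seen in decisions.items() if len(seen) == 1)
    return positive / len(rows)


# Hypothetical table: two discretized bands and a nitrogen-level label.
table = [
    (0, 0, "low"),
    (0, 1, "low"),
    (1, 0, "high"),
    (1, 1, "high"),
    (1, 1, "low"),
]

gamma_both = dependence_degree(table, [0, 1], 2)
gamma_first = dependence_degree(table, [0], 2)
gamma_second = dependence_degree(table, [1], 2)
print(gamma_both, gamma_first, gamma_second)
```

In this toy table the second band alone determines nothing, while the first band alone recovers part of the dependence of both bands together; a reduction method can use such comparisons to rank or drop bands.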