Funding: This research was supported by the MIC (Ministry of Information and Communication), Korea, under the ITRC (Information Technology Research Center) support program supervised by the IITA (Institute of Information Technology Assessment).
Abstract: Spatial decision making, such as choosing the best place to build a new department store, increasingly requires spatial data warehousing systems. Previous spatial data warehousing systems, however, only supported decision making on non-spatial data displayed on a map, and so could not adequately support spatial decision making. Spatial aggregations are proposed for spatial decision making in spatial data warehouses. The meaning of the aggregation operators was modified so that they apply to spatial data, and new spatial aggregations were defined. These aggregations support a hierarchical concept of the spatial measure. Using them, spatial analysis classified by non-spatial data is provided. A case study shows how to use these aggregations and how they support spatial decision making.
Abstract: Because web-based GIS processes large volumes of spatial geographic information over the Internet, the efficiency of spatial data query processing and transmission should be improved. This paper presents two methods for this purpose: division transmission and progressive transmission. In division transmission, a map is divided into several parts called “tiles”, and only tiles are transmitted at the request of a client. In progressive transmission, a map is split into several phase views based on the significance of its vertices; when a client requests a spatial object, the server produces the target object and transmits it progressively. To realize these methods, the paper proposes the “tile division” and “priority order estimation” algorithms and strategies for data transmission. Compared with traditional methods such as “map total transmission” and “layer transmission”, the proposed web-based GIS data transmission increases data transmission efficiency by a great margin.
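The division transmission idea can be illustrated with a minimal sketch. It assumes a uniform rectangular grid and a rectangular client viewport; the abstract does not describe the paper's actual “tile division” algorithm, so the function names `divide_into_tiles` and `tiles_for_request` and the grid layout are hypothetical.

```python
from typing import List, Tuple

# A box is (min_x, min_y, max_x, max_y) in map coordinates.
Box = Tuple[float, float, float, float]


def divide_into_tiles(extent: Box, rows: int, cols: int) -> List[Box]:
    """Split the map extent into rows x cols equally sized tiles (row-major order)."""
    min_x, min_y, max_x, max_y = extent
    width = (max_x - min_x) / cols
    height = (max_y - min_y) / rows
    tiles = []
    for r in range(rows):
        for c in range(cols):
            tiles.append((
                min_x + c * width,
                min_y + r * height,
                min_x + (c + 1) * width,
                min_y + (r + 1) * height,
            ))
    return tiles


def intersects(a: Box, b: Box) -> bool:
    """True when the interiors of two boxes overlap."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def tiles_for_request(tiles: List[Box], viewport: Box) -> List[int]:
    """Return the indices of the tiles a server would send for this viewport."""
    return [i for i, tile in enumerate(tiles) if intersects(tile, viewport)]
```

With a 4 × 4 grid over a 100 × 100 extent, a viewport covering (10, 10) to (30, 30) touches only four of the sixteen tiles, so the remaining twelve never need to be sent.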
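Abstract: To address the removal of noise points from point cloud data, a multi-scale point cloud denoising method based on an improved DBSCAN (density-based spatial clustering of applications with noise) algorithm is proposed. Statistical filtering is applied to pre-screen isolated outliers and remove large-scale noise from the point cloud. The DBSCAN algorithm is optimized to reduce its time complexity and to adjust its parameters adaptively; the point cloud is thereby divided into normal clusters, suspected clusters, and abnormal clusters, and the abnormal clusters are removed immediately. A distance consensus evaluation method is then used to judge the suspected clusters more finely: the distance between each suspected point and the surface fitted to its nearest normal points is computed to decide whether the point is abnormal, which effectively preserves the key features of the data and the model sensitivity. The method was used to denoise the point clouds of two hull blocks and was compared with other denoising algorithms. The results show that the method has advantages in denoising efficiency and feature preservation and accurately retains the geometric characteristics of the point cloud data.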
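The statistical pre-screening step can be sketched as follows. This is a common form of statistical outlier filtering (mean distance to the k nearest neighbours compared against a global threshold), not necessarily the exact filter used in the paper; the parameters `k` and `std_ratio` and the brute-force neighbour search are assumptions made for illustration.

```python
import math
import statistics


def statistical_filter(points, k=3, std_ratio=1.0):
    """Drop points whose mean k-nearest-neighbour distance is unusually large.

    points: list of coordinate tuples, e.g. (x, y, z).
    A point is kept when its mean distance to its k nearest neighbours is at
    most mean + std_ratio * std of that quantity over the whole cloud.
    """
    mean_knn = []
    for i, p in enumerate(points):
        # Brute-force O(n^2) search; fine for a sketch, too slow for real clouds.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        mean_knn.append(sum(dists[:k]) / k)
    threshold = statistics.mean(mean_knn) + std_ratio * statistics.pstdev(mean_knn)
    return [p for p, d in zip(points, mean_knn) if d <= threshold]
```

A filter of this kind only removes isolated, far-away points; noise that sits close to the surface is left for the later clustering and surface-distance steps.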
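Abstract: This paper uses satellite-monitored data to construct a composite nighttime light index that represents the level of urbanization, and applies the Superefficiency Ray Slacks-Based Measure (Super-RSBM) model and the Global Malmquist-Luenberger (GML) index to measure China's agricultural low-carbon total factor productivity (TFP) from 2000 to 2021. It empirically tests the impact of urbanization on China's agricultural low-carbon TFP and the underlying mechanisms, and examines the heterogeneous effects of two modes of urbanization, compact-intensive and scale-expanding, on agricultural low-carbon TFP. The study finds that, nationally, there is a significant U-shaped relationship between urbanization and agricultural low-carbon TFP, and that improvements in agricultural low-carbon TFP in neighboring regions have a demonstration effect on the local region. By region, the U-shaped relationship appears mainly in the moderate agricultural development zones, whereas in the optimized agricultural development zones urbanization and agricultural low-carbon TFP show a significant positive linear relationship. This indicates that the optimized development zones should play a leading role and help the moderate development zones cross the turning point of the U-shaped curve as early as possible, so that urbanization drives green agricultural development. The compact-intensive mode, which deepens urbanization, significantly raises agricultural low-carbon TFP, whereas the scale-expanding mode, which broadens urbanization, lowers it. Agricultural low-carbon technological progress, rural labor transfer, scale effects, extension of the agricultural industry chain, and increases in rural residents' disposable income are the main channels through which urbanization affects agricultural low-carbon TFP.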
Funding: Supported by the National Key Research and Development Program of China (2018YFB1003700), the Scientific and Technological Support Project (Society) of Jiangsu Province (BE2016776), the “333” project of Jiangsu Province (BRA2017228, BRA2017401), and the Talent Project in Six Fields of Jiangsu Province (2015-JNHB-012).
Abstract: For imbalanced datasets, the focus of classification is to identify samples of the minority class. Current data mining algorithms do not perform well enough on imbalanced datasets. The synthetic minority over-sampling technique (SMOTE) is specifically designed for learning from imbalanced datasets; it generates synthetic minority class examples by interpolating between nearby minority class examples. However, SMOTE encounters the overgeneralization problem. Density-based spatial clustering of applications with noise (DBSCAN) is not rigorous when dealing with samples near the borderline. We optimize the DBSCAN algorithm for this problem to make clustering more reasonable. This paper integrates the optimized DBSCAN with SMOTE and proposes a density-based synthetic minority over-sampling technique (DSMOTE). First, the optimized DBSCAN is used to divide the minority class samples into three groups: core samples, borderline samples, and noise samples. The noise samples of the minority class are then removed so that more effective samples are synthesized. To make full use of the information in core samples and borderline samples, different strategies are used to over-sample each group. Experiments show that DSMOTE achieves better results than SMOTE and Borderline-SMOTE in terms of precision, recall and F-value.
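The interpolation step that SMOTE relies on can be sketched as below. This is plain SMOTE only; it does not implement the optimized DBSCAN grouping or the separate core and borderline strategies of DSMOTE, which the abstract does not specify. The function name `smote`, the parameter `k`, and the fixed random seed are assumptions for illustration.

```python
import math
import random


def smote(minority, n_new, k=2, seed=0):
    """Generate n_new synthetic minority samples by neighbour interpolation.

    minority: list of feature tuples belonging to the minority class.
    For each new sample, pick a random minority point, pick one of its k
    nearest minority neighbours, and place the new point at a random
    position on the segment between them.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(tuple(b + gap * (n - b) for b, n in zip(base, neighbour)))
    return synthetic
```

Because every synthetic point lies on a segment between two existing minority points, a noisy minority sample can pull synthetic points into majority territory, which is why removing noise samples before over-sampling matters.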
Funding: Open access funding provided by University of Eastern Finland (UEF) including Kuopio University Hospital.
Abstract: Raster-type forest inventory data, with site and growing stock variables interpreted for small square-shaped grid cells, are increasingly available for forest planning. In Finland, there are two sources of this type of lattice data: the multisource national forest inventory and the inventory based on airborne laser scanning (ALS). In both cases, stand variables are interpreted for 16 m × 16 m cells. Both data sources cover all private forests of Finland and are freely available for forest planning. This study analyzed different ways to use the ALS raster data in forest planning. The analyses were conducted for a grid of 375 × 375 cells (140,625 cells, of which 97,893 were productive forest). The basic alternatives were to use the cells as calculation units throughout the planning process, or to aggregate the cells into segments before planning calculations. The use of cells made it necessary to use spatial optimization to aggregate cuttings and other treatments into blocks large enough for the practical implementation of the plan. In addition, allowing premature cuttings in a part of the cells was a prerequisite for compact treatment areas. The use of segments led to 5–9% higher growth predictions than calculations based on cells. In addition, the areas of the most common fertility classes were overestimated and the areas of rare site classes were underestimated when segments were used. The shape of the treatment blocks was more irregular in cell-based planning. Using cells instead of segments as calculation units made the computing time of the whole planning process 20 times longer when the number of grid cells was approximately 100,000.
Funding: Supported by the National Natural Science Foundation of China (41130530, 91325301, 41431177, 41571212, 41401237), the Project of "One-Three-Five" Strategic Planning & Frontier Sciences of the Institute of Soil Science, Chinese Academy of Sciences (ISSASIP1622), the Government Interest Related Program between Canadian Space Agency and Agriculture and Agri-Food, Canada (13MOA01002), and the Natural Science Research Program of Jiangsu Province (14KJA170001).
Abstract: Conventional soil maps generally contain one or more soil types within a single soil polygon, but their geographic locations within the polygon are not specified. This restricts current applications of the maps in site-specific agricultural management and environmental modelling. We examined the utility of legacy pedon data for disaggregating soil polygons and the effectiveness of similarity-based prediction for making use of under- or over-sampled legacy pedon data in the disaggregation. The method consisted of three steps. First, environmental similarities between the pedon sites and each location were computed based on soil formative environmental factors. Second, according to the soil types of the pedon sites, the similarities were aggregated to derive a similarity distribution for each soil type. Third, a hardening process was performed on the maps to allocate candidate soil types within the polygons. The study was conducted at the soil subgroup level in a semi-arid area in Manitoba, Canada. Based on 186 independent pedon sites, the evaluation of the disaggregated map of soil subgroups showed an overall accuracy of 67% and a Kappa statistic of 0.62. The map represented a better spatial pattern of soil subgroups in both detail and accuracy than a dominant soil subgroup map, which was commonly used in practice. Incorrect predictions mainly occurred in the agricultural plain area and among soil subgroups that are very similar in taxonomy, indicating that new environmental covariates need to be developed. We concluded that the combination of legacy pedon data with similarity-based prediction is an effective solution for soil polygon disaggregation.
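The three steps can be sketched for a single location as follows. The abstract does not give the similarity function or the aggregation rule, so this sketch assumes a range-normalized (Gower-style) similarity averaged over covariates and takes the maximum similarity per soil type; the names `similarity` and `harden` and the covariate layout are hypothetical.

```python
def similarity(a, b, ranges):
    """Mean over covariates of 1 - |difference| / covariate range (assumed form)."""
    return sum(1 - abs(x - y) / r for x, y, r in zip(a, b, ranges)) / len(ranges)


def harden(location, pedons, candidates, ranges):
    """Allocate one candidate soil type of the enclosing polygon to a location.

    location:   covariate tuple at the location, e.g. (elevation, slope).
    pedons:     list of (soil_type, covariate tuple) for the legacy pedon sites.
    candidates: soil types listed for the polygon containing the location.
    Returns (chosen soil type, per-type similarity dictionary).
    Raises ValueError when no pedon matches any candidate type.
    """
    per_type = {}
    for soil_type, covariates in pedons:
        if soil_type not in candidates:
            continue
        s = similarity(location, covariates, ranges)            # step 1
        per_type[soil_type] = max(per_type.get(soil_type, 0.0), s)  # step 2 (max assumed)
    return max(per_type, key=per_type.get), per_type            # step 3: hardening
```

Restricting the choice to the polygon's candidate types is what makes this a disaggregation of the existing map and not a free prediction.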
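Abstract: A clustering algorithm based on timestamps and keywords is proposed to address the problem that alarm data come in many types and key information is hard to extract. First, K-Means clustering is performed on the latest occurrence time in the alarm data. Second, a second round of K-Means clustering is performed on the alarm start time. Third, the Density-Based Spatial Clustering of Application with Noise (DBSCAN) algorithm is used to cluster each keyword column. Finally, the results are integrated and a description of the correlations is given. Experimental results show that the alarm data analysis and processing model built with this clustering algorithm achieves an average compression rate of 79.28% and an average accuracy of 93.41%; it can effectively improve the ability to describe existing alarm data concretely and reduce the complexity of understanding alarm data.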
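The first stage, K-Means on a single timestamp column, can be sketched in one dimension. The abstract does not state how k is chosen or how centers are initialized, so this sketch assumes timestamps expressed as numbers (for example epoch seconds) and a deterministic initialization from evenly spaced order statistics; the function name `kmeans_1d` is hypothetical.

```python
def kmeans_1d(values, k, iters=100):
    """Cluster scalar values (e.g. alarm timestamps in seconds) with K-Means.

    Returns (labels, centers), where labels[i] is the cluster index of values[i].
    """
    ordered = sorted(values)
    # Deterministic start: k evenly spaced order statistics (an assumption,
    # chosen so repeated runs give the same result).
    centers = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda c: abs(v - centers[c]))
            groups[nearest].append(v)
        # Keep the old center when a cluster ends up empty.
        updated = [sum(g) / len(g) if g else centers[i] for i, g in enumerate(groups)]
        if updated == centers:
            break
        centers = updated
    labels = [min(range(k), key=lambda c: abs(v - centers[c])) for v in values]
    return labels, centers
```

The second stage would apply the same routine to the start-time column within each first-stage cluster, before the keyword columns are clustered with DBSCAN.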
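Abstract: To address the large number and the complex, diverse types of targets in situation assessment in the naval battlefield environment, data clustering is first introduced into the target grouping stage of situation assessment, and density clustering based on the DBSCAN (density-based spatial clustering of applications with noise) algorithm is proposed. It can cluster data of arbitrary shape, traverses the data well, and can group targets in the battlefield environment comprehensively and reasonably. Then the basic computational steps of the algorithm are given, and a worked example with target groups from a known battlefield situation is used to verify correctness. Finally, the algorithm is compared with the partition-based K-means algorithm and the hierarchy-based AGNES (AGglomerative NESting) algorithm, demonstrating its effectiveness and reasonableness.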
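The basic steps of density clustering can be sketched with a textbook DBSCAN. This is the standard algorithm with a brute-force neighbour search, not the paper's specific formulation; treating targets as plain coordinate tuples and the parameter values used with it are assumptions for illustration.

```python
import math

NOISE = -1


def dbscan(points, eps, min_pts):
    """Label each point with a cluster index (0, 1, ...) or NOISE.

    points:  list of coordinate tuples (e.g. target positions).
    eps:     neighbourhood radius.
    min_pts: minimum neighbourhood size (including the point itself)
             for a point to count as a core point.
    """
    labels = [None] * len(points)

    def region(i):
        # Indices of all points within eps of point i, including i itself.
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        neighbours = region(i)
        if len(neighbours) < min_pts:
            labels[i] = NOISE  # may later be relabelled as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in neighbours if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins the cluster, not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = region(j)
            if len(reachable) >= min_pts:
                queue.extend(reachable)  # j is a core point, keep growing
    return labels
```

Unlike K-means, this needs no preset number of groups and leaves isolated targets unassigned, which suits grouping when the number of formations is unknown.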