Regional Landslide Susceptibility Zonation(LSZ) is always challenged by the available amount of field data, especially in southwestern China where large mountainous areas and limited field information coincide. Statis...Regional Landslide Susceptibility Zonation(LSZ) is always challenged by the available amount of field data, especially in southwestern China where large mountainous areas and limited field information coincide. Statistical learning algorithms are believed to be superior to traditional statistical algorithms for their data adaptability. The aim of the paper is to evaluate how statistical learning algorithms perform on regional LSZ with limited field data. The focus is on three statistical learning algorithms, Logistic Regression(LR), Artificial Neural Networks(ANN) and Support Vector Machine(SVM). Hanzhong city, a landslide prone area in southwestern China is taken as a study case. Nine environmental factors are selected as inputs. The accuracies of the resulting LSZ maps are evaluated through landslide density analysis(LDA), receiver operating characteristic(ROC) curves and Kappa index statistics. The dependence of the algorithm on the size of field samples is examined by varying the sizes of the training set. The SVM has proven to be the most accurate and the most stable algorithm at small training set sizes and on all known landslide sizes. The accuracy of SVM shows a steadilyincreasing trend and reaches a high level at a small size of the training set, while accuracies of LR and ANN algorithms show distinct fluctuations. The geomorphological interpretations confirm the strength of SVM on all landslide sizes. Our results show that the strengths of SVM in generalization capability and model robustness make it an appropriate and efficient tool for regional LSZ with limited landslide field samples.展开更多
Large high-dimensional data have posed great challenges to existing algorithms for frequent itemsets mining.To solve the problem,a hybrid method,consisting of a novel row enumeration algorithm and a column enumeration...Large high-dimensional data have posed great challenges to existing algorithms for frequent itemsets mining.To solve the problem,a hybrid method,consisting of a novel row enumeration algorithm and a column enumeration algorithm,is proposed.The intention of the hybrid method is to decompose the mining task into two subtasks and then choose appropriate algorithms to solve them respectively.The novel algorithm,i.e.,Inter-transaction is based on the characteristic that there are few common items between or among long transactions.In addition,an optimization technique is adopted to improve the performance of the intersection of bit-vectors.Experiments on synthetic data show that our method achieves high performance in large high-dimensional data.展开更多
Water vapor plays a crucial role in atmospheric processes that act over a wide range of temporal and spatial scales, from global climate to micrometeorology. The determination of water vapor distribution in the atmosp...Water vapor plays a crucial role in atmospheric processes that act over a wide range of temporal and spatial scales, from global climate to micrometeorology. The determination of water vapor distribution in the atmosphere and its changing pattern is very important. Although atmospheric scientists have developed a variety of means to measure precipitable water vapor(PWV) using remote sensing data that have been widely used, there are some limitations in using one kind satellite measurements for PWV retrieval over land. In this paper, a new algorithm is proposed for retrieving PWV over land by combining different kinds of remote sensing data and it would work well under the cloud weather conditions. The PWV retrieval algorithm based on near infrared data is more suitable to clear sky conditions with high precision. The 23.5 GHz microwave remote sensing data is sensitive to water vapor and powerful in cloud-covered areas because of its longer wavelengths that permit viewing into and through the atmosphere. Therefore, the PWV retrieval results from near infrared data and the indices combined by microwave bands remote sensing data which are sensitive to water vapor will be regressed to generate the equation for PWV retrieval under cloud covered areas. The algorithm developed in this paper has the potential to detect PWV under all weather conditions and makes an excellent complement to PWV retrieved by near infrared data. Different types of surface exert different depolarization effects on surface emissions, which would increase the complexity of the algorithm. In this paper, MODIS surface classification data was used to consider this influence. Compared with the GPS results, the root mean square error of our algorithm is 8 mm for cloud covered area. Regional consistency was found between the results from MODIS and our algorithm. Our algorithm can yield reasonable results on the surfaces covered by cloud where MODIS cannot be used to retrieve PWV.展开更多
基金supported by the open fund of Key Laboratory of Geoscience Spatial Information Technology, Ministry of Land and Resource of the China (Grant No. KLGSIT2013-15)The GIS-studio (www.gis-studio.nl) of the Institute for Biodiversity and Ecosystem Dynamics (IBED) is acknowledged for computational support
文摘Regional Landslide Susceptibility Zonation(LSZ) is always challenged by the available amount of field data, especially in southwestern China where large mountainous areas and limited field information coincide. Statistical learning algorithms are believed to be superior to traditional statistical algorithms for their data adaptability. The aim of the paper is to evaluate how statistical learning algorithms perform on regional LSZ with limited field data. The focus is on three statistical learning algorithms, Logistic Regression(LR), Artificial Neural Networks(ANN) and Support Vector Machine(SVM). Hanzhong city, a landslide prone area in southwestern China is taken as a study case. Nine environmental factors are selected as inputs. The accuracies of the resulting LSZ maps are evaluated through landslide density analysis(LDA), receiver operating characteristic(ROC) curves and Kappa index statistics. The dependence of the algorithm on the size of field samples is examined by varying the sizes of the training set. The SVM has proven to be the most accurate and the most stable algorithm at small training set sizes and on all known landslide sizes. The accuracy of SVM shows a steadilyincreasing trend and reaches a high level at a small size of the training set, while accuracies of LR and ANN algorithms show distinct fluctuations. The geomorphological interpretations confirm the strength of SVM on all landslide sizes. Our results show that the strengths of SVM in generalization capability and model robustness make it an appropriate and efficient tool for regional LSZ with limited landslide field samples.
基金The work was supported in part by Research Fund for the Doctoral Program of Higher Education of China(No.20060255006)
文摘Large high-dimensional data have posed great challenges to existing algorithms for frequent itemsets mining.To solve the problem,a hybrid method,consisting of a novel row enumeration algorithm and a column enumeration algorithm,is proposed.The intention of the hybrid method is to decompose the mining task into two subtasks and then choose appropriate algorithms to solve them respectively.The novel algorithm,i.e.,Inter-transaction is based on the characteristic that there are few common items between or among long transactions.In addition,an optimization technique is adopted to improve the performance of the intersection of bit-vectors.Experiments on synthetic data show that our method achieves high performance in large high-dimensional data.
基金supported by the National Natural Science Foundation of China(Grant Nos.4147130541405036&41301653)+1 种基金the Sichuan Youth Science Foundation(Grant No.2015JQ0037)the Chongqing Meteorological Bureau Open Fund(Grant No.KFJJ-201402)
文摘Water vapor plays a crucial role in atmospheric processes that act over a wide range of temporal and spatial scales, from global climate to micrometeorology. The determination of water vapor distribution in the atmosphere and its changing pattern is very important. Although atmospheric scientists have developed a variety of means to measure precipitable water vapor(PWV) using remote sensing data that have been widely used, there are some limitations in using one kind satellite measurements for PWV retrieval over land. In this paper, a new algorithm is proposed for retrieving PWV over land by combining different kinds of remote sensing data and it would work well under the cloud weather conditions. The PWV retrieval algorithm based on near infrared data is more suitable to clear sky conditions with high precision. The 23.5 GHz microwave remote sensing data is sensitive to water vapor and powerful in cloud-covered areas because of its longer wavelengths that permit viewing into and through the atmosphere. Therefore, the PWV retrieval results from near infrared data and the indices combined by microwave bands remote sensing data which are sensitive to water vapor will be regressed to generate the equation for PWV retrieval under cloud covered areas. The algorithm developed in this paper has the potential to detect PWV under all weather conditions and makes an excellent complement to PWV retrieved by near infrared data. Different types of surface exert different depolarization effects on surface emissions, which would increase the complexity of the algorithm. In this paper, MODIS surface classification data was used to consider this influence. Compared with the GPS results, the root mean square error of our algorithm is 8 mm for cloud covered area. Regional consistency was found between the results from MODIS and our algorithm. Our algorithm can yield reasonable results on the surfaces covered by cloud where MODIS cannot be used to retrieve PWV.