This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting de...This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting decision tree(GBDT), random forest(RF) and information value(InV) models, were used, and the performances were assessed and compared. In total, 202 landslides were mapped by using a series of field surveys, aerial photographs, and reviews of historical and bibliographical data. Nine causative factors were then considered in landslide susceptibility map generation by using the GBDT, RF and InV models. All of the maps of the causative factors were resampled to a resolution of 28.5 m. Of the 486289 pixels in the area,28526 pixels were landslide pixels, and 457763 pixels were non-landslide pixels. Finally, landslide susceptibility maps were generated by using the three machine learning models, and their performances were assessed through receiver operating characteristic(ROC) curves, the sensitivity, specificity,overall accuracy(OA), and kappa coefficient(KAPPA). The results showed that the GBDT, RF and In V models in overall produced reasonable accurate landslide susceptibility maps. Among these three methods, the GBDT method outperforms the other two machine learning methods, which can provide strong technical support for producing landslide susceptibility maps in TGR.展开更多
In many practical situation, some of the attribute values for an object may be interval and set-valued. This paper introduces the interval and set-valued information systems and decision systems. According to the sema...In many practical situation, some of the attribute values for an object may be interval and set-valued. This paper introduces the interval and set-valued information systems and decision systems. According to the semantic relation of attribute values, interval and set-valued information systems can be classified into two categories: disjunctive (Type 1) and conjunctive (Type 2) systems. In this paper, we mainly focus on semantic interpretation of Type 1. Then, we define a new fuzzy preference relation and construct a fuzzy rough set model for interval and set-valued information systems. Moreover, based on the new fuzzy preference relation, the concepts of the significance measure of condition attributes and the relative significance measure of condition attributes are given in interval and set-valued decision information systems by the introduction of fuzzy positive region and the dependency degree. And on this basis, a heuristic algorithm for calculating fuzzy positive region reduction in interval and set-valued decision information systems is given. Finally, we give an illustrative example to substantiate the theoretical arguments. The results will help us to gain much more insights into the meaning of fuzzy rough set theory. Furthermore, it has provided a new perspective to study the attribute reduction problem in decision systems.展开更多
Karst rocky desertification is a phenomenon of land degradation as a result of affection by the interaction of natural and human factors.In the past,in the rocky desertification areas,supervised classification and uns...Karst rocky desertification is a phenomenon of land degradation as a result of affection by the interaction of natural and human factors.In the past,in the rocky desertification areas,supervised classification and unsupervised classification are often used to classify the remote sensing image.But they only use pixel brightness characteristics to classify it.So the classification accuracy is low and can not meet the needs of practical application.Decision tree classification is a new technology for remote sensing image classification.In this study,we select the rocky desertification areas Kaizuo Township as a case study,use the ASTER image data,DEM and lithology data,by extracting the normalized difference vegetation index,ratio vegetation index,terrain slope and other data to establish classification rules to build decision trees.In the ENVI software support,we access the classification images.By calculating the classification accuracy and kappa coefficient,we find that better classification results can be obtained,desertification information can be extracted automatically and if more remote sensing image bands used,higher resolution DEM employed and less errors data reduced during processing,classification accuracy can be improve further.展开更多
The ID3 algorithm is a classical learning algorithm of decision tree in data mining.The algorithm trends to choosing the attribute with more values,affect the efficiency of classification and prediction for building a...The ID3 algorithm is a classical learning algorithm of decision tree in data mining.The algorithm trends to choosing the attribute with more values,affect the efficiency of classification and prediction for building a decision tree.This article proposes a new approach based on an improved ID3 algorithm.The new algorithm introduces the importance factor λ when calculating the information entropy.It can strengthen the label of important attributes of a tree and reduce the label of non-important attributes.The algorithm overcomes the flaw of the traditional ID3 algorithm which tends to choose the attributes with more values,and also improves the efficiency and flexibility in the process of generating decision trees.展开更多
The trend toward designing an intelligent distribution system based on students’individual differences and individual needs has taken precedence in view of the traditional dormitory distribution system,which neglects...The trend toward designing an intelligent distribution system based on students’individual differences and individual needs has taken precedence in view of the traditional dormitory distribution system,which neglects the students’personality traits,causes dormitory disputes,and affects the students’quality of life and academic quality.This paper collects freshmen's data according to college students’personal preferences,conducts a classification comparison,uses the decision tree classification algorithm based on the information gain principle as the core algorithm of dormitory allocation,determines the description rules of students’personal preferences and decision tree classification preferences,completes the conceptual design of the database of entity relations and data dictionaries,meets students’personality classification requirements for the dormitory,and lays the foundation for the intelligent dormitory allocation system.展开更多
In many decision making tasks,the features and decision are ordinal.Several ordinal classification learning algorithms have been developed in recent years,it is shown that these algorithms are sensitive to noisy sampl...In many decision making tasks,the features and decision are ordinal.Several ordinal classification learning algorithms have been developed in recent years,it is shown that these algorithms are sensitive to noisy samples and do not work in real-world applications.In this work,we propose a new measure of feature quality, called rank mutual information.Then,we design an ordinal decision tree(REOT) construction technique based on rank mutual information.The theoretic and experimental analysis shows that the proposed algorithm is effective.展开更多
An improved decision tree method for web information retrieval with self-mapping attributes is proposed.The self-mapping tree has a value of self-mapping attribute in its internal node,and information based on dissimi...An improved decision tree method for web information retrieval with self-mapping attributes is proposed.The self-mapping tree has a value of self-mapping attribute in its internal node,and information based on dissimilarity between a pair of mapping sequences.This method selects self-mapping which exists between data by exhaustive search based on relation and attribute information.Experimental results confirm that the improved method constructs comprehensive and accurate decision tree.Moreover,an example shows that the self-mapping decision tree is promising for data mining and knowledge discovery.展开更多
European Community policy concerning water is placing increasing demands on the acquisition of information about the quality of aquatic environments. The cost of this information has led to a reflection on the rationa...European Community policy concerning water is placing increasing demands on the acquisition of information about the quality of aquatic environments. The cost of this information has led to a reflection on the rationalization of monitoring networks and, therefore, on the economic value of information produced by these networks. The aim of this article is to contribute to this reflection. To do so, we used the Bayesian framework to define the value of additional information in relation to the following three parameters: initial assumptions (prior probabilities) on the states of nature, costs linked to a poor decision (error costs) and accuracy of additional information. We then analyzed the impact of these parameters on this value, particularly the combined role of prior probabilities and error costs that increased or decreased the value of information depending on the initial uncertainty level. We then illustrated the results using a case study of a stream in the Bas-Rhin department in France.展开更多
[Objective] This study aimed to improve the accuracy of remote sensing classification for Dongting Lake Wetland.[Method] Based on the TM data and ground GIS information of Donting Lake,the decision tree classification...[Objective] This study aimed to improve the accuracy of remote sensing classification for Dongting Lake Wetland.[Method] Based on the TM data and ground GIS information of Donting Lake,the decision tree classification method was established through the expert classification knowledge base.The images of Dongting Lake wetland were classified into water area,mudflat,protection forest beach,Carem spp beach,Phragmites beach,Carex beach and other water body according to decision tree layers.[Result] The accuracy of decision tree classification reached 80.29%,which was much higher than the traditional method,and the total Kappa coefficient was 0.883 9,indicating that the data accuracy of this method could fulfill the requirements of actual practice.In addition,the image classification results based on knowledge could solve some classification mistakes.[Conclusion] Compared with the traditional method,the decision tree classification based on rules could classify the images by using various conditions,which reduced the data processing time and improved the classification accuracy.展开更多
There are many kinds of real options,which are valuable,in each phase of the lifetime of an information technology(IT)project.However,in the current IT investment decision theory,real options that embedded in IT proje...There are many kinds of real options,which are valuable,in each phase of the lifetime of an information technology(IT)project.However,in the current IT investment decision theory,real options that embedded in IT projects are not considered. In this paper, the process of IT project decision and implementation is fully analyzed, the real options that may be embedded in an IT project are identified, and a real option analysis (ROA) method is proposed for evaluation of an IT project under uncertain business environment. ROA employs Black-Scholes expansion model and cancels the assumption that the cost of project is certain. The numerical example manifests that the ROA can better evaluate IT project and select the IT investment alternative. Finally, a road map is provided to help selecting the suitable evaluation method to make IT investment decision.展开更多
Business Analytics is one of the vital processes that must be incorpo-rated into any business.It supports decision-makers in analyzing and predicting future trends based on facts(Data-driven decisions),especially when...Business Analytics is one of the vital processes that must be incorpo-rated into any business.It supports decision-makers in analyzing and predicting future trends based on facts(Data-driven decisions),especially when dealing with a massive amount of business data.Decision Trees are essential for business ana-lytics to predict business opportunities and future trends that can retain corpora-tions’competitive advantage and survival and improve their business value.This research proposes a tree-based predictive model for business analytics.The model is developed based on ranking business features and gradient-boosted trees.For validation purposes,the model is tested on a real-world dataset of Universal Bank to predict personal loan acceptance.It is validated based on Accuracy,Precision,Recall,and F-score.The experimentfindings show that the proposed model can predict personal loan acceptance efficiently and effectively with better accuracy than the traditional tree-based models.The model can also deal with a massive amount of business data and support corporations’decision-making process.展开更多
This paper describes an approach to ethical decision design. It discusses the use of several mathematical tools applicable to the world of economics and business. It then demonstrates how the tools analogously extend ...This paper describes an approach to ethical decision design. It discusses the use of several mathematical tools applicable to the world of economics and business. It then demonstrates how the tools analogously extend themselves to making ethical decisions as supported by personal values. A review of the mathematics, including the design of a decision tree, the concept of mathematical expectation, and mathematical modeling are included.展开更多
基金This work was supported in part by the National Natural Science Foundation of China(61601418,41602362,61871259)in part by the Opening Foundation of Hunan Engineering and Research Center of Natural Resource Investigation and Monitoring(2020-5)+1 种基金in part by the Qilian Mountain National Park Research Center(Qinghai)(grant number:GKQ2019-01)in part by the Geomatics Technology and Application Key Laboratory of Qinghai Province,Grant No.QHDX-2019-01.
文摘This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting decision tree(GBDT), random forest(RF) and information value(InV) models, were used, and the performances were assessed and compared. In total, 202 landslides were mapped by using a series of field surveys, aerial photographs, and reviews of historical and bibliographical data. Nine causative factors were then considered in landslide susceptibility map generation by using the GBDT, RF and InV models. All of the maps of the causative factors were resampled to a resolution of 28.5 m. Of the 486289 pixels in the area,28526 pixels were landslide pixels, and 457763 pixels were non-landslide pixels. Finally, landslide susceptibility maps were generated by using the three machine learning models, and their performances were assessed through receiver operating characteristic(ROC) curves, the sensitivity, specificity,overall accuracy(OA), and kappa coefficient(KAPPA). The results showed that the GBDT, RF and In V models in overall produced reasonable accurate landslide susceptibility maps. Among these three methods, the GBDT method outperforms the other two machine learning methods, which can provide strong technical support for producing landslide susceptibility maps in TGR.
文摘In many practical situation, some of the attribute values for an object may be interval and set-valued. This paper introduces the interval and set-valued information systems and decision systems. According to the semantic relation of attribute values, interval and set-valued information systems can be classified into two categories: disjunctive (Type 1) and conjunctive (Type 2) systems. In this paper, we mainly focus on semantic interpretation of Type 1. Then, we define a new fuzzy preference relation and construct a fuzzy rough set model for interval and set-valued information systems. Moreover, based on the new fuzzy preference relation, the concepts of the significance measure of condition attributes and the relative significance measure of condition attributes are given in interval and set-valued decision information systems by the introduction of fuzzy positive region and the dependency degree. And on this basis, a heuristic algorithm for calculating fuzzy positive region reduction in interval and set-valued decision information systems is given. Finally, we give an illustrative example to substantiate the theoretical arguments. The results will help us to gain much more insights into the meaning of fuzzy rough set theory. Furthermore, it has provided a new perspective to study the attribute reduction problem in decision systems.
文摘Karst rocky desertification is a phenomenon of land degradation as a result of affection by the interaction of natural and human factors.In the past,in the rocky desertification areas,supervised classification and unsupervised classification are often used to classify the remote sensing image.But they only use pixel brightness characteristics to classify it.So the classification accuracy is low and can not meet the needs of practical application.Decision tree classification is a new technology for remote sensing image classification.In this study,we select the rocky desertification areas Kaizuo Township as a case study,use the ASTER image data,DEM and lithology data,by extracting the normalized difference vegetation index,ratio vegetation index,terrain slope and other data to establish classification rules to build decision trees.In the ENVI software support,we access the classification images.By calculating the classification accuracy and kappa coefficient,we find that better classification results can be obtained,desertification information can be extracted automatically and if more remote sensing image bands used,higher resolution DEM employed and less errors data reduced during processing,classification accuracy can be improve further.
文摘The ID3 algorithm is a classical learning algorithm of decision tree in data mining.The algorithm trends to choosing the attribute with more values,affect the efficiency of classification and prediction for building a decision tree.This article proposes a new approach based on an improved ID3 algorithm.The new algorithm introduces the importance factor λ when calculating the information entropy.It can strengthen the label of important attributes of a tree and reduce the label of non-important attributes.The algorithm overcomes the flaw of the traditional ID3 algorithm which tends to choose the attributes with more values,and also improves the efficiency and flexibility in the process of generating decision trees.
文摘The trend toward designing an intelligent distribution system based on students’individual differences and individual needs has taken precedence in view of the traditional dormitory distribution system,which neglects the students’personality traits,causes dormitory disputes,and affects the students’quality of life and academic quality.This paper collects freshmen's data according to college students’personal preferences,conducts a classification comparison,uses the decision tree classification algorithm based on the information gain principle as the core algorithm of dormitory allocation,determines the description rules of students’personal preferences and decision tree classification preferences,completes the conceptual design of the database of entity relations and data dictionaries,meets students’personality classification requirements for the dormitory,and lays the foundation for the intelligent dormitory allocation system.
基金supported by National Natural Science Foundation of China under Grant 60703013 and 10978011Key Program of National Natural Science Foundation of China under Grant 60932008+1 种基金National Science Fund for Distinguished Young Scholars under Grant 50925625China Postdoctoral Science Foundation.
文摘In many decision making tasks,the features and decision are ordinal.Several ordinal classification learning algorithms have been developed in recent years,it is shown that these algorithms are sensitive to noisy samples and do not work in real-world applications.In this work,we propose a new measure of feature quality, called rank mutual information.Then,we design an ordinal decision tree(REOT) construction technique based on rank mutual information.The theoretic and experimental analysis shows that the proposed algorithm is effective.
文摘An improved decision tree method for web information retrieval with self-mapping attributes is proposed.The self-mapping tree has a value of self-mapping attribute in its internal node,and information based on dissimilarity between a pair of mapping sequences.This method selects self-mapping which exists between data by exhaustive search based on relation and attribute information.Experimental results confirm that the improved method constructs comprehensive and accurate decision tree.Moreover,an example shows that the self-mapping decision tree is promising for data mining and knowledge discovery.
文摘European Community policy concerning water is placing increasing demands on the acquisition of information about the quality of aquatic environments. The cost of this information has led to a reflection on the rationalization of monitoring networks and, therefore, on the economic value of information produced by these networks. The aim of this article is to contribute to this reflection. To do so, we used the Bayesian framework to define the value of additional information in relation to the following three parameters: initial assumptions (prior probabilities) on the states of nature, costs linked to a poor decision (error costs) and accuracy of additional information. We then analyzed the impact of these parameters on this value, particularly the combined role of prior probabilities and error costs that increased or decreased the value of information depending on the initial uncertainty level. We then illustrated the results using a case study of a stream in the Bas-Rhin department in France.
文摘[Objective] This study aimed to improve the accuracy of remote sensing classification for Dongting Lake Wetland.[Method] Based on the TM data and ground GIS information of Donting Lake,the decision tree classification method was established through the expert classification knowledge base.The images of Dongting Lake wetland were classified into water area,mudflat,protection forest beach,Carem spp beach,Phragmites beach,Carex beach and other water body according to decision tree layers.[Result] The accuracy of decision tree classification reached 80.29%,which was much higher than the traditional method,and the total Kappa coefficient was 0.883 9,indicating that the data accuracy of this method could fulfill the requirements of actual practice.In addition,the image classification results based on knowledge could solve some classification mistakes.[Conclusion] Compared with the traditional method,the decision tree classification based on rules could classify the images by using various conditions,which reduced the data processing time and improved the classification accuracy.
文摘There are many kinds of real options,which are valuable,in each phase of the lifetime of an information technology(IT)project.However,in the current IT investment decision theory,real options that embedded in IT projects are not considered. In this paper, the process of IT project decision and implementation is fully analyzed, the real options that may be embedded in an IT project are identified, and a real option analysis (ROA) method is proposed for evaluation of an IT project under uncertain business environment. ROA employs Black-Scholes expansion model and cancels the assumption that the cost of project is certain. The numerical example manifests that the ROA can better evaluate IT project and select the IT investment alternative. Finally, a road map is provided to help selecting the suitable evaluation method to make IT investment decision.
基金Taif University Researchers Supporting Project number(TURSP-2020/10),Taif University,Taif,Saudi Arabia.
文摘Business Analytics is one of the vital processes that must be incorpo-rated into any business.It supports decision-makers in analyzing and predicting future trends based on facts(Data-driven decisions),especially when dealing with a massive amount of business data.Decision Trees are essential for business ana-lytics to predict business opportunities and future trends that can retain corpora-tions’competitive advantage and survival and improve their business value.This research proposes a tree-based predictive model for business analytics.The model is developed based on ranking business features and gradient-boosted trees.For validation purposes,the model is tested on a real-world dataset of Universal Bank to predict personal loan acceptance.It is validated based on Accuracy,Precision,Recall,and F-score.The experimentfindings show that the proposed model can predict personal loan acceptance efficiently and effectively with better accuracy than the traditional tree-based models.The model can also deal with a massive amount of business data and support corporations’decision-making process.
文摘This paper describes an approach to ethical decision design. It discusses the use of several mathematical tools applicable to the world of economics and business. It then demonstrates how the tools analogously extend themselves to making ethical decisions as supported by personal values. A review of the mathematics, including the design of a decision tree, the concept of mathematical expectation, and mathematical modeling are included.