Funding: Supported by the Science and Technology Plan Project of Guangdong Province (2009B010900026, 2009CD058, 2009CD078, 2009CD079, 2009CD080); the Special Funds for the Support Program of Development of Modern Information Service Industry of Guangdong Province (06120840B0370124); and the Funded Fund Project of South China Agricultural University (2007K017).
Abstract: Association rules and C4.5 rules can overcome the shortcomings of traditional land evaluation methods and improve the intelligibility and efficiency of land evaluation knowledge. To compare these two kinds of classification rules in application, two fuzzy classifiers were built by combining each rule set with a fuzzy decision algorithm, using data from the Second General Soil Survey of Guangdong Province. The experiments showed that the fuzzy classifier based on association rules obtains a higher accuracy rate but with a more complex calculation process and more computational overhead, while the fuzzy classifier based on C4.5 rules obtains a slightly lower accuracy but with faster and simpler computation.
Funding: Supported by the State Foundations of Ph.D. Units (20020141013) and the NSF of China (10001007).
Abstract: Some properties of a class of quasi-differentiable functions (the difference of two finite convex functions) are considered in this paper, and the convergence of the steepest descent algorithm for unconstrained and constrained quasi-differentiable programming is proved.
Abstract: A number of clustering algorithms have been used to analyze many databases in the field of image clustering. The main objective of this research was a comparative analysis of two existing partition-based clustering algorithms and a hybrid clustering algorithm. The results were verified by measuring the accuracy of classification algorithms. The performance of the clustering and classification algorithms was evaluated on tumor identification, cluster quality, and other parameters such as run time and volume complexity. Some well-known classification algorithms were used to measure the accuracy of the results produced by the clustering algorithms. The clustering algorithms, particularly k-Means and FCM, proved meaningful in many domains. In addition, the proposed multifarious clustering technique showed its efficiency in predicting tumor-affected regions in mammogram images. The color images are converted to grayscale images and then processed. Finally, the best method for finding tumors in breast images is identified. This research would be immensely useful to physicians and radiologists for identifying cancer-affected areas in the breast.
Funding: Supported by the Meteorological Open Research Foundation for the Huaihe River Basin (HRM201602); the Foundation for Young Scholars of Jiangsu Meteorological Bureau (Q201708, KQ201802); the Science and Technology Innovation Team Foundation for Marine Meteorological Forecast Technology of Lianyungang Meteorological Bureau; the Key Technology R&D Program Project of Lianyungang City (SH1634); and the Special Project for Forecasters of Jiangsu Meteorological Bureau (JSYBY201811, JSYBY201812, JSYBY201810).
Abstract: Based on daily precipitation data for the Lianyungang area from 1951 to 2012 and various climate signal data from the National Climate Center website and the NOAA website, a model for predicting whether the number of summer rainstorm days in the Lianyungang area will be large was established using the classical C5.0 decision tree algorithm. Data samples from 48 years (about 80% of the total number of samples) were used as the training set, and the training accuracy of the model was 95.83%. Data samples from the remaining 14 years (about 20% of the total) were used as the test set, and the test accuracy was 85.71%. The results showed that the prediction model of the number of summer rainstorm days constructed with the C5.0 algorithm had high accuracy and was easy to interpret. Moreover, it is convenient for meteorological staff to use directly. This study also provides a new idea for short-term climate prediction of the number of summer rainstorm days.
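The reported accuracy rates can be sanity-checked with simple arithmetic. The sketch below assumes the percentages correspond to whole counts of correctly classified years (46 of 48 for training, 12 of 14 for testing); those counts are inferred from the percentages and are not stated in the abstract.

```python
def pct(part, total):
    """Return part/total as a percentage rounded to two decimals."""
    return round(100.0 * part / total, 2)

train_years, test_years = 48, 14          # from the abstract
total_years = train_years + test_years    # 62 summers, 1951 to 2012 inclusive

# Assumed counts of correctly classified years (inferred, not from the source).
train_acc = pct(46, train_years)   # 95.83
test_acc = pct(12, test_years)     # 85.71

# Actual split proportions behind the "about 80% / 20%" wording.
train_share = pct(train_years, total_years)  # 77.42
test_share = pct(test_years, total_years)    # 22.58

print(train_acc, test_acc, train_share, test_share)
```

With only 14 test years, each misclassified year moves the test accuracy by roughly 7 percentage points, so the test figure is a coarse estimate.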
Abstract: Intrusion detection systems give a networked information system defense capacity beyond the security measures provided by firewalls. This paper proposes an active rule-based enhancement to the C4.5 algorithm for network intrusion detection, in order to detect misuse behaviors of internal attackers through effective classification and decision making in computer networks. The enhanced C4.5 algorithm derives a set of classification rules from network audit data, and the generated rules are then used to detect network intrusions in a real-time environment. Unlike most existing decision-tree-based approaches, the rules generated and fired in this work are more effective because the information-theoretic approach minimizes the expected number of tests needed to classify an object and guarantees that a simple (but not necessarily the simplest) tree is found. The main advantage of the proposed algorithm is that the generalization ability of enhanced C4.5 decision trees is better than that of C4.5 decision trees. We have employed data from the third international knowledge discovery and data mining tools competition (KDDcup’99) to train and test the feasibility of the proposed model. By applying the enhanced C4.5 algorithm, an average detection rate of 93.28 percent and a false positive rate of 0.7 percent were obtained.
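The information-theoretic criterion mentioned above can be illustrated with the gain ratio that standard C4.5 uses to choose a split attribute. This is a minimal sketch of the textbook criterion for categorical attributes only, not the paper's enhanced algorithm; the attribute names (`protocol`, `flag`) and the four toy records are hypothetical.

```python
from collections import Counter
from math import log2

def entropy(items):
    """Shannon entropy (in bits) of a list of discrete values."""
    n = len(items)
    return -sum((c / n) * log2(c / n) for c in Counter(items).values())

def gain_ratio(values, labels):
    """Gain ratio of splitting `labels` by a categorical attribute `values`."""
    n = len(labels)
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(v, []).append(y)
    # Expected entropy of the class labels after the split.
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    gain = entropy(labels) - remainder
    split_info = entropy(values)  # penalizes attributes with many values
    return gain / split_info if split_info > 0 else 0.0

# Hypothetical audit records: two candidate attributes and a class label.
labels = ["attack", "attack", "normal", "normal"]
protocol = ["tcp", "tcp", "udp", "udp"]   # separates the classes perfectly
flag = ["a", "b", "a", "b"]               # carries no class information

print(gain_ratio(protocol, labels), gain_ratio(flag, labels))
```

The attribute with the highest gain ratio is tested first, which is what keeps the expected number of tests per classified object small.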
Abstract: As a distributed computing platform, Hadoop provides an effective way to handle big data. In Hadoop, the completion time of a job is delayed by a straggler. Although the definitive cause of a straggler is hard to detect, speculative execution is usually used to deal with this problem by simply backing up stragglers on alternative nodes. In this paper, we design a new Speculative Execution algorithm based on the C4.5 Decision Tree, SECDT, for Hadoop. In SECDT, we estimate the completion time of stragglers and of backup tasks using a C4.5 decision tree. We then compare the estimated completion times of each straggler and its backup task, calculate their difference, and select the straggler with the maximum difference to start a backup task. Experimental results show that SECDT predicts execution time more accurately than other speculative execution methods and hence reduces the job completion time.
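The selection step described in the abstract can be sketched as follows. In the paper the two time estimates come from a C4.5 model; here they are passed in as plain numbers, and the function name, task identifiers, and the rule of skipping non-positive differences are assumptions made for illustration.

```python
def pick_backup_candidate(estimates):
    """Pick the straggler whose backup is expected to save the most time.

    `estimates` maps a task id to a pair
    (estimated straggler completion time, estimated backup completion time).
    Returns (task_id, time_saved), or (None, 0.0) if no backup would help.
    """
    best_id, best_diff = None, 0.0
    for task_id, (straggler_t, backup_t) in estimates.items():
        diff = straggler_t - backup_t  # the "differential value"
        if diff > best_diff:           # assumption: only positive savings count
            best_id, best_diff = task_id, diff
    return best_id, best_diff

# Hypothetical estimates in seconds.
estimates = {
    "map_03": (120.0, 45.0),     # backup saves 75 s
    "map_07": (90.0, 80.0),      # backup saves 10 s
    "reduce_01": (60.0, 75.0),   # backup would be slower
}
chosen = pick_backup_candidate(estimates)
print(chosen)
```

Choosing by the difference rather than by the slowest task alone avoids launching backups that would finish no earlier than the original.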
Abstract: Many businesses, such as credit card companies, insurers, banks, and retailers, rely on direct marketing. Data mining can help these institutions set marketing goals: data mining techniques help identify good prospects in the target audience and improve the likelihood of response. In this work we investigated two data mining techniques, the Naive Bayes and C4.5 decision tree algorithms. The goal is to predict whether a client will subscribe to a term deposit. We also carried out a comparative study of the performance of the two algorithms. Publicly available UCI data is used to train and test the algorithms. In addition, we extract actionable knowledge from the decision tree to support interesting and important business decisions.
Abstract: In this paper, relaxed iterative algorithms of Krasnoselskii-type and Halpern-type that approximate a solution of a system of generalized mixed equilibrium problems and a common fixed point of a countable family of totally quasi-C-asymptotically nonexpansive multi-valued maps are constructed. Strong convergence of the sequence generated by these algorithms is proved in uniformly smooth and strictly convex real Banach spaces with the Kadec-Klee property. Furthermore, several applications of our theorems are also presented. Finally, our theorems are significant improvements on several important recent results for this class of nonlinear problems.
Abstract: This study assessed the contribution of climate projections to improving rainfall information for cocoa crops in the central and southern regions of Côte d’Ivoire. Particular attention was paid to fourteen localities in these two climatic zones. Simulation data were obtained from the CORDEX ensemble and observation data from CHIRPS. They cover 1991-2005 as the reference period and 2021-2050 as the future period for the RCP4.5 and RCP8.5 scenarios. The study was based, on the one hand, on the water requirements during the critical phase of the cocoa tree (the flowering phase) for a good yield from the cocoa production chain and, on the other, on a selection of three climate indices, CDD, CWD and r95PTOT, whose spatio-temporal changes were studied over two future periods, 2021-2035 (near future) and 2036-2050 (medium term). These climatic indices influence cocoa cultivation, and their use in studies of climatic impacts on agriculture is of prime importance. The analysis of their spatio-temporal changes also contributes to providing climate services based on rainfall, to which cocoa crops are highly sensitive. Our results show that the CDD and CWD indices vary from one region to another depending on latitude. For the fourteen localities studied, the number of consecutive dry days (CDD) could increase between now and 2050, while the number of consecutive wet days (CWD) could decrease over the period 2021-2035 and then increase over the period 2036-2050. The localities of Tabou, Aboisso and San-Pedro record high CDD and CWD values under both projection scenarios. Comparing the RCP4.5 and RCP8.5 scenarios, these results show that the RCP8.5 scenario has an impact on cocoa growing in Côte d’Ivoire.
Abstract: In the last decade, MRI (Magnetic Resonance Imaging) image segmentation has become one of the most active research fields in the medical imaging domain. Because of the fuzzy nature of MRI images, many researchers have adopted the fuzzy clustering approach to segment them. In this work, a fast and robust multi-agent system (MAS) for MRI segmentation of the brain is proposed. The system gets its robustness from a robust fuzzy c-means algorithm (RFCM) and its speed from the beneficial properties of agents, such as autonomy, social ability and reactivity. To show the efficiency of the proposed method, we test it on a normal brain taken from the BrainWeb Simulated Brain Database. The experimental results are promising in terms of both robustness to noise and running time.
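For readers unfamiliar with fuzzy clustering, the sketch below shows the standard fuzzy c-means (FCM) updates on one-dimensional intensities. It is plain FCM, not the robust variant (RFCM) or the multi-agent system proposed in the paper; the function names, the fuzzifier `m = 2`, the initial centers, and the toy intensity values are all assumptions.

```python
def memberships(xs, centers, m=2.0, eps=1e-9):
    """Fuzzy membership of every value in xs to every center (rows sum to 1)."""
    power = 2.0 / (m - 1.0)
    rows = []
    for x in xs:
        d = [abs(x - c) + eps for c in centers]  # eps avoids division by zero
        rows.append([1.0 / sum((di / dj) ** power for dj in d) for di in d])
    return rows

def fuzzy_c_means(xs, centers, m=2.0, iters=50):
    """Alternate membership and center updates for a fixed number of iterations."""
    centers = list(centers)
    for _ in range(iters):
        u = memberships(xs, centers, m)
        # Each center is the mean of the data weighted by membership ** m.
        centers = [
            sum(u[k][i] ** m * xs[k] for k in range(len(xs)))
            / sum(u[k][i] ** m for k in range(len(xs)))
            for i in range(len(centers))
        ]
    return centers, memberships(xs, centers, m)

# Hypothetical gray-level intensities forming a dark and a bright group.
pixels = [10, 12, 11, 200, 205, 198]
centers, u = fuzzy_c_means(pixels, centers=[0.0, 255.0])
print(centers)
```

Unlike hard k-means, each pixel keeps a degree of membership in every cluster, which suits the gradual tissue boundaries in MRI data.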
Abstract: The purpose of reoptimization using approximation methods is to apply knowledge about the solution of an initial instance I in order to achieve a better quality of approximation (approximation ratio) for an algorithm that determines optimal or near-optimal solutions of some “minor” changes of instance I. To solve the problem Ins-Max-EkCSP-P (reoptimization of Max-EkCSP-P with the addition of one constraint) with approximation resistant predicate P, there exists a polynomial threshold (optimal) -approximation algorithm, where the threshold “random” approximation ratio of P). When the unique games conjecture (UGC) holds, there exists a polynomial threshold (optimal) -approximation algorithm (where and the integrality gap of semidefinite relaxation of Max-EkCSP-P problem Z) to solve the problem Ins-Max-EkCSP-P.