Blasting is a common method of breaking rock in surface mines.Although the fragmentation with proper size is the main purpose,other undesirable effects such as flyrock are inevitable.This study is carried out to evalu...Blasting is a common method of breaking rock in surface mines.Although the fragmentation with proper size is the main purpose,other undesirable effects such as flyrock are inevitable.This study is carried out to evaluate the capability of a novel kernel-based extreme learning machine algorithm,called kernel extreme learning machine(KELM),by which the flyrock distance(FRD) is predicted.Furthermore,the other three data-driven models including local weighted linear regression(LWLR),response surface methodology(RSM) and boosted regression tree(BRT) are also developed to validate the main model.A database gathered from three quarry sites in Malaysia is employed to construct the proposed models using 73 sets of spacing,burden,stemming length and powder factor data as inputs and FRD as target.Afterwards,the validity of the models is evaluated by comparing the corresponding values of some statistical metrics and validation tools.Finally,the results verify that the proposed KELM model on account of highest correlation coefficient(R) and lowest root mean square error(RMSE) is more computationally efficient,leading to better predictive capability compared to LWLR,RSM and BRT models for all data sets.展开更多
In this paper, we propose two weighted learning methods for the construction of single hidden layer feedforward neural networks. Both methods incorporate weighted least squares. Our idea is to allow the training insta...In this paper, we propose two weighted learning methods for the construction of single hidden layer feedforward neural networks. Both methods incorporate weighted least squares. Our idea is to allow the training instances nearer to the query to offer bigger contributions to the estimated output. By minimizing the weighted mean square error function, optimal networks can be obtained. The results of a number of experiments demonstrate the effectiveness of our proposed methods.展开更多
The Extreme Learning Machine(ELM) and its variants are effective in many machine learning applications such as Imbalanced Learning(IL) or Big Data(BD) learning. However, they are unable to solve both imbalanced ...The Extreme Learning Machine(ELM) and its variants are effective in many machine learning applications such as Imbalanced Learning(IL) or Big Data(BD) learning. However, they are unable to solve both imbalanced and large-volume data learning problems. This study addresses the IL problem in BD applications. The Distributed and Weighted ELM(DW-ELM) algorithm is proposed, which is based on the Map Reduce framework. To confirm the feasibility of parallel computation, first, the fact that matrix multiplication operators are decomposable is illustrated.Then, to further improve the computational efficiency, an Improved DW-ELM algorithm(IDW-ELM) is developed using only one Map Reduce job. The successful operations of the proposed DW-ELM and IDW-ELM algorithms are finally validated through experiments.展开更多
Objective: By optimizing the extreme learning machine network with particle swarm optimization, we established a syndrome classification and prediction model for primary liver cancer(PLC), classified and predicted the...Objective: By optimizing the extreme learning machine network with particle swarm optimization, we established a syndrome classification and prediction model for primary liver cancer(PLC), classified and predicted the syndrome diagnosis of medical record data for PLC and compared and analyzed the prediction results with different algorithms and the clinical diagnosis results. This paper provides modern technical support for clinical diagnosis and treatment, and improves the objectivity, accuracy and rigor of the classification of traditional Chinese medicine(TCM) syndromes.Methods: From three top-level TCM hospitals in Nanchang, 10,602 electronic medical records from patients with PLC were collected, dating from January 2009 to May 2020. We removed the electronic medical records of 542 cases of syndromes and adopted the cross-validation method in the remaining10,060 electronic medical records, which were randomly divided into a training set and a test set.Based on fuzzy mathematics theory, we quantified the syndrome-related factors of TCM symptoms and signs, and information from the TCM four diagnostic methods. Next, using an extreme learning machine network with particle swarm optimization, we constructed a neural network syndrome classification and prediction model that used "TCM symptoms + signs + tongue diagnosis information + pulse diagnosis information" as input, and PLC syndrome as output. This approach was used to mine the nonlinear relationship between clinical data in electronic medical records and different syndrome types. The accuracy rate of classification was used to compare this model to other machine learning classification models.Results: The classification accuracy rate of the model developed here was 86.26%. The classification accuracy rates of models using support vector machine and Bayesian networks were 82.79% and 85.84%,respectively. The classification accuracy rates of the models for all syndromes in this paper were between82.15% and 93.82%.Conclusion: Compared with the case of data processed using traditional binary inputs, the experiment shows that the medical record data processed by fuzzy mathematics was more accurate, and closer to clinical findings. In addition, the model developed here was more refined, more accurate, and quicker than other classification models. This model provides reliable diagnosis for clinical treatment of PLC and a method to study of the rules of syndrome differentiation and treatment in TCM.展开更多
Classification of imbalanced data is a well explored issue in the data mining and machine learning community where one class representation is overwhelmed by other classes.The Imbalanced distribution of data is a natu...Classification of imbalanced data is a well explored issue in the data mining and machine learning community where one class representation is overwhelmed by other classes.The Imbalanced distribution of data is a natural occurrence in real world datasets,so needed to be dealt with carefully to get important insights.In case of imbalance in data sets,traditional classifiers have to sacrifice their performances,therefore lead to misclassifications.This paper suggests a weighted nearest neighbor approach in a fuzzy manner to deal with this issue.We have adapted the‘existing algorithm modification solution’to learn from imbalanced datasets that classify data without manipulating the natural distribution of data unlike the other popular data balancing methods.The K nearest neighbor is a non-parametric classification method that is mostly used in machine learning problems.Fuzzy classification with the nearest neighbor clears the belonging of an instance to classes and optimal weights with improved nearest neighbor concept helping to correctly classify imbalanced data.The proposed hybrid approach takes care of imbalance nature of data and reduces the inaccuracies appear in applications of original and traditional classifiers.Results show that it performs well over the existing fuzzy nearest neighbor and weighted neighbor strategies for imbalanced learning.展开更多
The estimation of the fuzzy membership function parameters for interval type 2 fuzzy logic system(IT2-FLS)is a challenging task in the presence of uncertainty and imprecision.Grasshopper optimization algorithm(GOA)is ...The estimation of the fuzzy membership function parameters for interval type 2 fuzzy logic system(IT2-FLS)is a challenging task in the presence of uncertainty and imprecision.Grasshopper optimization algorithm(GOA)is a fresh population based meta-heuristic algorithm that mimics the swarming behavior of grasshoppers in nature,which has good convergence ability towards optima.The main objective of this paper is to apply GOA to estimate the optimal parameters of the Gaussian membership function in an IT2-FLS.The antecedent part parameters(Gaussian membership function parameters)are encoded as a population of artificial swarm of grasshoppers and optimized using its algorithm.Tuning of the consequent part parameters are accomplished using extreme learning machine.The optimized IT2-FLS(GOAIT2FELM)obtained the optimal premise parameters based on tuned consequent part parameters and is then applied on the Australian national electricity market data for the forecasting of electricity loads and prices.The forecasting performance of the proposed model is compared with other population-based optimized IT2-FLS including genetic algorithm and artificial bee colony optimization algorithm.Analysis of the performance,on the same data-sets,reveals that the proposed GOAIT2FELM could be a better approach for improving the accuracy of the IT2-FLS as compared to other variants of the optimized IT2-FLS.展开更多
文摘Blasting is a common method of breaking rock in surface mines.Although the fragmentation with proper size is the main purpose,other undesirable effects such as flyrock are inevitable.This study is carried out to evaluate the capability of a novel kernel-based extreme learning machine algorithm,called kernel extreme learning machine(KELM),by which the flyrock distance(FRD) is predicted.Furthermore,the other three data-driven models including local weighted linear regression(LWLR),response surface methodology(RSM) and boosted regression tree(BRT) are also developed to validate the main model.A database gathered from three quarry sites in Malaysia is employed to construct the proposed models using 73 sets of spacing,burden,stemming length and powder factor data as inputs and FRD as target.Afterwards,the validity of the models is evaluated by comparing the corresponding values of some statistical metrics and validation tools.Finally,the results verify that the proposed KELM model on account of highest correlation coefficient(R) and lowest root mean square error(RMSE) is more computationally efficient,leading to better predictive capability compared to LWLR,RSM and BRT models for all data sets.
基金supported by the NSC under Grant No.NSC-100-2221-E-110-083-MY3 and NSC-101-2622-E-110-011-CC3"Aim for the Top University Plan"of the National Sun-Yat-Sen University and Ministry of Education
文摘In this paper, we propose two weighted learning methods for the construction of single hidden layer feedforward neural networks. Both methods incorporate weighted least squares. Our idea is to allow the training instances nearer to the query to offer bigger contributions to the estimated output. By minimizing the weighted mean square error function, optimal networks can be obtained. The results of a number of experiments demonstrate the effectiveness of our proposed methods.
基金partially supported by the National Natural Science Foundation of China(Nos.61402089,61472069,and 61501101)the Fundamental Research Funds for the Central Universities(Nos.N161904001,N161602003,and N150408001)+2 种基金the Natural Science Foundation of Liaoning Province(No.2015020553)the China Postdoctoral Science Foundation(No.2016M591447)the Postdoctoral Science Foundation of Northeastern University(No.20160203)
文摘The Extreme Learning Machine(ELM) and its variants are effective in many machine learning applications such as Imbalanced Learning(IL) or Big Data(BD) learning. However, they are unable to solve both imbalanced and large-volume data learning problems. This study addresses the IL problem in BD applications. The Distributed and Weighted ELM(DW-ELM) algorithm is proposed, which is based on the Map Reduce framework. To confirm the feasibility of parallel computation, first, the fact that matrix multiplication operators are decomposable is illustrated.Then, to further improve the computational efficiency, an Improved DW-ELM algorithm(IDW-ELM) is developed using only one Map Reduce job. The successful operations of the proposed DW-ELM and IDW-ELM algorithms are finally validated through experiments.
基金financially supported by the National Natural Science Foundation (No. 81660727)。
文摘Objective: By optimizing the extreme learning machine network with particle swarm optimization, we established a syndrome classification and prediction model for primary liver cancer(PLC), classified and predicted the syndrome diagnosis of medical record data for PLC and compared and analyzed the prediction results with different algorithms and the clinical diagnosis results. This paper provides modern technical support for clinical diagnosis and treatment, and improves the objectivity, accuracy and rigor of the classification of traditional Chinese medicine(TCM) syndromes.Methods: From three top-level TCM hospitals in Nanchang, 10,602 electronic medical records from patients with PLC were collected, dating from January 2009 to May 2020. We removed the electronic medical records of 542 cases of syndromes and adopted the cross-validation method in the remaining10,060 electronic medical records, which were randomly divided into a training set and a test set.Based on fuzzy mathematics theory, we quantified the syndrome-related factors of TCM symptoms and signs, and information from the TCM four diagnostic methods. Next, using an extreme learning machine network with particle swarm optimization, we constructed a neural network syndrome classification and prediction model that used "TCM symptoms + signs + tongue diagnosis information + pulse diagnosis information" as input, and PLC syndrome as output. This approach was used to mine the nonlinear relationship between clinical data in electronic medical records and different syndrome types. The accuracy rate of classification was used to compare this model to other machine learning classification models.Results: The classification accuracy rate of the model developed here was 86.26%. The classification accuracy rates of models using support vector machine and Bayesian networks were 82.79% and 85.84%,respectively. The classification accuracy rates of the models for all syndromes in this paper were between82.15% and 93.82%.Conclusion: Compared with the case of data processed using traditional binary inputs, the experiment shows that the medical record data processed by fuzzy mathematics was more accurate, and closer to clinical findings. In addition, the model developed here was more refined, more accurate, and quicker than other classification models. This model provides reliable diagnosis for clinical treatment of PLC and a method to study of the rules of syndrome differentiation and treatment in TCM.
文摘Classification of imbalanced data is a well explored issue in the data mining and machine learning community where one class representation is overwhelmed by other classes.The Imbalanced distribution of data is a natural occurrence in real world datasets,so needed to be dealt with carefully to get important insights.In case of imbalance in data sets,traditional classifiers have to sacrifice their performances,therefore lead to misclassifications.This paper suggests a weighted nearest neighbor approach in a fuzzy manner to deal with this issue.We have adapted the‘existing algorithm modification solution’to learn from imbalanced datasets that classify data without manipulating the natural distribution of data unlike the other popular data balancing methods.The K nearest neighbor is a non-parametric classification method that is mostly used in machine learning problems.Fuzzy classification with the nearest neighbor clears the belonging of an instance to classes and optimal weights with improved nearest neighbor concept helping to correctly classify imbalanced data.The proposed hybrid approach takes care of imbalance nature of data and reduces the inaccuracies appear in applications of original and traditional classifiers.Results show that it performs well over the existing fuzzy nearest neighbor and weighted neighbor strategies for imbalanced learning.
文摘The estimation of the fuzzy membership function parameters for interval type 2 fuzzy logic system(IT2-FLS)is a challenging task in the presence of uncertainty and imprecision.Grasshopper optimization algorithm(GOA)is a fresh population based meta-heuristic algorithm that mimics the swarming behavior of grasshoppers in nature,which has good convergence ability towards optima.The main objective of this paper is to apply GOA to estimate the optimal parameters of the Gaussian membership function in an IT2-FLS.The antecedent part parameters(Gaussian membership function parameters)are encoded as a population of artificial swarm of grasshoppers and optimized using its algorithm.Tuning of the consequent part parameters are accomplished using extreme learning machine.The optimized IT2-FLS(GOAIT2FELM)obtained the optimal premise parameters based on tuned consequent part parameters and is then applied on the Australian national electricity market data for the forecasting of electricity loads and prices.The forecasting performance of the proposed model is compared with other population-based optimized IT2-FLS including genetic algorithm and artificial bee colony optimization algorithm.Analysis of the performance,on the same data-sets,reveals that the proposed GOAIT2FELM could be a better approach for improving the accuracy of the IT2-FLS as compared to other variants of the optimized IT2-FLS.