To solve the multi-class fault diagnosis tasks, decision tree support vector machine (DTSVM), which combines SVM and decision tree using the concept of dichotomy, is proposed. Since the classification performance of...To solve the multi-class fault diagnosis tasks, decision tree support vector machine (DTSVM), which combines SVM and decision tree using the concept of dichotomy, is proposed. Since the classification performance of DTSVM highly depends on its structure, to cluster the multi-classes with maximum distance between the clustering centers of the two sub-classes, genetic algorithm is introduced into the formation of decision tree, so that the most separable classes would be separated at each node of decisions tree. Numerical simulations conducted on three datasets compared with "one-against-all" and "one-against-one" demonstrate the proposed method has better performance and higher generalization ability than the two conventional methods.展开更多
This paper presents the fault diagnosis of face milling tool based on machine learning approach.While machining,spindle vibration signals in feed direction under healthy and faulty conditions of the milling tool are a...This paper presents the fault diagnosis of face milling tool based on machine learning approach.While machining,spindle vibration signals in feed direction under healthy and faulty conditions of the milling tool are acquired.A set of discrete wavelet features is extracted from the vibration signals using discrete wavelet transform(DWT)technique.The decision tree technique is used to select significant features out of all extracted wavelet features.C-support vector classification(C-SVC)andν-support vector classification(ν-SVC)models with different kernel functions of support vector machine(SVM)are used to study and classify the tool condition based on selected features.From the results obtained,C-SVC is the best model thanν-SVC and it can be able to give 94.5%classification accuracy for face milling of special steel alloy 42CrMo4.展开更多
Credit card companies must be able to identify fraudulent credit card transactions so that clients are not charged for items they did not purchase. Previously, many machine learning approaches and classifiers were use...Credit card companies must be able to identify fraudulent credit card transactions so that clients are not charged for items they did not purchase. Previously, many machine learning approaches and classifiers were used to detect fraudulent transactions. However, because fraud patterns are always changing, it is becoming increasingly vital to investigate new frauds and develop the model based on the new patterns. The purpose of this research is to create a machine learning classifier that not only detects fraud but also detects legitimate transactions. As a result, the model should have excellent accuracy, precision, recall, and f1-score. As a result, we began with a large dataset in this study and used four machine learning classifiers: Support Vector Machine (SVM), Decision Tree, Naïve Bayes, and Random Forest. The random forest classifier scored 99.96% overall accuracy with the best precision, recall, f1-score, and Matthews correlation coefficient in the experiments.展开更多
Renewable energy has garnered attention due to the need for sustainable energy sources.Wind power has emerged as an alternative that has contributed to the transition towards cleaner energy.As the importance of wind e...Renewable energy has garnered attention due to the need for sustainable energy sources.Wind power has emerged as an alternative that has contributed to the transition towards cleaner energy.As the importance of wind energy grows,it can be crucial to provide forecasts that optimize its performance potential.Artificial intelligence(AI)methods have risen in prominence due to how well they can handle complicated systems while enhancing the accuracy of prediction.This study explored the area of AI to predict wind-energy production at a wind farm in Yalova,Turkey,using four different AI approaches:support vector machines(SVMs),decision trees,adaptive neuro-fuzzy inference systems(ANFIS)and artificial neural networks(ANNs).Wind speed and direction were considered as essential input parameters,with wind energy as the target parameter,and models are thoroughly evaluated using metrics such as the mean absolute percentage error(MAPE),coefficient of determination(R~2),and mean absolute error(MAE).The findings accentuate the superior performance of the SVM,which delivered the lowest MAPE(2.42%),the highest R~2(0.95),and the lowest MAE(71.21%)compared with actual values,while ANFIS was less effective in this context.The main aim of this comparative analysis was to rank the models to move to the next step in improving the least efficient methods by combining them with optimization algorithms,such as metaheuristic algorithms.展开更多
Heart failure is now widely spread throughout the world.Heart disease affects approximately 48%of the population.It is too expensive and also difficult to cure the disease.This research paper represents machine learni...Heart failure is now widely spread throughout the world.Heart disease affects approximately 48%of the population.It is too expensive and also difficult to cure the disease.This research paper represents machine learning models to predict heart failure.The fundamental concept is to compare the correctness of various Machine Learning(ML)algorithms and boost algorithms to improve models’accuracy for prediction.Some supervised algorithms like K-Nearest Neighbor(KNN),Support Vector Machine(SVM),Decision Trees(DT),Random Forest(RF),Logistic Regression(LR)are considered to achieve the best results.Some boosting algorithms like Extreme Gradient Boosting(XGBoost)and Cat-Boost are also used to improve the prediction using Artificial Neural Networks(ANN).This research also focuses on data visualization to identify patterns,trends,and outliers in a massive data set.Python and Scikit-learns are used for ML.Tensor Flow and Keras,along with Python,are used for ANN model train-ing.The DT and RF algorithms achieved the highest accuracy of 95%among the classifiers.Meanwhile,KNN obtained a second height accuracy of 93.33%.XGBoost had a gratified accuracy of 91.67%,SVM,CATBoost,and ANN had an accuracy of 90%,and LR had 88.33%accuracy.展开更多
Because of the increasing attention on environmental issues, especially air pollution, predicting whether a day is polluted or not is necessary to people’s health. In order to solve this problem, this research is cla...Because of the increasing attention on environmental issues, especially air pollution, predicting whether a day is polluted or not is necessary to people’s health. In order to solve this problem, this research is classifying ground ozone level based on big data and machine learning models, where polluted ozone day has class 1 and non-ozone day has class 0. The dataset used in this research was derived from the UCI Website, containing various environmental factors in Houston, Galveston and Brazoria area that could possibly affect the occurrence of ozone pollution [1]. This dataset is first filled up for further process, next standardized to ensure every feature has the same weight, and then split into training set and testing set. After this, five different machine learning models are used in the prediction of ground ozone level and their final accuracy scores are compared. In conclusion, among Logistic Regression, Decision Tree, Random Forest, AdaBoost, and Support Vector Machine (SVM), the last one has the highest test score of 0.949. This research utilizes relatively simple methods of forecasting and calculates the first accuracy scores in predicting ground ozone level;it can thus be a reference for environmentalists. Moreover, the direct comparison among five different models provides machine learning field an insight to determine the most accurate model. In the future, Neural Network can also be utilized to predict air pollution, and its test scores can be compared with the previous five methods to conclude the accuracy of Neuron Network.展开更多
In the last decade, a few valuable types of research have been conducted to discriminate fractured zones from non-fractured ones. In this paper, petrophysical and image logs of eight wells were utilized to detect frac...In the last decade, a few valuable types of research have been conducted to discriminate fractured zones from non-fractured ones. In this paper, petrophysical and image logs of eight wells were utilized to detect fractured zones. Decision tree, random forest, support vector machine, and deep learning were four classifiers applied over petrophysical logs and image logs for both training and testing. The output of classifiers was fused by ordered weighted averaging data fusion to achieve more reliable, accurate, and general results. Accuracy of close to 99% has been achieved. This study reports a significant improvement compared to the existing work that has an accuracy of close to 80%.展开更多
Students in South African Universities come from different socio-cultural backgrounds, countries and high schools. This suggests that these students have different experiences which impact on their levels of grasping ...Students in South African Universities come from different socio-cultural backgrounds, countries and high schools. This suggests that these students have different experiences which impact on their levels of grasping information in class as they potentially use different lenses on tuition. The current practice in Universities in contributing to the academic performance of students includes the use of tutors, the use of mobile devices for first year students, use of student assistants and the use of different feedback measures. What is problematic about the current practice is that students are quitting university in high numbers. In this study, knowledge has been drawn from data through the use of machine learning algorithms. Bayesian networks, support vector machines (SVMs) and decision trees algorithms were used individually in this work to construct predictive models for the academic performance of students. The best model was constructed using SVM and it gave a prediction of 72.87% and a prediction cost of 139. The model does predict the performance of students in advance of the year-end examinations outcome. The results suggest that South African Universities must recognize the diversity in student population and thus provide students with better support and equip them with the necessary knowledge that will enable them to tap into their full potential and thus enhance their skills.展开更多
基金supported by the National Natural Science Foundation of China (60604021 60874054)
文摘To solve the multi-class fault diagnosis tasks, decision tree support vector machine (DTSVM), which combines SVM and decision tree using the concept of dichotomy, is proposed. Since the classification performance of DTSVM highly depends on its structure, to cluster the multi-classes with maximum distance between the clustering centers of the two sub-classes, genetic algorithm is introduced into the formation of decision tree, so that the most separable classes would be separated at each node of decisions tree. Numerical simulations conducted on three datasets compared with "one-against-all" and "one-against-one" demonstrate the proposed method has better performance and higher generalization ability than the two conventional methods.
文摘This paper presents the fault diagnosis of face milling tool based on machine learning approach.While machining,spindle vibration signals in feed direction under healthy and faulty conditions of the milling tool are acquired.A set of discrete wavelet features is extracted from the vibration signals using discrete wavelet transform(DWT)technique.The decision tree technique is used to select significant features out of all extracted wavelet features.C-support vector classification(C-SVC)andν-support vector classification(ν-SVC)models with different kernel functions of support vector machine(SVM)are used to study and classify the tool condition based on selected features.From the results obtained,C-SVC is the best model thanν-SVC and it can be able to give 94.5%classification accuracy for face milling of special steel alloy 42CrMo4.
文摘Credit card companies must be able to identify fraudulent credit card transactions so that clients are not charged for items they did not purchase. Previously, many machine learning approaches and classifiers were used to detect fraudulent transactions. However, because fraud patterns are always changing, it is becoming increasingly vital to investigate new frauds and develop the model based on the new patterns. The purpose of this research is to create a machine learning classifier that not only detects fraud but also detects legitimate transactions. As a result, the model should have excellent accuracy, precision, recall, and f1-score. As a result, we began with a large dataset in this study and used four machine learning classifiers: Support Vector Machine (SVM), Decision Tree, Naïve Bayes, and Random Forest. The random forest classifier scored 99.96% overall accuracy with the best precision, recall, f1-score, and Matthews correlation coefficient in the experiments.
文摘Renewable energy has garnered attention due to the need for sustainable energy sources.Wind power has emerged as an alternative that has contributed to the transition towards cleaner energy.As the importance of wind energy grows,it can be crucial to provide forecasts that optimize its performance potential.Artificial intelligence(AI)methods have risen in prominence due to how well they can handle complicated systems while enhancing the accuracy of prediction.This study explored the area of AI to predict wind-energy production at a wind farm in Yalova,Turkey,using four different AI approaches:support vector machines(SVMs),decision trees,adaptive neuro-fuzzy inference systems(ANFIS)and artificial neural networks(ANNs).Wind speed and direction were considered as essential input parameters,with wind energy as the target parameter,and models are thoroughly evaluated using metrics such as the mean absolute percentage error(MAPE),coefficient of determination(R~2),and mean absolute error(MAE).The findings accentuate the superior performance of the SVM,which delivered the lowest MAPE(2.42%),the highest R~2(0.95),and the lowest MAE(71.21%)compared with actual values,while ANFIS was less effective in this context.The main aim of this comparative analysis was to rank the models to move to the next step in improving the least efficient methods by combining them with optimization algorithms,such as metaheuristic algorithms.
基金Taif University Researchers Supporting Project Number(TURSP-2020/73)Taif University,Taif,Saudi Arabia.
文摘Heart failure is now widely spread throughout the world.Heart disease affects approximately 48%of the population.It is too expensive and also difficult to cure the disease.This research paper represents machine learning models to predict heart failure.The fundamental concept is to compare the correctness of various Machine Learning(ML)algorithms and boost algorithms to improve models’accuracy for prediction.Some supervised algorithms like K-Nearest Neighbor(KNN),Support Vector Machine(SVM),Decision Trees(DT),Random Forest(RF),Logistic Regression(LR)are considered to achieve the best results.Some boosting algorithms like Extreme Gradient Boosting(XGBoost)and Cat-Boost are also used to improve the prediction using Artificial Neural Networks(ANN).This research also focuses on data visualization to identify patterns,trends,and outliers in a massive data set.Python and Scikit-learns are used for ML.Tensor Flow and Keras,along with Python,are used for ANN model train-ing.The DT and RF algorithms achieved the highest accuracy of 95%among the classifiers.Meanwhile,KNN obtained a second height accuracy of 93.33%.XGBoost had a gratified accuracy of 91.67%,SVM,CATBoost,and ANN had an accuracy of 90%,and LR had 88.33%accuracy.
文摘Because of the increasing attention on environmental issues, especially air pollution, predicting whether a day is polluted or not is necessary to people’s health. In order to solve this problem, this research is classifying ground ozone level based on big data and machine learning models, where polluted ozone day has class 1 and non-ozone day has class 0. The dataset used in this research was derived from the UCI Website, containing various environmental factors in Houston, Galveston and Brazoria area that could possibly affect the occurrence of ozone pollution [1]. This dataset is first filled up for further process, next standardized to ensure every feature has the same weight, and then split into training set and testing set. After this, five different machine learning models are used in the prediction of ground ozone level and their final accuracy scores are compared. In conclusion, among Logistic Regression, Decision Tree, Random Forest, AdaBoost, and Support Vector Machine (SVM), the last one has the highest test score of 0.949. This research utilizes relatively simple methods of forecasting and calculates the first accuracy scores in predicting ground ozone level;it can thus be a reference for environmentalists. Moreover, the direct comparison among five different models provides machine learning field an insight to determine the most accurate model. In the future, Neural Network can also be utilized to predict air pollution, and its test scores can be compared with the previous five methods to conclude the accuracy of Neuron Network.
文摘In the last decade, a few valuable types of research have been conducted to discriminate fractured zones from non-fractured ones. In this paper, petrophysical and image logs of eight wells were utilized to detect fractured zones. Decision tree, random forest, support vector machine, and deep learning were four classifiers applied over petrophysical logs and image logs for both training and testing. The output of classifiers was fused by ordered weighted averaging data fusion to achieve more reliable, accurate, and general results. Accuracy of close to 99% has been achieved. This study reports a significant improvement compared to the existing work that has an accuracy of close to 80%.
文摘Students in South African Universities come from different socio-cultural backgrounds, countries and high schools. This suggests that these students have different experiences which impact on their levels of grasping information in class as they potentially use different lenses on tuition. The current practice in Universities in contributing to the academic performance of students includes the use of tutors, the use of mobile devices for first year students, use of student assistants and the use of different feedback measures. What is problematic about the current practice is that students are quitting university in high numbers. In this study, knowledge has been drawn from data through the use of machine learning algorithms. Bayesian networks, support vector machines (SVMs) and decision trees algorithms were used individually in this work to construct predictive models for the academic performance of students. The best model was constructed using SVM and it gave a prediction of 72.87% and a prediction cost of 139. The model does predict the performance of students in advance of the year-end examinations outcome. The results suggest that South African Universities must recognize the diversity in student population and thus provide students with better support and equip them with the necessary knowledge that will enable them to tap into their full potential and thus enhance their skills.