The suppressed fuzzy c-means (S-FCM) clustering algorithm, which aims to combine the higher speed of the hard c-means clustering algorithm with the better classification performance of the fuzzy c-means clustering algorithm, has been studied by many researchers and applied in many fields. In the algorithm, selecting the suppression rate is a key step. In this paper, we give a method for selecting a fixed suppression rate from the structure of the data itself. The experimental results show that the proposed method is a suitable way to select the suppression rate in the S-FCM algorithm.
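For orientation, the suppression step can be sketched as follows. The abstract does not state the update rule, so this follows the formulation commonly given in the S-FCM literature and should be read as an assumption: after the usual FCM membership update, the largest membership of each sample is raised to 1 - α + α·u and all others are scaled by α, so that α = 1 reproduces FCM and α = 0 reproduces hard c-means. The function names, the fuzzifier `m = 2`, and the fixed `alpha` are illustrative; the paper's data-driven choice of the rate is not reproduced here.

```python
import numpy as np

def fcm_memberships(X, centers, m=2.0):
    """Standard FCM memberships, shape (n_samples, n_clusters)."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    d2 = np.maximum(d2, 1e-12)            # avoid division by zero at a center
    inv = d2 ** (-1.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)

def suppress(U, alpha):
    """Suppression step (assumed form): reward the winner, shrink the rest."""
    rows = np.arange(U.shape[0])
    winners = U.argmax(axis=1)
    S = alpha * U                          # non-winners: alpha * u
    S[rows, winners] = 1.0 - alpha + alpha * U[rows, winners]
    return S

def update_centers(X, U, m=2.0):
    """Membership-weighted means, as in FCM."""
    W = U ** m
    return (W.T @ X) / np.maximum(W.sum(axis=0), 1e-12)[:, None]

def s_fcm(X, c, alpha=0.5, m=2.0, iters=100, tol=1e-6, seed=0):
    """Alternate membership update, suppression, and center update."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=c, replace=False)].astype(float)
    for _ in range(iters):
        U = suppress(fcm_memberships(X, centers, m), alpha)
        new_centers = update_centers(X, U, m)
        shift = np.linalg.norm(new_centers - centers)
        centers = new_centers
        if shift < tol:
            break
    return centers, U
```

Because each row of the suppressed membership matrix still sums to 1, the center update is unchanged from FCM; only the sharpness of the partition depends on the rate.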
A hardness prediction model was established using support vector regression (SVR). To avoid exaggerating the contribution of alloying elements present in very small amounts, a weighted fuzzy C-means (WFCM) algorithm was proposed for data clustering. It uses an improved Mahalanobis distance based on random forest importance values, which makes full use of the important features and avoids overlap of the clustering centers. The samples were divided into two classes. The top 10 features of each class were selected to form two feature subsets for better model performance. The dimension and dispersion of the features decreased in these feature subsets. Of the four machine learning algorithms compared, SVR had the best performance and was chosen for modeling. The hyper-parameters of the SVR model were optimized by particle swarm optimization. The samples in the validation set were classified according to the minimum distance from each sample to the clustering centers, and the SVR model trained on the feature subset of the corresponding class was then used for prediction. Compared with a model trained on a feature subset of the original data set, the models trained on the feature subsets of the samples classified by WFCM gave predicted values with a higher correlation coefficient and a lower root mean square error. This indicates that WFCM is an effective method for reducing the dispersion of features and improving the accuracy of the model.
Funding: Supported by the National Research and Development Project of China (2020YFB2008400).
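The prediction stage described in the hardness-model abstract (assign each validation sample to its nearest clustering center, then apply the model trained on that class's feature subset) can be sketched roughly as below. The abstract does not give the form of the improved Mahalanobis distance, so a feature-weighted squared Euclidean distance with importance weights `w` stands in for it here; that substitution, the function names, and the use of plain callables in place of trained SVR models are all assumptions made for illustration.

```python
import numpy as np

def weighted_sq_dist(X, centers, w):
    """Feature-weighted squared distance, shape (n_samples, n_clusters).

    Stand-in for the paper's importance-based Mahalanobis distance.
    """
    diff = X[:, None, :] - centers[None, :, :]
    return (w * diff ** 2).sum(axis=2)

def route_and_predict(X, centers, w, feature_subsets, models):
    """Send each sample to its nearest center's model.

    feature_subsets[k] lists the column indices used by class k;
    models[k] is any callable mapping a 2-D array to 1-D predictions.
    """
    labels = weighted_sq_dist(X, centers, w).argmin(axis=1)
    y = np.empty(len(X))
    for k, (cols, model) in enumerate(zip(feature_subsets, models)):
        mask = labels == k
        if mask.any():
            y[mask] = model(X[mask][:, cols])
    return labels, y
```

In practice each entry of `models` would wrap the `predict` method of a regressor fitted on the corresponding class, and each entry of `feature_subsets` would hold that class's top-ranked feature indices.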