Partition-based clustering with weighted feature is developed in the framework of shadowed sets. The objects in the core and boundary regions, generated by shadowed sets-based clustering, have different impact on the ...Partition-based clustering with weighted feature is developed in the framework of shadowed sets. The objects in the core and boundary regions, generated by shadowed sets-based clustering, have different impact on the prototype of each cluster. By integrating feature weights, a formula for weight calculation is introduced to the clustering algorithm. The selection of weight exponent is crucial for good result and the weights are updated iteratively with each partition of clusters. The convergence of the weighted algorithms is given, and the feasible cluster validity indices of data mining application are utilized. Experimental results on both synthetic and real-life numerical data with different feature weights demonstrate that the weighted algorithm is better than the other unweighted algorithms.展开更多
In this paper,we elaborate on residual-driven Fuzzy C-Means(FCM)for image segmentation,which is the first approach that realizes accurate residual(noise/outliers)estimation and enables noise-free image to participate ...In this paper,we elaborate on residual-driven Fuzzy C-Means(FCM)for image segmentation,which is the first approach that realizes accurate residual(noise/outliers)estimation and enables noise-free image to participate in clustering.We propose a residual-driven FCM framework by integrating into FCM a residual-related regularization term derived from the distribution characteristic of different types of noise.Built on this framework,a weighted?2-norm regularization term is presented by weighting mixed noise distribution,thus resulting in a universal residual-driven FCM algorithm in presence of mixed or unknown noise.Besides,with the constraint of spatial information,the residual estimation becomes more reliable than that only considering an observed image itself.Supporting experiments on synthetic,medical,and real-world images are conducted.The results demonstrate the superior effectiveness and efficiency of the proposed algorithm over its peers.展开更多
The hardness prediction model was established by support vector regression(SVR).In order to avoid exaggerating the contribution of very tiny alloying elements,a weighted fuzzy C-means(WFCM)algorithm was proposed for d...The hardness prediction model was established by support vector regression(SVR).In order to avoid exaggerating the contribution of very tiny alloying elements,a weighted fuzzy C-means(WFCM)algorithm was proposed for data clustering using improved Mahalanobis distance based on random forest importance values,which could play a full role of important features and avoid clustering center overlap.The samples were divided into two classes.The top 10 features of each class were selected to form two feature subsets for better performance of the model.The dimension and dispersion of features decreased in such feature subsets.Comparing four machine learning algorithms,SVR had the best performance and was chosen to modeling.The hyper-parameters of the SVR model were optimized by particle swarm optimization.The samples in validation set were classified according to minimum distance of sample to clustering centers,and then the SVR model trained by feature subset of corresponding class was used for prediction.Compared with the feature subset of original data set,the predicted values of model trained by feature subsets of classified samples by WFCM had higher correlation coefficient and lower root mean square error.It indicated that WFCM was an effective method to reduce the dispersion of features and improve the accuracy of model.展开更多
Solder bump technology has been widely used in electronic packaging. With the development of solder bumps towards higher density and finer pitch, it is more difficult to inspect the defects of solder bumps as they are...Solder bump technology has been widely used in electronic packaging. With the development of solder bumps towards higher density and finer pitch, it is more difficult to inspect the defects of solder bumps as they are hidden in the package. A nondestructive method using the transient active thermography has been proposed to inspect the defects of a solder bump, and we aim at developing an intelligent diagnosis system to eliminate the influence of emissivity unevenness and non-uniform heating on defects recognition in active infrared testing. An improved fuzzy c-means(FCM) algorithm based on the entropy weights is investigated in this paper. The captured thermograms are preprocessed to enhance the thermal contrast between the defective and good bumps. Hot spots corresponding to 16 solder bumps are segmented from the thermal images. The statistical features are calculated and selected appropriately to characterize the status of solder bumps in FCM clustering. The missing bump is identified in the FCM result, which is also validated by the principle component analysis. The intelligent diagnosis system using FCM algorithm with the entropy weights is effective for defects recognition in electronic packages.展开更多
Wind farm power prediction is proposed based on adaptive feature weight entropy fuzzy clustering algorithm.According to the fuzzy clustering method,a large number of historical data of a wind farm in Inner Mongolia ar...Wind farm power prediction is proposed based on adaptive feature weight entropy fuzzy clustering algorithm.According to the fuzzy clustering method,a large number of historical data of a wind farm in Inner Mongolia are analyzed and classified.Model of adaptive entropy weight for clustering is built.Wind power prediction model based on adaptive entropy fuzzy clustering feature weights is built.Simulation results show that the proposed method could distinguish the abnormal data and forecast more accurately and compute fastly.展开更多
When the attributes of unknown targets are not just numerical attributes,but hybrid attributes containing linguistic attributes,the existing recognition methods are not effective.In addition,it is more difficult to id...When the attributes of unknown targets are not just numerical attributes,but hybrid attributes containing linguistic attributes,the existing recognition methods are not effective.In addition,it is more difficult to identify the unknown targets densely distributed in the feature space,especially when there is interval overlap between attribute measurements of different target classes.To address these problems,a novel method based on intuitionistic fuzzy comprehensive evaluation model(IFCEM)is proposed.For numerical attributes,targets in the database are divided into individual classes and overlapping classes,and for linguistic attributes,continuous interval-valued linguistic term set(CIVLTS)is used to describe target characteristic.A cloud modelbased method and an area-based method are proposed to obtain intuitionistic fuzzy decision information of query target on numerical attributes and linguistic attributes respectively.An improved inverse weighted kernel fuzzy c-means(IWK-FCM)algorithm is proposed for solution of attribute weight vector.The possibility matrix is applied to determine the identity and category of query target.Finally,a case study composed of parameter sensitivity analysis,recognition accuracy analysis.and comparison with other methods,is taken to verify the superiority of the proposed method.展开更多
Gravity inversion requires much computation,and inversion results are often non-unique.The first problem is often due to the large number of grid cells.Edge detection method,i.e.,tilt angle method of analytical signal...Gravity inversion requires much computation,and inversion results are often non-unique.The first problem is often due to the large number of grid cells.Edge detection method,i.e.,tilt angle method of analytical signal amplitude(TAS),helps to identify the boundaries of underground geological anomalies at different depths,which can be used to optimize the grid and reduce the number of grid cells.The requirement of smooth inversion is that the boundaries of the meshing area should be continuous rather than jagged.In this paper,the optimized meshing strategy is improved,and the optimized meshing region obtained by the TAS is changed to a regular region to facilitate the smooth inversion.For the second problem,certain constraints can be used to improve the accuracy of inversion.The results of analytic signal amplitude(ASA)are used to delineate the central distribution of geological bodies.We propose a new method using the results of ASA to perform local constraints to reduce the non-uniqueness of inversion.The guided fuzzy c-means(FCM)clustering algorithm combined with priori petrophysical information is also used to reduce the non-uniqueness of gravity inversion.The Open Acc technology is carried out to speed up the computation for parallelizing the serial program on GPU.In general,the TAS is used to reduce the number of grid cells.The local weighting and priori petrophysical constraint are used in conjunction with the FCM algorithm during the inversion,which improves the accuracy of inversion.The inversion is accelerated by the Open Acc technology on GPU.The proposed method is validated using synthetic data,and the results show that the efficiency and accuracy of gravity inversion are greatly improved by using the proposed method.展开更多
To improve the computational performance of the fuzzy C-means(FCM)algorithm used in dataset clus-tering with large numbers,the concepts of the equivalent samples and the weighting samples based on eigenvalue distribut...To improve the computational performance of the fuzzy C-means(FCM)algorithm used in dataset clus-tering with large numbers,the concepts of the equivalent samples and the weighting samples based on eigenvalue distribution of the samples in the feature space were intro-duced and a novel fast cluster algorithm named weighted fuzzy C-means(WFCM)algorithm was put forward,which came from the traditional FCM algorithm.It was proved that the cluster results were equivalent in dataset with two different cluster algorithms:WFCM and FCM.Furthermore,the WFCM algorithm had better computational performance than the ordinary FCM algorithm.The experiment of the gray image segmentation showed that the WFCM algorithm is a fast and effective cluster algorithm.展开更多
The opening of gated community to expand the micro-road network in the urban traffic system is a hot topic on the urban congestion. To satisfy the demand of opening early choosing case, this paper proposed a comprehen...The opening of gated community to expand the micro-road network in the urban traffic system is a hot topic on the urban congestion. To satisfy the demand of opening early choosing case, this paper proposed a comprehensive selecting framework on qualified communities and its appropriate opening time. Firstly, the static influential factors on internal road structure, boundary road structure and traffic flow are qualitatively analyzed. Then, an evaluation opening state index system based on describing accurately traffic flow state is obtained, which takes the opening factors into account at the boundary road network. In this structure, the modified fuzzy C-means (FCM) method calculates the fuzzy entropy weight and range of each opening states index. Finally, the simulation results show that the proposed method is capable of selecting qualified community and the optimum opening time.展开更多
Purpose–The purpose of this paper is to consider the concept of Fuzzy Radial Basis Function Neural Networks with Information Granulation(IG-FRBFNN)and their optimization realized by means of the Multiobjective Partic...Purpose–The purpose of this paper is to consider the concept of Fuzzy Radial Basis Function Neural Networks with Information Granulation(IG-FRBFNN)and their optimization realized by means of the Multiobjective Particle Swarm Optimization(MOPSO).Design/methodology/approach–In fuzzy modeling,complexity,interpretability(or simplicity)as well as accuracy of the obtained model are essential design criteria.Since the performance of the IG-RBFNN model is directly affected by some parameters,such as the fuzzification coefficient used in the FCM,the number of rules and the orders of the polynomials in the consequent parts of the rules,the authors carry out both structural as well as parametric optimization of the network.A multi-objective Particle Swarm Optimization using Crowding Distance(MOPSO-CD)as well as O/WLS learning-based optimization are exploited to carry out the structural and parametric optimization of the model,respectively,while the optimization is of multiobjective character as it is aimed at the simultaneous minimization of complexity and maximization of accuracy.Findings–The performance of the proposed model is illustrated with the aid of three examples.The proposed optimization method leads to an accurate and highly interpretable fuzzy model.Originality/value–A MOPSO-CD as well as O/WLS learning-based optimization are exploited,respectively,to carry out the structural and parametric optimization of the model.As a result,the proposed methodology is interesting for designing an accurate and highly interpretable fuzzy model.展开更多
The furnace process is very important in boiler operation,and furnace pressure works as an important parameter in furnace process.Therefore,there is a need to analyze and monitor the pressure signal in furnace.However...The furnace process is very important in boiler operation,and furnace pressure works as an important parameter in furnace process.Therefore,there is a need to analyze and monitor the pressure signal in furnace.However,little work has been conducted on the relationship with the pressure sequence and boiler’s load under different working conditions.Since pressure sequence contains complex information,it demands feature extraction methods from multi-aspect consideration.In this paper,fuzzy c-means analysis method based on weighted validity index(VFCM)has been proposed for the working condition classification based on feature extraction.To deal with the fluctuating and time-varying pressure sequence,feature extraction is taken as nonlinear analysis based on entropy theory.Three kinds of entropy values,extracted from pressure sequence in time-frequency domain,are studied as the clustering objects for work condition classification.Weighted validity index,taking the close and separation degree into consideration,is calculated on the base of Silhouette index and Krzanowski-Lai index to obtain the optimal clustering number.Each time FCM runs,the weighted validity index evaluates the clustering result and the optimal clustering number will be obtained when it reaches the maximum value.Four datasets from UCI Machine Learning Repository are presented to certify the effectiveness in VFCM.Pressure sequences got from a 300 MW boiler are then taken for case study.The result of the pressure sequence case study with an error rate of 0.5332%shows the valuable information on boiler’s load and pressure sequence in furnace.The relationship between boiler’s load and entropy values extracted from pressure sequence is proposed.Moreover,the method can be considered to be a reference method for data mining in other fluctuating and time-varying sequences.展开更多
基金Supported by the National Natural Science Foundation of China(61139002)~~
文摘Partition-based clustering with weighted feature is developed in the framework of shadowed sets. The objects in the core and boundary regions, generated by shadowed sets-based clustering, have different impact on the prototype of each cluster. By integrating feature weights, a formula for weight calculation is introduced to the clustering algorithm. The selection of weight exponent is crucial for good result and the weights are updated iteratively with each partition of clusters. The convergence of the weighted algorithms is given, and the feasible cluster validity indices of data mining application are utilized. Experimental results on both synthetic and real-life numerical data with different feature weights demonstrate that the weighted algorithm is better than the other unweighted algorithms.
基金supported in part by the Doctoral Students’Short Term Study Abroad Scholarship Fund of Xidian Universitythe National Natural Science Foundation of China(61873342,61672400,62076189)+1 种基金the Recruitment Program of Global Expertsthe Science and Technology Development Fund,MSAR(0012/2019/A1)。
文摘In this paper,we elaborate on residual-driven Fuzzy C-Means(FCM)for image segmentation,which is the first approach that realizes accurate residual(noise/outliers)estimation and enables noise-free image to participate in clustering.We propose a residual-driven FCM framework by integrating into FCM a residual-related regularization term derived from the distribution characteristic of different types of noise.Built on this framework,a weighted?2-norm regularization term is presented by weighting mixed noise distribution,thus resulting in a universal residual-driven FCM algorithm in presence of mixed or unknown noise.Besides,with the constraint of spatial information,the residual estimation becomes more reliable than that only considering an observed image itself.Supporting experiments on synthetic,medical,and real-world images are conducted.The results demonstrate the superior effectiveness and efficiency of the proposed algorithm over its peers.
基金supported by the National Research and Development Project of China (2020YFB2008400).
文摘The hardness prediction model was established by support vector regression(SVR).In order to avoid exaggerating the contribution of very tiny alloying elements,a weighted fuzzy C-means(WFCM)algorithm was proposed for data clustering using improved Mahalanobis distance based on random forest importance values,which could play a full role of important features and avoid clustering center overlap.The samples were divided into two classes.The top 10 features of each class were selected to form two feature subsets for better performance of the model.The dimension and dispersion of features decreased in such feature subsets.Comparing four machine learning algorithms,SVR had the best performance and was chosen to modeling.The hyper-parameters of the SVR model were optimized by particle swarm optimization.The samples in validation set were classified according to minimum distance of sample to clustering centers,and then the SVR model trained by feature subset of corresponding class was used for prediction.Compared with the feature subset of original data set,the predicted values of model trained by feature subsets of classified samples by WFCM had higher correlation coefficient and lower root mean square error.It indicated that WFCM was an effective method to reduce the dispersion of features and improve the accuracy of model.
基金supported by the National Natural Science Foundation of China(Grant Nos.51305179&51305177)the Natural Science Foundation of Jiangsu Higher Education Institutions(Grant No.13KJB510009)
文摘Solder bump technology has been widely used in electronic packaging. With the development of solder bumps towards higher density and finer pitch, it is more difficult to inspect the defects of solder bumps as they are hidden in the package. A nondestructive method using the transient active thermography has been proposed to inspect the defects of a solder bump, and we aim at developing an intelligent diagnosis system to eliminate the influence of emissivity unevenness and non-uniform heating on defects recognition in active infrared testing. An improved fuzzy c-means(FCM) algorithm based on the entropy weights is investigated in this paper. The captured thermograms are preprocessed to enhance the thermal contrast between the defective and good bumps. Hot spots corresponding to 16 solder bumps are segmented from the thermal images. The statistical features are calculated and selected appropriately to characterize the status of solder bumps in FCM clustering. The missing bump is identified in the FCM result, which is also validated by the principle component analysis. The intelligent diagnosis system using FCM algorithm with the entropy weights is effective for defects recognition in electronic packages.
基金supported by the Natural Science Foundation of China under contact(61233007)
文摘Wind farm power prediction is proposed based on adaptive feature weight entropy fuzzy clustering algorithm.According to the fuzzy clustering method,a large number of historical data of a wind farm in Inner Mongolia are analyzed and classified.Model of adaptive entropy weight for clustering is built.Wind power prediction model based on adaptive entropy fuzzy clustering feature weights is built.Simulation results show that the proposed method could distinguish the abnormal data and forecast more accurately and compute fastly.
基金supported by the Youth Foundation of the National Science Foundation of China(62001503)the Excellent Youth Scholar of the National Defense Science and Technology Foundation of China(2017-JCJQ-ZQ-003)the Special Fund for Taishan Scholar Project(ts201712072).
文摘When the attributes of unknown targets are not just numerical attributes,but hybrid attributes containing linguistic attributes,the existing recognition methods are not effective.In addition,it is more difficult to identify the unknown targets densely distributed in the feature space,especially when there is interval overlap between attribute measurements of different target classes.To address these problems,a novel method based on intuitionistic fuzzy comprehensive evaluation model(IFCEM)is proposed.For numerical attributes,targets in the database are divided into individual classes and overlapping classes,and for linguistic attributes,continuous interval-valued linguistic term set(CIVLTS)is used to describe target characteristic.A cloud modelbased method and an area-based method are proposed to obtain intuitionistic fuzzy decision information of query target on numerical attributes and linguistic attributes respectively.An improved inverse weighted kernel fuzzy c-means(IWK-FCM)algorithm is proposed for solution of attribute weight vector.The possibility matrix is applied to determine the identity and category of query target.Finally,a case study composed of parameter sensitivity analysis,recognition accuracy analysis.and comparison with other methods,is taken to verify the superiority of the proposed method.
基金supported by the National Key Research and Development Program of China Project(Grant No.2018YFC0603502)
文摘Gravity inversion requires much computation,and inversion results are often non-unique.The first problem is often due to the large number of grid cells.Edge detection method,i.e.,tilt angle method of analytical signal amplitude(TAS),helps to identify the boundaries of underground geological anomalies at different depths,which can be used to optimize the grid and reduce the number of grid cells.The requirement of smooth inversion is that the boundaries of the meshing area should be continuous rather than jagged.In this paper,the optimized meshing strategy is improved,and the optimized meshing region obtained by the TAS is changed to a regular region to facilitate the smooth inversion.For the second problem,certain constraints can be used to improve the accuracy of inversion.The results of analytic signal amplitude(ASA)are used to delineate the central distribution of geological bodies.We propose a new method using the results of ASA to perform local constraints to reduce the non-uniqueness of inversion.The guided fuzzy c-means(FCM)clustering algorithm combined with priori petrophysical information is also used to reduce the non-uniqueness of gravity inversion.The Open Acc technology is carried out to speed up the computation for parallelizing the serial program on GPU.In general,the TAS is used to reduce the number of grid cells.The local weighting and priori petrophysical constraint are used in conjunction with the FCM algorithm during the inversion,which improves the accuracy of inversion.The inversion is accelerated by the Open Acc technology on GPU.The proposed method is validated using synthetic data,and the results show that the efficiency and accuracy of gravity inversion are greatly improved by using the proposed method.
文摘To improve the computational performance of the fuzzy C-means(FCM)algorithm used in dataset clus-tering with large numbers,the concepts of the equivalent samples and the weighting samples based on eigenvalue distribution of the samples in the feature space were intro-duced and a novel fast cluster algorithm named weighted fuzzy C-means(WFCM)algorithm was put forward,which came from the traditional FCM algorithm.It was proved that the cluster results were equivalent in dataset with two different cluster algorithms:WFCM and FCM.Furthermore,the WFCM algorithm had better computational performance than the ordinary FCM algorithm.The experiment of the gray image segmentation showed that the WFCM algorithm is a fast and effective cluster algorithm.
文摘The opening of gated community to expand the micro-road network in the urban traffic system is a hot topic on the urban congestion. To satisfy the demand of opening early choosing case, this paper proposed a comprehensive selecting framework on qualified communities and its appropriate opening time. Firstly, the static influential factors on internal road structure, boundary road structure and traffic flow are qualitatively analyzed. Then, an evaluation opening state index system based on describing accurately traffic flow state is obtained, which takes the opening factors into account at the boundary road network. In this structure, the modified fuzzy C-means (FCM) method calculates the fuzzy entropy weight and range of each opening states index. Finally, the simulation results show that the proposed method is capable of selecting qualified community and the optimum opening time.
基金This work was supported by National Research Foundation of Korea Grant funded by the Korean Government(NRF-2010-D00065)the Grant of the Korean Ministry of Education,Science and Technology(The Regional Core Research Program/Center of Healthcare Technology Development)the GRRC program of Gyeonggi province[GRRC SUWON 2011-B2,Center for U-city Security&Surveillance Technology].
文摘Purpose–The purpose of this paper is to consider the concept of Fuzzy Radial Basis Function Neural Networks with Information Granulation(IG-FRBFNN)and their optimization realized by means of the Multiobjective Particle Swarm Optimization(MOPSO).Design/methodology/approach–In fuzzy modeling,complexity,interpretability(or simplicity)as well as accuracy of the obtained model are essential design criteria.Since the performance of the IG-RBFNN model is directly affected by some parameters,such as the fuzzification coefficient used in the FCM,the number of rules and the orders of the polynomials in the consequent parts of the rules,the authors carry out both structural as well as parametric optimization of the network.A multi-objective Particle Swarm Optimization using Crowding Distance(MOPSO-CD)as well as O/WLS learning-based optimization are exploited to carry out the structural and parametric optimization of the model,respectively,while the optimization is of multiobjective character as it is aimed at the simultaneous minimization of complexity and maximization of accuracy.Findings–The performance of the proposed model is illustrated with the aid of three examples.The proposed optimization method leads to an accurate and highly interpretable fuzzy model.Originality/value–A MOPSO-CD as well as O/WLS learning-based optimization are exploited,respectively,to carry out the structural and parametric optimization of the model.As a result,the proposed methodology is interesting for designing an accurate and highly interpretable fuzzy model.
基金supported by the National Natural Science Foundation of China(Grant No.51176030)Jiangsu Science and Technology Department(Grant No.BY2015070-17)
文摘The furnace process is very important in boiler operation,and furnace pressure works as an important parameter in furnace process.Therefore,there is a need to analyze and monitor the pressure signal in furnace.However,little work has been conducted on the relationship with the pressure sequence and boiler’s load under different working conditions.Since pressure sequence contains complex information,it demands feature extraction methods from multi-aspect consideration.In this paper,fuzzy c-means analysis method based on weighted validity index(VFCM)has been proposed for the working condition classification based on feature extraction.To deal with the fluctuating and time-varying pressure sequence,feature extraction is taken as nonlinear analysis based on entropy theory.Three kinds of entropy values,extracted from pressure sequence in time-frequency domain,are studied as the clustering objects for work condition classification.Weighted validity index,taking the close and separation degree into consideration,is calculated on the base of Silhouette index and Krzanowski-Lai index to obtain the optimal clustering number.Each time FCM runs,the weighted validity index evaluates the clustering result and the optimal clustering number will be obtained when it reaches the maximum value.Four datasets from UCI Machine Learning Repository are presented to certify the effectiveness in VFCM.Pressure sequences got from a 300 MW boiler are then taken for case study.The result of the pressure sequence case study with an error rate of 0.5332%shows the valuable information on boiler’s load and pressure sequence in furnace.The relationship between boiler’s load and entropy values extracted from pressure sequence is proposed.Moreover,the method can be considered to be a reference method for data mining in other fluctuating and time-varying sequences.