期刊文献+
共找到92篇文章
< 1 2 5 >
每页显示 20 50 100
Plant Leaf Diseases Classification Using Improved K-Means Clustering and SVM Algorithm for Segmentation
1
作者 Mona Jamjoom Ahmed Elhadad +1 位作者 Hussein Abulkasim Safia Abbas 《Computers, Materials & Continua》 SCIE EI 2023年第7期367-382,共16页
Several pests feed on leaves,stems,bases,and the entire plant,causing plant illnesses.As a result,it is vital to identify and eliminate the disease before causing any damage to plants.Manually detecting plant disease ... Several pests feed on leaves,stems,bases,and the entire plant,causing plant illnesses.As a result,it is vital to identify and eliminate the disease before causing any damage to plants.Manually detecting plant disease and treating it is pretty challenging in this period.Image processing is employed to detect plant disease since it requires much effort and an extended processing period.The main goal of this study is to discover the disease that affects the plants by creating an image processing system that can recognize and classify four different forms of plant diseases,including Phytophthora infestans,Fusarium graminearum,Puccinia graminis,tomato yellow leaf curl.Therefore,this work uses the Support vector machine(SVM)classifier to detect and classify the plant disease using various steps like image acquisition,Pre-processing,Segmentation,feature extraction,and classification.The gray level co-occurrence matrix(GLCM)and the local binary pattern features(LBP)are used to identify the disease-affected portion of the plant leaf.According to experimental data,the proposed technology can correctly detect and diagnose plant sickness with a 97.2 percent accuracy. 展开更多
关键词 SVM machine learning GLCM algorithm k-means clustering LBP
下载PDF
Genetic Algorithm Combined with the K-Means Algorithm:A Hybrid Technique for Unsupervised Feature Selection
2
作者 Hachemi Bennaceur Meznah Almutairy Norah Alhussain 《Intelligent Automation & Soft Computing》 SCIE 2023年第9期2687-2706,共20页
The dimensionality of data is increasing very rapidly,which creates challenges for most of the current mining and learning algorithms,such as large memory requirements and high computational costs.The literature inclu... The dimensionality of data is increasing very rapidly,which creates challenges for most of the current mining and learning algorithms,such as large memory requirements and high computational costs.The literature includes much research on feature selection for supervised learning.However,feature selection for unsupervised learning has only recently been studied.Finding the subset of features in unsupervised learning that enhances the performance is challenging since the clusters are indeterminate.This work proposes a hybrid technique for unsupervised feature selection called GAk-MEANS,which combines the genetic algorithm(GA)approach with the classical k-Means algorithm.In the proposed algorithm,a new fitness func-tion is designed in addition to new smart crossover and mutation operators.The effectiveness of this algorithm is demonstrated on various datasets.Fur-thermore,the performance of GAk-MEANS has been compared with other genetic algorithms,such as the genetic algorithm using the Sammon Error Function and the genetic algorithm using the Sum of Squared Error Function.Additionally,the performance of GAk-MEANS is compared with the state-of-the-art statistical unsupervised feature selection techniques.Experimental results show that GAk-MEANS consistently selects subsets of features that result in better classification accuracy compared to others.In particular,GAk-MEANS is able to significantly reduce the size of the subset of selected features by an average of 86.35%(72%–96.14%),which leads to an increase of the accuracy by an average of 3.78%(1.05%–6.32%)compared to using all features.When compared with the genetic algorithm using the Sammon Error Function,GAk-MEANS is able to reduce the size of the subset of selected features by 41.29%on average,improve the accuracy by 5.37%,and reduce the time by 70.71%.When compared with the genetic algorithm using the Sum of Squared Error Function,GAk-MEANS on average is able to reduce the size of the subset of selected features by 15.91%,and improve the accuracy by 9.81%,but the time is increased by a factor of 3.When compared with the machine-learning based methods,we observed that GAk-MEANS is able to increase the accuracy by 13.67%on average with an 88.76%average increase in time. 展开更多
关键词 Genetic algorithm unsupervised feature selection k-means clustering
下载PDF
Stability prediction of hard rock pillar using support vector machine optimized by three metaheuristic algorithms 被引量:3
3
作者 Chuanqi Li Jian Zhou +1 位作者 Kun Du Daniel Dias 《International Journal of Mining Science and Technology》 SCIE EI CAS CSCD 2023年第8期1019-1036,共18页
Hard rock pillar is one of the important structures in engineering design and excavation in underground mines.Accurate and convenient prediction of pillar stability is of great significance for underground space safet... Hard rock pillar is one of the important structures in engineering design and excavation in underground mines.Accurate and convenient prediction of pillar stability is of great significance for underground space safety.This paper aims to develop hybrid support vector machine(SVM)models improved by three metaheuristic algorithms known as grey wolf optimizer(GWO),whale optimization algorithm(WOA)and sparrow search algorithm(SSA)for predicting the hard rock pillar stability.An integrated dataset containing 306 hard rock pillars was established to generate hybrid SVM models.Five parameters including pillar height,pillar width,ratio of pillar width to height,uniaxial compressive strength and pillar stress were set as input parameters.Two global indices,three local indices and the receiver operating characteristic(ROC)curve with the area under the ROC curve(AUC)were utilized to evaluate all hybrid models’performance.The results confirmed that the SSA-SVM model is the best prediction model with the highest values of all global indices and local indices.Nevertheless,the performance of the SSASVM model for predicting the unstable pillar(AUC:0.899)is not as good as those for stable(AUC:0.975)and failed pillars(AUC:0.990).To verify the effectiveness of the proposed models,5 field cases were investigated in a metal mine and other 5 cases were collected from several published works.The validation results indicated that the SSA-SVM model obtained a considerable accuracy,which means that the combination of SVM and metaheuristic algorithms is a feasible approach to predict the pillar stability. 展开更多
关键词 Underground pillar stability Hard rock Support vector machine Metaheuristic algorithms
下载PDF
Estimating wheat fractional vegetation cover using a density peak k-means algorithm based on hyperspectral image data 被引量:3
4
作者 LIU Da-zhong YANG Fei-fei LIU Sheng-ping 《Journal of Integrative Agriculture》 SCIE CAS CSCD 2021年第11期2880-2891,共12页
Fractional vegetation cover(FVC)is an important parameter to measure crop growth.In studies of crop growth monitoring,it is very important to extract FVC quickly and accurately.As the most widely used FVC extraction m... Fractional vegetation cover(FVC)is an important parameter to measure crop growth.In studies of crop growth monitoring,it is very important to extract FVC quickly and accurately.As the most widely used FVC extraction method,the photographic method has the advantages of simple operation and high extraction accuracy.However,when soil moisture and acquisition times vary,the extraction results are less accurate.To accommodate various conditions of FVC extraction,this study proposes a new FVC extraction method that extracts FVC from a normalized difference vegetation index(NDVI)greyscale image of wheat by using a density peak k-means(DPK-means)algorithm.In this study,Yangfumai 4(YF4)planted in pots and Yangmai 16(Y16)planted in the field were used as the research materials.With a hyperspectral imaging camera mounted on a tripod,ground hyperspectral images of winter wheat under different soil conditions(dry and wet)were collected at 1 m above the potted wheat canopy.Unmanned aerial vehicle(UAV)hyperspectral images of winter wheat at various stages were collected at 50 m above the field wheat canopy by a UAV equipped with a hyperspectral camera.The pixel dichotomy method and DPK-means algorithm were used to classify vegetation pixels and non-vegetation pixels in NDVI greyscale images of wheat,and the extraction effects of the two methods were compared and analysed.The results showed that extraction by pixel dichotomy was influenced by the acquisition conditions and its error distribution was relatively scattered,while the extraction effect of the DPK-means algorithm was less affected by the acquisition conditions and its error distribution was concentrated.The absolute values of error were 0.042 and 0.044,the root mean square errors(RMSE)were 0.028 and 0.030,and the fitting accuracy R2 of the FVC was 0.87 and 0.93,under dry and wet soil conditions and under various time conditions,respectively.This study found that the DPK-means algorithm was capable of achieving more accurate results than the pixel dichotomy method in various soil and time conditions and was an accurate and robust method for FVC extraction. 展开更多
关键词 fractional vegetation cover k-means algorithm NDVI vegetation index WHEAT
下载PDF
Polarimetric Meteorological Satellite Data Processing Software Classification Based on Principal Component Analysis and Improved K-Means Algorithm 被引量:1
5
作者 Manyun Lin Xiangang Zhao +3 位作者 Cunqun Fan Lizi Xie Lan Wei Peng Guo 《Journal of Geoscience and Environment Protection》 2017年第7期39-48,共10页
With the increasing variety of application software of meteorological satellite ground system, how to provide reasonable hardware resources and improve the efficiency of software is paid more and more attention. In th... With the increasing variety of application software of meteorological satellite ground system, how to provide reasonable hardware resources and improve the efficiency of software is paid more and more attention. In this paper, a set of software classification method based on software operating characteristics is proposed. The method uses software run-time resource consumption to describe the software running characteristics. Firstly, principal component analysis (PCA) is used to reduce the dimension of software running feature data and to interpret software characteristic information. Then the modified K-means algorithm was used to classify the meteorological data processing software. Finally, it combined with the results of principal component analysis to explain the significance of various types of integrated software operating characteristics. And it is used as the basis for optimizing the allocation of software hardware resources and improving the efficiency of software operation. 展开更多
关键词 Principal COMPONENT ANALYSIS Improved k-mean algorithm METEOROLOGICAL Data Processing FEATURE ANALYSIS SIMILARITY algorithm
下载PDF
Similarity matrix-based K-means algorithm for text clustering
6
作者 曹奇敏 郭巧 吴向华 《Journal of Beijing Institute of Technology》 EI CAS 2015年第4期566-572,共7页
K-means algorithm is one of the most widely used algorithms in the clustering analysis. To deal with the problem caused by the random selection of initial center points in the traditional al- gorithm, this paper propo... K-means algorithm is one of the most widely used algorithms in the clustering analysis. To deal with the problem caused by the random selection of initial center points in the traditional al- gorithm, this paper proposes an improved K-means algorithm based on the similarity matrix. The im- proved algorithm can effectively avoid the random selection of initial center points, therefore it can provide effective initial points for clustering process, and reduce the fluctuation of clustering results which are resulted from initial points selections, thus a better clustering quality can be obtained. The experimental results also show that the F-measure of the improved K-means algorithm has been greatly improved and the clustering results are more stable. 展开更多
关键词 text clustering k-means algorithm similarity matrix F-MEASURE
下载PDF
A Hybrid Method Combining Improved K-means Algorithm with BADA Model for Generating Nominal Flight Profiles
7
作者 Tang Xinmin Gu Junwei +2 位作者 Shen Zhiyuan Chen Ping Li Bo 《Transactions of Nanjing University of Aeronautics and Astronautics》 EI CSCD 2016年第4期414-424,共11页
A high-precision nominal flight profile,involving controllers′intentions is critical for 4Dtrajectory estimation in modern automatic air traffic control systems.We proposed a novel method to effectively improve the a... A high-precision nominal flight profile,involving controllers′intentions is critical for 4Dtrajectory estimation in modern automatic air traffic control systems.We proposed a novel method to effectively improve the accuracy of the nominal flight profile,including the nominal altitude profile and the speed profile.First,considering the characteristics of trajectory data,we developed an improved K-means algorithm.The approach was to measure the similarity between different altitude profiles by integrating the space warp edit distance algorithm,thereby to acquire several fitted nominal flight altitude profiles.This approach breaks the constraints of traditional K-means algorithms.Second,to eliminate the influence of meteorological factors,we introduced historical gridded binary data to determine the en-route wind speed and temperature via inverse distance weighted interpolation.Finally,we facilitated the true airspeed determined by speed triangle relationships and the calibrated airspeed determined by aircraft data model to extract a more accurate nominal speed profile from each cluster,therefore we could describe the airspeed profiles above and below the airspeed transition altitude,respectively.Our experimental results showed that the proposed method could obtain a highly accurate nominal flight profile,which reflects the actual aircraft flight status. 展开更多
关键词 air transportation flight profile k-means algorithm space warp edit distance(SWED)algorithm trajectory prediction
下载PDF
An Improved K-Means Algorithm Based on Initial Clustering Center Optimization
8
作者 LI Taihao NAREN Tuya +2 位作者 ZHOU Jianshe REN Fuji LIU Shupeng 《ZTE Communications》 2017年第B12期43-46,共4页
The K-means algorithm is widely known for its simplicity and fastness in text clustering.However,the selection of the initial clus?tering center with the traditional K-means algorithm is some random,and therefore,the ... The K-means algorithm is widely known for its simplicity and fastness in text clustering.However,the selection of the initial clus?tering center with the traditional K-means algorithm is some random,and therefore,the fluctuations and instability of the clustering results are strongly affected by the initial clustering center.This paper proposed an algorithm to select the initial clustering center to eliminate the uncertainty of central point selection.The experiment results show that the improved K-means clustering algorithm is superior to the traditional algorithm. 展开更多
关键词 CLUSTERING k-means algorithm initial clustering center
下载PDF
A State of Art Analysis of Telecommunication Data by k-Means and k-Medoids Clustering Algorithms
9
作者 T. Velmurugan 《Journal of Computer and Communications》 2018年第1期190-202,共13页
Cluster analysis is one of the major data analysis methods widely used for many practical applications in emerging areas of data mining. A good clustering method will produce high quality clusters with high intra-clus... Cluster analysis is one of the major data analysis methods widely used for many practical applications in emerging areas of data mining. A good clustering method will produce high quality clusters with high intra-cluster similarity and low inter-cluster similarity. Clustering techniques are applied in different domains to predict future trends of available data and its uses for the real world. This research work is carried out to find the performance of two of the most delegated, partition based clustering algorithms namely k-Means and k-Medoids. A state of art analysis of these two algorithms is implemented and performance is analyzed based on their clustering result quality by means of its execution time and other components. Telecommunication data is the source data for this analysis. The connection oriented broadband data is given as input to find the clustering quality of the algorithms. Distance between the server locations and their connection is considered for clustering. Execution time for each algorithm is analyzed and the results are compared with one another. Results found in comparison study are satisfactory for the chosen application. 展开更多
关键词 k-means algorithm k-Medoids algorithm DATA CLUSTERING Time COMPLEXITY TELECOMMUNICATION DATA
下载PDF
Multiple Parameter Based Clustering (MPC): Prospective Analysis for Effective Clustering in Wireless Sensor Network (WSN) Using K-Means Algorithm
10
作者 Md. Asif Khan Israfil Tamim +1 位作者 Emdad Ahmed M. Abdul Awal 《Wireless Sensor Network》 2012年第1期18-24,共7页
In wireless sensor network cluster architecture is useful because of its inherent suitability for data fusion. In this paper we represent a new approach called Multiple Parameter based Clustering (MPC) embedded with t... In wireless sensor network cluster architecture is useful because of its inherent suitability for data fusion. In this paper we represent a new approach called Multiple Parameter based Clustering (MPC) embedded with the traditional k-means algorithm which takes different parameters (Node energy level, Euclidian distance from the base station, RSSI, Latency of data to reach base station) into consideration to form clusters. Then the effectiveness of the clusters is evaluated based on the uniformity of the node distribution, Node range per cluster, Intra and Inter cluster distance and required energy level of each centroid. Our result shows that by varying multiple parameters we can create clusters with more uniformly distributed nodes, minimize intra and maximize inter cluster distance and elect less power consuming centroid. 展开更多
关键词 k-means algorithm Energy Efficient UNIFORM Distribution RSSI LATENCY
下载PDF
Hybrid Genetic Algorithm with K-Means for Clustering Problems
11
作者 Ahamed Al Malki Mohamed M. Rizk +1 位作者 M. A. El-Shorbagy A. A. Mousa 《Open Journal of Optimization》 2016年第2期71-83,共14页
The K-means method is one of the most widely used clustering methods and has been implemented in many fields of science and technology. One of the major problems of the k-means algorithm is that it may produce empty c... The K-means method is one of the most widely used clustering methods and has been implemented in many fields of science and technology. One of the major problems of the k-means algorithm is that it may produce empty clusters depending on initial center vectors. Genetic Algorithms (GAs) are adaptive heuristic search algorithm based on the evolutionary principles of natural selection and genetics. This paper presents a hybrid version of the k-means algorithm with GAs that efficiently eliminates this empty cluster problem. Results of simulation experiments using several data sets prove our claim. 展开更多
关键词 Cluster Analysis Genetic algorithm k-means
下载PDF
An Approximation Algorithm Based on Seeding Algorithm for Fuzzy k-Means Problem with Penalties
12
作者 Wen-Zhao Liu Min Li 《Journal of the Operations Research Society of China》 EI CSCD 2024年第2期387-409,共23页
As a classic NP-hard problem in machine learning and computational geometry,the k-means problem aims to partition the given dataset into k clusters according to the minimal squared Euclidean distance.Different from k-... As a classic NP-hard problem in machine learning and computational geometry,the k-means problem aims to partition the given dataset into k clusters according to the minimal squared Euclidean distance.Different from k-means problem and most of its variants,fuzzy k-means problem belongs to the soft clustering problem,where each given data point has relationship to every center point.Compared to fuzzy k-means problem,fuzzy k-means problem with penalties allows that some data points need not be clustered instead of being paid penalties.In this paper,we propose an O(αk In k)-approximation algorithm based on seeding algorithm for fuzzy k-means problem with penalties,whereαinvolves the ratio of the maximal penalty value to the minimal one.Furthermore,we implement numerical experiments to show the effectiveness of our algorithm. 展开更多
关键词 Approximation algorithm.Seeding algorithm Fuzzy k-means problem with penalties
原文传递
Community Detection in Aviation Network Based on K-means and Complex Network
13
作者 Hang He Zhenhan Zhao +1 位作者 Weiwei Luo Jinghui Zhang 《Computer Systems Science & Engineering》 SCIE EI 2021年第11期251-264,共14页
With the increasing number of airports and the expansion of their scale,the aviation network has become complex and hierarchical.In order to investigate the complex network characteristics of aviation networks,this pa... With the increasing number of airports and the expansion of their scale,the aviation network has become complex and hierarchical.In order to investigate the complex network characteristics of aviation networks,this paper constructs a Chinese aviation network model and carries out related research based on complex network theory and K-means algorithm.Initially,the P-space model is employed to construct the Chinese aviation network model.Then,complex network indicators such as degree,clustering coefficient,average path length,betweenness and coreness are selected to investigate the complex characteristics and hierarchical features of aviation networks and explore their causes.Secondly,using K-means clustering algorithm,five values are obtained as the initial clustering parameter K values for each of the aviation network hierarchies classified according to five complex network indicators.Meanwhile,clustering simulation experiments are conducted to obtain the visual clustering results of Chinese aviation network nodes under different K values,as well as silhouette coefficients for evaluating the clustering effect of each indicator in order to obtain the hierarchical classification of aviation networks under different indicators.Finally,the silhouette coefficient is optimal when the K value is 4.Thus,the clustering results of the four layers of the aviation network can be obtained.According to the experimental results,the complex network association discovery method combined with K-means algorithm has better applicability and simplicity,while the accuracy is improved. 展开更多
关键词 k-means algorithm complex network community detection aviation network
下载PDF
A Tradeoff Between Accuracy and Speed for K-Means Seed Determination
14
作者 Farzaneh Khorasani Morteza Mohammadi Zanjireh +1 位作者 Mahdi Bahaghighat Qin Xin 《Computer Systems Science & Engineering》 SCIE EI 2022年第3期1085-1098,共14页
With a sharp increase in the information volume,analyzing and retrieving this vast data volume is much more essential than ever.One of the main techniques that would be beneficial in this regard is called the Clusteri... With a sharp increase in the information volume,analyzing and retrieving this vast data volume is much more essential than ever.One of the main techniques that would be beneficial in this regard is called the Clustering method.Clustering aims to classify objects so that all objects within a cluster have similar features while other objects in different clusters are as distinct as possible.One of the most widely used clustering algorithms with the well and approved performance in different applications is the k-means algorithm.The main problem of the k-means algorithm is its performance which can be directly affected by the selection in the primary clusters.Lack of attention to this crucial issue has consequences such as creating empty clusters and decreasing the convergence time.Besides,the selection of appropriate initial seeds can reduce the cluster’s inconsistency.In this paper,we present a new method to determine the initial seeds of the k-mean algorithm to improve the accuracy and decrease the number of iterations of the algorithm.For this purpose,a new method is proposed considering the average distance between objects to determine the initial seeds.Our method attempts to provide a proper tradeoff between the accuracy and speed of the clustering algorithm.The experimental results showed that our proposed approach outperforms the Chithra with 1.7%and 2.1%in terms of clustering accuracy for Wine and Abalone detection data,respectively.Furthermore,achieved results indicate that comparing with the Reverse Nearest Neighbor(RNN)search approach,the proposed method has a higher convergence speed. 展开更多
关键词 Data clustering k-means algorithm information retrieval outlier detection clustering accuracy unsupervised learning
下载PDF
P-ROCK: A Sustainable Clustering Algorithm for Large Categorical Datasets
15
作者 Ayman Altameem Ramesh Chandra Poonia +2 位作者 Ankit Kumar Linesh Raja Abdul Khader Jilani Saudagar 《Intelligent Automation & Soft Computing》 SCIE 2023年第1期553-566,共14页
Data clustering is crucial when it comes to data processing and analytics.The new clustering method overcomes the challenge of evaluating and extracting data from big data.Numerical or categorical data can be grouped.... Data clustering is crucial when it comes to data processing and analytics.The new clustering method overcomes the challenge of evaluating and extracting data from big data.Numerical or categorical data can be grouped.Existing clustering methods favor numerical data clustering and ignore categorical data clustering.Until recently,the only way to cluster categorical data was to convert it to a numeric representation and then cluster it using current numeric clustering methods.However,these algorithms could not use the concept of categorical data for clustering.Following that,suggestions for expanding traditional categorical data processing methods were made.In addition to expansions,several new clustering methods and extensions have been proposed in recent years.ROCK is an adaptable and straightforward algorithm for calculating the similarity between data sets to cluster them.This paper aims to modify the algo-rithm by creating a parameterized version that takes specific algorithm parameters as input and outputs satisfactory cluster structures.The parameterized ROCK algorithm is the name given to the modified algorithm(P-ROCK).The proposed modification makes the original algorithm moreflexible by using user-defined parameters.A detailed hypothesis was developed later validated with experimental results on real-world datasets using our proposed P-ROCK algorithm.A comparison with the original ROCK algorithm is also provided.Experiment results show that the proposed algorithm is on par with the original ROCK algorithm with an accuracy of 97.9%.The proposed P-ROCK algorithm has improved the runtime and is moreflexible and scalable. 展开更多
关键词 ROCK k-means algorithm clustering approaches unsupervised learning K-histogram
下载PDF
Enhanced Detection of Cerebral Atherosclerosis Using Hybrid Algorithm of Image Segmentation
16
作者 Shakunthala Masi Helenprabha Kuttiappan 《Intelligent Automation & Soft Computing》 SCIE 2023年第4期733-744,共12页
In medical science for envisaging human body’s phenomenal structure a major part has been driven by image processing techniques.Major objective of this work is to detect of cerebral atherosclerosis for image segmenta... In medical science for envisaging human body’s phenomenal structure a major part has been driven by image processing techniques.Major objective of this work is to detect of cerebral atherosclerosis for image segmentation applica-tion.Detection of some abnormal structures in human body has become a difficult task to complete with some simple images.For expounding and distinguishing neural architecture of human brain in an effective manner,MRI(Magnetic Reso-nance Imaging)is one of the most suitable and significant technique.Here we work on detection of Cerebral Atherosclerosis from MRI images of patients.Cer-ebral Atherosclerosis is a cerebral vascular disease causes narrowing of the arteries due to buildup of fatty plaque inside the blood vessels of the brain.It leads to Ischemic stroke if not diagnosed early.Stroke affects majorly old age people and percentage of affected women is more compared to men.Results:Preproces-sing is done by using alpha trimmed meanfilter which is used to remove noise and also it enhances the image.Segmentation of cerebral atherosclerosis is done by using K-means clustering,Contextual clustering,and proposed Hybrid algo-rithm.Various parameters like Correlation,Pixel density,energy is determined and from the analysis of parameters it is determined that proposed Hybrid algo-rithm is efficient. 展开更多
关键词 ATHEROSCLEROSIS Ischemic stroke Alpha trimmed meanfilter k-means Contextual clustering Hybrid algorithm
下载PDF
Micro Calcification Detection in Mammogram Images Using Contiguous Convolutional Neural Network Algorithm
17
作者 P.Gomathi C.Muniraj P.S.Periasamy 《Computer Systems Science & Engineering》 SCIE EI 2023年第5期1887-1899,共13页
The mortality rate decreases as the early detection of Breast Cancer(BC)methods are emerging very fast,and when the starting stage of BC is detected,it is curable.The early detection of the disease depends on the imag... The mortality rate decreases as the early detection of Breast Cancer(BC)methods are emerging very fast,and when the starting stage of BC is detected,it is curable.The early detection of the disease depends on the image processing techniques,and it is used to identify the disease easily and accurately,especially the micro calcifications are visible on mammography when they are 0.1 mm or bigger,and cancer cells are about 0.03 mm,which is crucial for identifying in the BC area.To achieve this micro calcification in the BC images,it is necessary to focus on the four main steps presented in this work.There are three significant stages of the process assigned to find the BC using a thermal image;the image processing procedures are described below.In the first stage of the process,the Gaussian filter technique is implemented to magnify the screening image.During the second stage,BC detection is separated from the pre-processed image.The Proposed Versatile K-means clustering(VKC)algorithm with segmentation is used to identify the BC detection form of the screening image.The centroids are then recalculated using proposed VKC,which takes the mean of all data points allocated to that centroid’s cluster,lowering the overall intracluster variance in comparison to the prior phase.The“means”in K-means refers to the process of averaging the data and determining a new centroid.This process eliminates unnecessary areas of interest.First,the mammogram screening image information is taken from the patient and begins with the Contiguous Convolutional Neural Network(CCNN)method.The proposed CCNN is used to classify the Micro calcification in the BC spot using the feature values is the fourth stage of the process.The assess the presence of high-definition digital infrared thermography technology and knowledge base and suggests that future diagnostic and treatment services in breast cancer imaging will be developed.The use of sophisticated CCNN techniques in thermography is being developed to attain a greater level of consistency.The implemented(CCNN)technique’s performance is examined with different classification parameters like Recall,Precision,F-measure and accuracy.Finally,the Breast Cancer stages will be classified based on the true positive and true negative values. 展开更多
关键词 Contiguous Convolutional Neural Network(CCNN) Gaussian filter Versatile k-means Clustering(VKC)algorithm mammogram cancer detection
下载PDF
Local search yields a PTAS for fixed-dimensional k-means problem with penalties
18
作者 Fan Yuan Da-Chuan Xu +1 位作者 Dong-Lei Du Dong-Mei Zhang 《Journal of the Operations Research Society of China》 EI CSCD 2024年第2期351-362,共12页
We study a problem called the k-means problem with penalties(k-MPWP),which is a natural generalization of the typical k-means problem.In this problem,we have a set D of client points in R^(d),a set F of possible cente... We study a problem called the k-means problem with penalties(k-MPWP),which is a natural generalization of the typical k-means problem.In this problem,we have a set D of client points in R^(d),a set F of possible centers in R^(d),and a penalty cost Pj>O for each point j∈D.We are also given an integer k which is the size of the center point set.We want to find a center point set S■F with size k,choose a penalized subset of clients P■D,and assign every client in D\P to its open center.Our goal is to minimize the sum of the squared distances between every point in D\P to its assigned centre point and the sum of the penalty costs for all clients in P.By using the multi-swap local search technique and under the fixed-dimensional Euclidean space setting,we present a polynomial-time approximation scheme(PTAS)for the k-MPWP. 展开更多
关键词 Approximation algorithm k-means Local search PENALTY
原文传递
Optimization of Product Distribution for MIP Units Using Data Mining
19
作者 Wang Qing Zhang Xiaoguo +4 位作者 Mei Junwei Gao Zhibo Yang Kuizhi Yang Dawei Ouyang Fusheng 《China Petroleum Processing & Petrochemical Technology》 SCIE CAS CSCD 2024年第2期146-157,共12页
Based on data from a petrochemical company’s MIP unit over the past three years,19 input variables and 2 output variables were selected for modeling using the maximum information coefficient and Pearson correlation c... Based on data from a petrochemical company’s MIP unit over the past three years,19 input variables and 2 output variables were selected for modeling using the maximum information coefficient and Pearson correlation coefficient among 155 variables,which included properties of feedstock oil and spent catalyst,operational variables,and material flows.The distillation range variables were reduced using factor analysis,and the feedstock oils were clustered into three types using the K-means++algorithm.Each feedstock oil type was then used as an input variable for modeling.An XGBoost model and a back propagation(BP)neural network model with a structure of 20-15-15-2 were developed to predict the combined yield of gasoline and propylene,as well as the coke yield.In the test set,the BP neural network model demonstrated better fitting and generalization abilities with a mean absolute percentage error and determination coefficient of 1.48%and 0.738,respectively,compared to the XGBoost model.It was therefore chosen for further optimization work.The genetic algorithm was utilized to optimize operational variables in order to increase the combined yield of gasoline and propylene while controlling the growth of coke yield.Seven commercial test results in the MIP unit showed an average increase of 1.39 percentage points for the combined yield of gasoline and propylene and an average decrease of 0.11 percentage points for coke yield.These results indicate that the model effectively improves the combined yield of gasoline and propylene while controlling the increase in coke yield. 展开更多
关键词 MIP process k-means++ BP neural network model XGBoost algorithm genetic algorithm
下载PDF
Exploring Motor Imagery EEG: Enhanced EEG Microstate Analysis with GMD-Driven Density Canopy Method
20
作者 Xin Xiong Jing Zhang +3 位作者 Sanli Yi Chunwu Wang Ruixiang Liu Jianfeng He 《Computers, Materials & Continua》 SCIE EI 2024年第6期4659-4681,共23页
The analysis of microstates in EEG signals is a crucial technique for understanding the spatiotemporal dynamics of brain electrical activity.Traditional methods such as Atomic Agglomerative Hierarchical Clustering(AAH... The analysis of microstates in EEG signals is a crucial technique for understanding the spatiotemporal dynamics of brain electrical activity.Traditional methods such as Atomic Agglomerative Hierarchical Clustering(AAHC),K-means clustering,Principal Component Analysis(PCA),and Independent Component Analysis(ICA)are limited by a fixed number of microstate maps and insufficient capability in cross-task feature extraction.Tackling these limitations,this study introduces a Global Map Dissimilarity(GMD)-driven density canopy K-means clustering algorithm.This innovative approach autonomously determines the optimal number of EEG microstate topographies and employs Gaussian kernel density estimation alongside the GMD index for dynamic modeling of EEG data.Utilizing this advanced algorithm,the study analyzes the Motor Imagery(MI)dataset from the GigaScience database,GigaDB.The findings reveal six distinct microstates during actual right-hand movement and five microstates across other task conditions,with microstate C showing superior performance in all task states.During imagined movement,microstate A was significantly enhanced.Comparison with existing algorithms indicates a significant improvement in clustering performance by the refined method,with an average Calinski-Harabasz Index(CHI)of 35517.29 and a Davis-Bouldin Index(DBI)average of 2.57.Furthermore,an information-theoretical analysis of the microstate sequences suggests that imagined movement exhibits higher complexity and disorder than actual movement.By utilizing the extracted microstate sequence parameters as features,the improved algorithm achieved a classification accuracy of 98.41%in EEG signal categorization for motor imagery.A performance of 78.183%accuracy was achieved in a four-class motor imagery task on the BCI-IV-2a dataset.These results demonstrate the potential of the advanced algorithm in microstate analysis,offering a more effective tool for a deeper understanding of the spatiotemporal features of EEG signals. 展开更多
关键词 EEG microstate motor imagery k-means clustering algorithm gaus sian kernel function shannon entropy Lempel-Ziv complexity
下载PDF
上一页 1 2 5 下一页 到第
使用帮助 返回顶部