期刊文献+
共找到8,638篇文章
< 1 2 250 >
每页显示 20 50 100
Comprehensive K-Means Clustering
1
作者 Ethan Xiao 《Journal of Computer and Communications》 2024年第3期146-159,共14页
The k-means algorithm is a popular data clustering technique due to its speed and simplicity. However, it is susceptible to issues such as sensitivity to the chosen seeds, and inaccurate clusters due to poor initial s... The k-means algorithm is a popular data clustering technique due to its speed and simplicity. However, it is susceptible to issues such as sensitivity to the chosen seeds, and inaccurate clusters due to poor initial seeds, particularly in complex datasets or datasets with non-spherical clusters. In this paper, a Comprehensive K-Means Clustering algorithm is presented, in which multiple trials of k-means are performed on a given dataset. The clustering results from each trial are transformed into a five-dimensional data point, containing the scope values of the x and y coordinates of the clusters along with the number of points within that cluster. A graph is then generated displaying the configuration of these points using Principal Component Analysis (PCA), from which we can observe and determine the common clustering patterns in the dataset. The robustness and strength of these patterns are then examined by observing the variance of the results of each trial, wherein a different subset of the data keeping a certain percentage of original data points is clustered. By aggregating information from multiple trials, we can distinguish clusters that consistently emerge across different runs from those that are more sensitive or unlikely, hence deriving more reliable conclusions about the underlying structure of complex datasets. Our experiments show that our algorithm is able to find the most common associations between different dimensions of data over multiple trials, often more accurately than other algorithms, as well as measure stability of these clusters, an ability that other k-means algorithms lack. 展开更多
关键词 k-means clustering
下载PDF
基于K-means/RPF的大型遮蔽空间人员定位算法
2
作者 白泽坤 苏中 吴学佳 《传感器与微系统》 北大核心 2025年第1期157-160,164,共5页
针对大型遮蔽空间惯性/地图匹配算法中粒子贫化和子粒子群迷路效应导致定位精度下降的问题,提出一种基于K-means聚类的回溯粒子滤波(RPF)人员定位算法。首先,用行人航位推算(PDR)中航向更新、步频检测及步长估计得到初始运动轨迹;然后,... 针对大型遮蔽空间惯性/地图匹配算法中粒子贫化和子粒子群迷路效应导致定位精度下降的问题,提出一种基于K-means聚类的回溯粒子滤波(RPF)人员定位算法。首先,用行人航位推算(PDR)中航向更新、步频检测及步长估计得到初始运动轨迹;然后,设计RPF算法,提高存活粒子有效性和多样性,缓解粒子贫化,提高人员定位精度;最后,通过K-means聚类算法解决子粒子群的迷路效应,修正人员轨迹出现在非可行域的现象。实验结果表明:本文算法抑制了粒子贫化和子粒子群迷路效应,人员平均定位误差相比惯性定位和标准粒子滤波降低了81.20%和51.48%。 展开更多
关键词 大型遮蔽空间 k-means聚类 回溯粒子滤波 粒子贫化 迷路效应
下载PDF
Improved k-means clustering algorithm 被引量:16
3
作者 夏士雄 李文超 +2 位作者 周勇 张磊 牛强 《Journal of Southeast University(English Edition)》 EI CAS 2007年第3期435-438,共4页
In allusion to the disadvantage of having to obtain the number of clusters of data sets in advance and the sensitivity to selecting initial clustering centers in the k-means algorithm, an improved k-means clustering a... In allusion to the disadvantage of having to obtain the number of clusters of data sets in advance and the sensitivity to selecting initial clustering centers in the k-means algorithm, an improved k-means clustering algorithm is proposed. First, the concept of a silhouette coefficient is introduced, and the optimal clustering number Kopt of a data set with unknown class information is confirmed by calculating the silhouette coefficient of objects in clusters under different K values. Then the distribution of the data set is obtained through hierarchical clustering and the initial clustering-centers are confirmed. Finally, the clustering is completed by the traditional k-means clustering. By the theoretical analysis, it is proved that the improved k-means clustering algorithm has proper computational complexity. The experimental results of IRIS testing data set show that the algorithm can distinguish different clusters reasonably and recognize the outliers efficiently, and the entropy generated by the algorithm is lower. 展开更多
关键词 clustering k-means algorithm silhouette coefficient
下载PDF
基于改进K-means聚类算法的分布式储能集群划分方法
4
作者 刘春雨 《东北电力技术》 2025年第1期1-5,共5页
随着规模化分布式电源及储能的接入,配电网的功率返送、节点过电压等问题愈加显著,对电网规划、运行监视和调度控制等造成一定影响,也不利于储能大范围发展。为此提出一种适用于规模化分布式储能的集群划分方法,基于功率节点电压灵敏度... 随着规模化分布式电源及储能的接入,配电网的功率返送、节点过电压等问题愈加显著,对电网规划、运行监视和调度控制等造成一定影响,也不利于储能大范围发展。为此提出一种适用于规模化分布式储能的集群划分方法,基于功率节点电压灵敏度的电气距离模块度指标,对经典K-means算法进行改进,设计节点指数法、肘部法则优化初始聚类中心选择和集群数确定。以IEEE33系统算例进行验证,结果表明所提集群划分方法具有较强的电气耦合性、准确性和运算效率。 展开更多
关键词 分布式储能 集群划分 配电网结构 k-means聚类算法 划分指标
下载PDF
我国公共图书馆发展水平的K-means聚类分析
5
作者 熊朝松 《江苏科技信息》 2025年第2期71-74,115,共5页
文章首先基于第七次全国县级以上公共图书馆评估定级上等级馆名单数据,统计我国31个省市自治区公共图书馆进入一级图书馆、二级图书馆、三级图书馆以及未定等级数量,计算出比例;随后基于比例数据,利用K-means聚类分析算法对我国31个省... 文章首先基于第七次全国县级以上公共图书馆评估定级上等级馆名单数据,统计我国31个省市自治区公共图书馆进入一级图书馆、二级图书馆、三级图书馆以及未定等级数量,计算出比例;随后基于比例数据,利用K-means聚类分析算法对我国31个省市自治区进行数据分析,用来研究我国公共图书馆发展水平情况。数据分析结果显示,我国31个省市自治区公共图书馆发展水平可以分为3个档次:领先水平,包括北京等8地;中等水平,包括河北等17地;落后水平,包括辽宁等6地。最后,文章根据聚类分析结果提出了助力我国公共图书馆高质量均衡发展的相关建议与看法。 展开更多
关键词 公共图书馆 发展水平 k-means聚类分析
下载PDF
基于K-means的海南物流职业本科学生学习行为分析
6
作者 李芸嘉 《物流科技》 2025年第3期178-181,共4页
在现代物流行业中,培养具备全面专业知识和实践能力的物流职业本科学生至关重要。为了更好地了解学生在学习过程中的行为和学习习惯,文章将采用K-means聚类模型方法,对现代物流管理(职业本科)学生的学习行为进行深入探索与分类。通过这... 在现代物流行业中,培养具备全面专业知识和实践能力的物流职业本科学生至关重要。为了更好地了解学生在学习过程中的行为和学习习惯,文章将采用K-means聚类模型方法,对现代物流管理(职业本科)学生的学习行为进行深入探索与分类。通过这项研究,旨在为优化物流教育、提高学生学习效果提供有益的参考建议。文章将介绍研究设计、数据收集与处理方法,并对结果进行解析与讨论。 展开更多
关键词 物流 职业本科 学情分析 k-means聚类
下载PDF
Hierarchical hesitant fuzzy K-means clustering algorithm 被引量:21
7
作者 CHEN Na XU Ze-shui XIA Mei-mei 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2014年第1期1-17,共17页
Due to the limitation and hesitation in one's knowledge, the membership degree of an element to a given set usually has a few different values, in which the conventional fuzzy sets are invalid. Hesitant fuzzy sets ar... Due to the limitation and hesitation in one's knowledge, the membership degree of an element to a given set usually has a few different values, in which the conventional fuzzy sets are invalid. Hesitant fuzzy sets are a powerful tool to treat this case. The present paper focuses on investigating the clustering technique for hesitant fuzzy sets based on the K-means clustering algorithm which takes the results of hierarchical clustering as the initial clusters. Finally, two examples demonstrate the validity of our algorithm. 展开更多
关键词 90B50 68T10 62H30 Hesitant fuzzy set hierarchical clustering k-means clustering intuitionisitc fuzzy set
下载PDF
An efficient enhanced k-means clustering algorithm 被引量:30
8
作者 FAHIM A.M SALEM A.M +1 位作者 TORKEY F.A RAMADAN M.A 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2006年第10期1626-1633,共8页
In k-means clustering, we are given a set of n data points in d-dimensional space R^d and an integer k and the problem is to determine a set of k points in R^d, called centers, so as to minimize the mean squared dista... In k-means clustering, we are given a set of n data points in d-dimensional space R^d and an integer k and the problem is to determine a set of k points in R^d, called centers, so as to minimize the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call enhanced k-means algorithm. This algorithm is easy to implement, requiring a simple data structure to keep some information in each iteration to be used in the next iteration. Our experimental results demonstrated that our scheme can improve the computational speed of the k-means algorithm by the magnitude in the total number of distance calculations and the overall time of computation. 展开更多
关键词 clustering algorithms cluster analysis k-means algorithm Data analysis
下载PDF
Blind source separation by weighted K-means clustering 被引量:5
9
作者 Yi Qingming 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2008年第5期882-887,共6页
Blind separation of sparse sources (BSSS) is discussed. The BSSS method based on the conventional K-means clustering is very fast and is also easy to implement. However, the accuracy of this method is generally not ... Blind separation of sparse sources (BSSS) is discussed. The BSSS method based on the conventional K-means clustering is very fast and is also easy to implement. However, the accuracy of this method is generally not satisfactory. The contribution of the vector x(t) with different modules is theoretically proved to be unequal, and a weighted K-means clustering method is proposed on this grounds. The proposed algorithm is not only as fast as the conventional K-means clustering method, but can also achieve considerably accurate results, which is demonstrated by numerical experiments. 展开更多
关键词 blind source separation underdetermined mixing sparse representation weighted k-means clustering.
下载PDF
K-MEANS CLUSTERING FOR CLASSIFICATION OF THE NORTHWESTERN PACIFIC TROPICAL CYCLONE TRACKS 被引量:4
10
作者 余锦华 郑颖青 +2 位作者 吴启树 林金凎 龚振彬 《Journal of Tropical Meteorology》 SCIE 2016年第2期127-135,共9页
Based on the Joint Typhoon Warning Center(JTWC) best-track dataset between 1965 and 2009 and the characteristic parameters including tropical cyclone(TC) position,intensity,path length and direction,a method for objec... Based on the Joint Typhoon Warning Center(JTWC) best-track dataset between 1965 and 2009 and the characteristic parameters including tropical cyclone(TC) position,intensity,path length and direction,a method for objective classification of the Northwestern Pacific tropical cyclone tracks is established by using k-means Clustering.The TC lifespan,energy,active season and landfall probability of seven clusters of tropical cyclone tracks are comparatively analyzed.The characteristics of these parameters are quite different among different tropical cyclone track clusters.From the trend of the past two decades,the frequency of the western recurving cluster(accounting for 21.3% of the total) increased,and the lifespan elongated slightly,which differs from the other clusters.The annual variation of the Power Dissipation Index(PDI) of most clusters mainly depended on the TC intensity and frequency.However,the annual variation of the PDI in the northwestern moving then recurving cluster and the pelagic west-northwest moving cluster mainly depended on the frequency. 展开更多
关键词 tropical cyclone classification of tracks k-means clustering character of cluster
下载PDF
Optimization of constitutive parameters of foundation soils k-means clustering analysis 被引量:7
11
作者 Muge Elif Orakoglu Cevdet Emin Ekinci 《Research in Cold and Arid Regions》 CSCD 2013年第5期626-636,共11页
The goal of this study was to optimize the constitutive parameters of foundation soils using a k-means algorithm with clustering analysis. A database was collected from unconfined compression tests, Proctor tests and ... The goal of this study was to optimize the constitutive parameters of foundation soils using a k-means algorithm with clustering analysis. A database was collected from unconfined compression tests, Proctor tests and grain distribution tests of soils taken from three different types of foundation pits: raft foundations, partial raft foundations and strip foundations. k-means algorithm with clustering analysis was applied to determine the most appropriate foundation type given the un- confined compression strengths and other parameters of the different soils. 展开更多
关键词 foundation soil regression model k-means clustering analysis
下载PDF
Geochemical and Geostatistical Studies for Estimating Gold Grade in Tarq Prospect Area by K-Means Clustering Method 被引量:7
12
作者 Adel Shirazy Aref Shirazi +1 位作者 Mohammad Hossein Ferdossi Mansour Ziaii 《Open Journal of Geology》 2019年第6期306-326,共21页
Tarq geochemical 1:100,000 Sheet is located in Isfahan province which is investigated by Iran’s Geological and Explorations Organization using stream sediment analyzes. This area has stratigraphy of Precambrian to Qu... Tarq geochemical 1:100,000 Sheet is located in Isfahan province which is investigated by Iran’s Geological and Explorations Organization using stream sediment analyzes. This area has stratigraphy of Precambrian to Quaternary rocks and is located in the Central Iran zone. According to the presence of signs of gold mineralization in this area, it is necessary to identify important mineral areas in this area. Therefore, finding information is necessary about the relationship and monitoring the elements of gold, arsenic, and antimony relative to each other in this area to determine the extent of geochemical halos and to estimate the grade. Therefore, a well-known and useful K-means method is used for monitoring the elements in the present study, this is a clustering method based on minimizing the total Euclidean distances of each sample from the center of the classes which are assigned to them. In this research, the clustering quality function and the utility rate of the sample have been used in the desired cluster (S(i)) to determine the optimum number of clusters. Finally, with regard to the cluster centers and the results, the equations were used to predict the amount of the gold element based on four parameters of arsenic and antimony grade, length and width of sampling points. 展开更多
关键词 GOLD Tarq k-means clustering Method Estimation of the ELEMENTS GRADE k-means
下载PDF
Classification of Northeast China Cold Vortex Activity Paths in Early Summer Based on K-means Clustering and Their Climate Impact 被引量:11
13
作者 Yihe FANG Haishan CHEN +3 位作者 Yi LIN Chunyu ZHAO Yitong LIN Fang ZHOU 《Advances in Atmospheric Sciences》 SCIE CAS CSCD 2021年第3期400-412,共13页
The classification of the Northeast China Cold Vortex(NCCV)activity paths is an important way to analyze its characteristics in detail.Based on the daily precipitation data of the northeastern China(NEC)region,and the... The classification of the Northeast China Cold Vortex(NCCV)activity paths is an important way to analyze its characteristics in detail.Based on the daily precipitation data of the northeastern China(NEC)region,and the atmospheric circulation field and temperature field data of ERA-Interim for every six hours,the NCCV processes during the early summer(June)seasons from 1979 to 2018 were objectively identified.Then,the NCCV processes were classified using a machine learning method(k-means)according to the characteristic parameters of the activity path information.The rationality of the classification results was verified from two aspects,as follows:(1)the atmospheric circulation configuration of the NCCV on various paths;and(2)its influences on the climate conditions in the NEC.The obtained results showed that the activity paths of the NCCV could be divided into four types according to such characteristics as the generation origin,movement direction,and movement velocity of the NCCV.These included the generation-eastward movement type in the east of the Mongolia Plateau(eastward movement type or type A);generation-southeast longdistance movement type in the upstream of the Lena River(southeast long-distance movement type or type B);generationeastward less-movement type near Lake Baikal(eastward less-movement type or type C);and the generation-southward less-movement type in eastern Siberia(southward less-movement type or type D).There were obvious differences observed in the atmospheric circulation configuration and the climate impact of the NCCV on the four above-mentioned types of paths,which indicated that the classification results were reasonable. 展开更多
关键词 northeastern China early summer Northeast China Cold Vortex classification of activity paths machine learning method k-means clustering high-pressure blocking
下载PDF
Application of Self-Organizing Feature Map Neural Network Based on K-means Clustering in Network Intrusion Detection 被引量:5
14
作者 Ling Tan Chong Li +1 位作者 Jingming Xia Jun Cao 《Computers, Materials & Continua》 SCIE EI 2019年第7期275-288,共14页
Due to the widespread use of the Internet,customer information is vulnerable to computer systems attack,which brings urgent need for the intrusion detection technology.Recently,network intrusion detection has been one... Due to the widespread use of the Internet,customer information is vulnerable to computer systems attack,which brings urgent need for the intrusion detection technology.Recently,network intrusion detection has been one of the most important technologies in network security detection.The accuracy of network intrusion detection has reached higher accuracy so far.However,these methods have very low efficiency in network intrusion detection,even the most popular SOM neural network method.In this paper,an efficient and fast network intrusion detection method was proposed.Firstly,the fundamental of the two different methods are introduced respectively.Then,the selforganizing feature map neural network based on K-means clustering(KSOM)algorithms was presented to improve the efficiency of network intrusion detection.Finally,the NSLKDD is used as network intrusion data set to demonstrate that the KSOM method can significantly reduce the number of clustering iteration than SOM method without substantially affecting the clustering results and the accuracy is much higher than Kmeans method.The Experimental results show that our method can relatively improve the accuracy of network intrusion and significantly reduce the number of clustering iteration. 展开更多
关键词 k-means clustering self-organizing feature map neural network network security intrusion detection NSL-KDD data set
下载PDF
Investigation of the J-TEXT plasma events by k-means clustering algorithm 被引量:1
15
作者 李建超 张晓卿 +11 位作者 张昱 Abba Alhaji BALA 柳惠平 周帼红 王能超 李达 陈忠勇 杨州军 陈志鹏 董蛟龙 丁永华 the J-TEXT Team 《Plasma Science and Technology》 SCIE EI CAS CSCD 2023年第8期38-43,共6页
Various types of plasma events emerge in specific parameter ranges and exhibit similar characteristics in diagnostic signals,which can be applied to identify these events.A semisupervised machine learning algorithm,th... Various types of plasma events emerge in specific parameter ranges and exhibit similar characteristics in diagnostic signals,which can be applied to identify these events.A semisupervised machine learning algorithm,the k-means clustering algorithm,is utilized to investigate and identify plasma events in the J-TEXT plasma.This method can cluster diverse plasma events with homogeneous features,and then these events can be identified if given few manually labeled examples based on physical understanding.A survey of clustered events reveals that the k-means algorithm can make plasma events(rotating tearing mode,sawtooth oscillations,and locked mode)gathering in Euclidean space composed of multi-dimensional diagnostic data,like soft x-ray emission intensity,edge toroidal rotation velocity,the Mirnov signal amplitude and so on.Based on the cluster analysis results,an approximate analytical model is proposed to rapidly identify plasma events in the J-TEXT plasma.The cluster analysis method is conducive to data markers of massive diagnostic data. 展开更多
关键词 k-means cluster analysis plasma event machine learning
下载PDF
K-means Find Density Peaks in Molecular Conformation Clustering 被引量:1
16
作者 Guiyan Wang Ting Fu +5 位作者 Hong Ren Peijun Xu Qiuhan Guo Xiaohong Mou Yan Li Guohui Li 《Chinese Journal of Chemical Physics》 SCIE EI CAS CSCD 2022年第2期353-368,I0026-I0030,I0003,共22页
Performing cluster analysis on molecular conformation is an important way to find the representative conformation in the molecular dynamics trajectories.Usually,it is a critical step for interpreting complex conformat... Performing cluster analysis on molecular conformation is an important way to find the representative conformation in the molecular dynamics trajectories.Usually,it is a critical step for interpreting complex conformational changes or interaction mechanisms.As one of the density-based clustering algorithms,find density peaks(FDP)is an accurate and reasonable candidate for the molecular conformation clustering.However,facing the rapidly increasing simulation length due to the increase in computing power,the low computing efficiency of FDP limits its application potential.Here we propose a marginal extension to FDP named K-means find density peaks(KFDP)to solve the mass source consuming problem.In KFDP,the points are initially clustered by a high efficiency clustering algorithm,such as K-means.Cluster centers are defined as typical points with a weight which represents the cluster size.Then,the weighted typical points are clustered again by FDP,and then are refined as core,boundary,and redefined halo points.In this way,KFDP has comparable accuracy as FDP but its computational complexity is reduced from O(n^(2))to O(n).We apply and test our KFDP method to the trajectory data of multiple small proteins in terms of torsion angle,secondary structure or contact map.The comparing results with K-means and density-based spatial clustering of applications with noise show the validation of the proposed KFDP. 展开更多
关键词 k-means find density peaks Molecular clustering Density-based spatial clustering of applications with noise
下载PDF
基于K-means聚类和BP神经网络的电梯能耗实时监测方法 被引量:2
17
作者 彭诚 《通化师范学院学报》 2024年第4期50-56,共7页
针对现有方法在对电梯能耗进行监测时,存在监测精度低、用时长、监测结果不理想的问题,该文提出一种基于K-means聚类算法和BP神经网络相结合的电梯能耗实时监测方法 .在经过清洗的能耗数据中提取影响建筑能耗实时监测的主要因素特征值,... 针对现有方法在对电梯能耗进行监测时,存在监测精度低、用时长、监测结果不理想的问题,该文提出一种基于K-means聚类算法和BP神经网络相结合的电梯能耗实时监测方法 .在经过清洗的能耗数据中提取影响建筑能耗实时监测的主要因素特征值,利用相似系数法进行相似度计算,获取相似系数.对相似电梯能耗数据进行小波分解获取高低频序列,分别采用LSSVM-GSA检测方法和均方加权处理方法对低频和高频部分进行处理,将两个结果进行重构,得到最终的实时监测结果 .仿真实验结果表明:所提方法能够获取高精度、低耗时、高稳定性的监测结果 . 展开更多
关键词 电梯能耗 k-means聚类算法 BP神经网络 数据清洗
下载PDF
光伏波动平抑下改进K-means的电池储能动态分组控制策略 被引量:2
18
作者 余洋 陆文韬 +3 位作者 陈东阳 刘霡 夏雨星 郑晓明 《电力系统保护与控制》 EI CSCD 北大核心 2024年第7期1-11,共11页
针对电池储能系统(battery energy storage system,BESS)进行光伏波动平抑时寿命损耗高及荷电状态(state of charge,SOC)一致性差的问题,提出了光伏波动平抑下改进K-means的BESS动态分组控制策略。首先,采用最小最大调度方法获取光伏并... 针对电池储能系统(battery energy storage system,BESS)进行光伏波动平抑时寿命损耗高及荷电状态(state of charge,SOC)一致性差的问题,提出了光伏波动平抑下改进K-means的BESS动态分组控制策略。首先,采用最小最大调度方法获取光伏并网指令。其次,设计了改进侏儒猫鼬优化算法(improved dwarf mongoose optimizer,IDMO),并利用它对传统K-means聚类算法进行改进,加快了聚类速度。接着,制定了电池单元动态分组原则,并根据电池单元SOC利用改进K-means将其分为3个电池组。然后,设计了基于充放电函数的电池单元SOC一致性功率分配方法,并据此提出BESS双层功率分配策略,上层确定电池组充放电顺序及指令,下层计算电池单元充放电指令。对所提策略进行仿真验证,结果表明,所设计的IDMO具有更高的寻优精度及更快的寻优速度。所提BESS平抑光伏波动策略在有效平抑波动的同时,降低了BESS运行寿命损耗并提高了电池单元SOC的均衡性。 展开更多
关键词 电池储能系统 波动平抑 功率分配 改进侏儒猫鼬优化算法 改进k-means聚类算法
下载PDF
A Low Energy Consumption Clustering Routing Protocol Based on K-Means 被引量:1
19
作者 Shan Zhong Guihua Wang +3 位作者 Xiaohui Leng Xiaona Wang Lian Xue Yue Gu 《Journal of Software Engineering and Applications》 2012年第12期1013-1015,共3页
A low energy consumption clustering algorism based on LEACH is researched. Firstly, A method for obtaining the number of clusters k is given, then using the K-Means algorism to divide all the sensor nodes in WSN to th... A low energy consumption clustering algorism based on LEACH is researched. Firstly, A method for obtaining the number of clusters k is given, then using the K-Means algorism to divide all the sensor nodes in WSN to the k clusters, in the whole network life cycle the cluster will not changed, and when the energy of cluster head is lower than some threshold, it will be changed. The data transmission between cluster heads to sink node use mixed model, namely, the cluster near the sink node using the single hop and the one far from it using the multiple hops. The experiment shows our method has the longer life cycle and more received information. It is an effective clustering protocol. 展开更多
关键词 clustering ROUTING LOW ENERGY CONSUMPTION k-means
下载PDF
基于K-means聚类和特征空间增强的噪声标签深度学习算法 被引量:2
20
作者 吕佳 邱小龙 《智能系统学报》 CSCD 北大核心 2024年第2期267-277,共11页
深度学习中神经网络的性能依赖于高质量的样本,然而噪声标签会降低网络的分类准确率。为降低噪声标签对网络性能的影响,噪声标签学习算法被提出。该算法首先将训练样本集划分成干净样本集和噪声样本集,然后使用半监督学习算法对噪声样... 深度学习中神经网络的性能依赖于高质量的样本,然而噪声标签会降低网络的分类准确率。为降低噪声标签对网络性能的影响,噪声标签学习算法被提出。该算法首先将训练样本集划分成干净样本集和噪声样本集,然后使用半监督学习算法对噪声样本集赋予伪标签。然而,错误的伪标签以及训练样本数量不足的问题仍然限制着噪声标签学习算法性能的提升。为解决上述问题,提出基于K-means聚类和特征空间增强的噪声标签深度学习算法。首先,该算法利用K-means聚类算法对干净样本集进行标签聚类,并根据噪声样本集与聚类中心的距离大小筛选出难以分类的噪声样本,以提高训练样本的质量;其次,使用mixup算法扩充干净样本集和噪声样本集,以增加训练样本的数量;最后,采用特征空间增强算法抑制mixup算法新生成的噪声样本,从而提高网络的分类准确率。并在CIFAR10、CIFAR100、MNIST和ANIMAL-10共4个数据集上试验验证了该算法的有效性。 展开更多
关键词 噪声标签学习 深度学习 半监督学习 机器学习 神经网络 k-means聚类 特征空间增强 mixup算法
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部