期刊文献+
共找到2,201篇文章
< 1 2 111 >
每页显示 20 50 100
ALLIED FUZZY c-MEANS CLUSTERING MODEL 被引量:2
1
作者 武小红 周建江 《Transactions of Nanjing University of Aeronautics and Astronautics》 EI 2006年第3期208-213,共6页
A novel model of fuzzy clustering, i.e. an allied fuzzy c means (AFCM) model is proposed based on the combination of advantages of fuzzy c means (FCM) and possibilistic c means (PCM) clustering. PCM is sensitive... A novel model of fuzzy clustering, i.e. an allied fuzzy c means (AFCM) model is proposed based on the combination of advantages of fuzzy c means (FCM) and possibilistic c means (PCM) clustering. PCM is sensitive to initializations and often generates coincident clusters. AFCM overcomes this shortcoming and it is an ex tension of PCM. Membership and typicality values can be simultaneously produced in AFCM. Experimental re- suits show that noise data can be well processed, coincident clusters are avoided and clustering accuracy is better. 展开更多
关键词 fuzzy c-means clustering possibilistic c means clustering allied fuzzy c-means clustering
下载PDF
A New Method of Wind Turbine Bearing Fault Diagnosis Based on Multi-Masking Empirical Mode Decomposition and Fuzzy C-Means Clustering 被引量:11
2
作者 Yongtao Hu Shuqing Zhang +3 位作者 Anqi Jiang Liguo Zhang Wanlu Jiang Junfeng Li 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2019年第3期156-167,共12页
Based on Multi-Masking Empirical Mode Decomposition (MMEMD) and fuzzy c-means (FCM) clustering, a new method of wind turbine bearing fault diagnosis FCM-MMEMD is proposed, which can determine the fault accurately and ... Based on Multi-Masking Empirical Mode Decomposition (MMEMD) and fuzzy c-means (FCM) clustering, a new method of wind turbine bearing fault diagnosis FCM-MMEMD is proposed, which can determine the fault accurately and timely. First, FCM clustering is employed to classify the data into different clusters, which helps to estimate whether there is a fault and how many fault types there are. If fault signals exist, the fault vibration signals are then demodulated and decomposed into different frequency bands by MMEMD in order to be analyzed further. In order to overcome the mode mixing defect of empirical mode decomposition (EMD), a novel method called MMEMD is proposed. It is an improvement to masking empirical mode decomposition (MEMD). By adding multi-masking signals to the signals to be decomposed in different levels, it can restrain low-frequency components from mixing in highfrequency components effectively in the sifting process and then suppress the mode mixing. It has the advantages of easy implementation and strong ability of suppressing modal mixing. The fault type is determined by Hilbert envelope finally. The results of simulation signal decomposition showed the high performance of MMEMD. Experiments of bearing fault diagnosis in wind turbine bearing fault diagnosis proved the validity and high accuracy of the new method. 展开更多
关键词 Wind TURBINE BEARING FAULTS diagnosis Multi-masking empirical mode decomposition (MMEMD) fuzzy c-mean (FcM) clustering
下载PDF
A Fixed Suppressed Rate Selection Method for Suppressed Fuzzy C-Means Clustering Algorithm 被引量:2
3
作者 Jiulun Fan Jing Li 《Applied Mathematics》 2014年第8期1275-1283,共9页
Suppressed fuzzy c-means (S-FCM) clustering algorithm with the intention of combining the higher speed of hard c-means clustering algorithm and the better classification performance of fuzzy c-means clustering algorit... Suppressed fuzzy c-means (S-FCM) clustering algorithm with the intention of combining the higher speed of hard c-means clustering algorithm and the better classification performance of fuzzy c-means clustering algorithm had been studied by many researchers and applied in many fields. In the algorithm, how to select the suppressed rate is a key step. In this paper, we give a method to select the fixed suppressed rate by the structure of the data itself. The experimental results show that the proposed method is a suitable way to select the suppressed rate in suppressed fuzzy c-means clustering algorithm. 展开更多
关键词 HARD c-means clustering ALGORITHM fuzzy c-means clustering ALGORITHM Suppressed fuzzy c-means clustering ALGORITHM Suppressed RATE
下载PDF
Paraspinal Muscle Segmentation in CT Images Using GSM-Based Fuzzy C-Means Clustering
4
作者 Yong Wei Xiuping Tao +1 位作者 Bin Xu Arend P. Castelein 《Journal of Computer and Communications》 2014年第9期70-77,共8页
Minimally Invasive Spine surgery (MISS) was developed to treat disorders of the spine with less disruption to the muscles. Surgeons use CT images to monitor the volume of muscles after operation in order to evaluate t... Minimally Invasive Spine surgery (MISS) was developed to treat disorders of the spine with less disruption to the muscles. Surgeons use CT images to monitor the volume of muscles after operation in order to evaluate the progress of patient recovery. The first step in the task is to segment the muscle regions from other tissues/organs in CT images. However, manual segmentation of muscle regions is not only inaccurate, but also time consuming. In this work, Gray Space Map (GSM) is used in fuzzy c-means clustering algorithm to segment muscle regions in CT images. GSM com- bines both spatial and intensity information of pixels. Experiments show that the proposed GSM- based fuzzy c-means clustering muscle CT image segmentation yields very good results. 展开更多
关键词 cT Image SEGMENTATION Gray Space Map (GSM) fuzzy c-means clustering MINIMALLY Invasive SPINE Surgery (MISS)
下载PDF
Automated Colorization of Grayscale Images Using Texture Descriptors and a Modified Fuzzy C-Means Clustering
5
作者 Christophe Gauge Sreela Sasi 《Journal of Intelligent Learning Systems and Applications》 2012年第2期135-143,共9页
A novel example-based process for Automated Colorization of grayscale images using Texture Descriptors (ACTD) without any human intervention is proposed. By analyzing a set of sample color images, coherent regions of ... A novel example-based process for Automated Colorization of grayscale images using Texture Descriptors (ACTD) without any human intervention is proposed. By analyzing a set of sample color images, coherent regions of homogeneous textures are extracted. A multi-channel filtering technique is used for texture-based image segmentation, combined with a modified Fuzzy C-means (FCM) clustering algorithm. This modified FCM clustering algorithm includes both the local spatial information from neighboring pixels, and the spatial Euclidian distance to the cluster’s center of gravity. For each area of interest, state-of-the-art texture descriptors are then computed and stored, along with corresponding color information. These texture descriptors and the color information are used for colorization of a grayscale image with similar textures. Given a grayscale image to be colorized, the segmentation and feature extraction processes are repeated. The texture descriptors are used to perform Content-Based Image Retrieval (CBIR). The colorization process is performed by Chroma replacement. This research finds numerous applications, ranging from classic film restoration and enhancement, to adding valuable information into medical and satellite imaging. Also, this can be used to enhance the detection of objects from x-ray images at the airports. 展开更多
关键词 Image Processing Pattern Recognition cOMPUTER VISION fuzzy c-means clustering GABOR
下载PDF
Hybrid Clustering Using Firefly Optimization and Fuzzy C-Means Algorithm
6
作者 Krishnamoorthi Murugasamy Kalamani Murugasamy 《Circuits and Systems》 2016年第9期2339-2348,共10页
Classifying the data into a meaningful group is one of the fundamental ways of understanding and learning the valuable information. High-quality clustering methods are necessary for the valuable and efficient analysis... Classifying the data into a meaningful group is one of the fundamental ways of understanding and learning the valuable information. High-quality clustering methods are necessary for the valuable and efficient analysis of the increasing data. The Firefly Algorithm (FA) is one of the bio-inspired algorithms and it is recently used to solve the clustering problems. In this paper, Hybrid F-Firefly algorithm is developed by combining the Fuzzy C-Means (FCM) with FA to improve the clustering accuracy with global optimum solution. The Hybrid F-Firefly algorithm is developed by incorporating FCM operator at the end of each iteration in FA algorithm. This proposed algorithm is designed to utilize the goodness of existing algorithm and to enhance the original FA algorithm by solving the shortcomings in the FCM algorithm like the trapping in local optima and sensitive to initial seed points. In this research work, the Hybrid F-Firefly algorithm is implemented and experimentally tested for various performance measures under six different benchmark datasets. From the experimental results, it is observed that the Hybrid F-Firefly algorithm significantly improves the intra-cluster distance when compared with the existing algorithms like K-means, FCM and FA algorithm. 展开更多
关键词 clustering OPTIMIZATION K-means fuzzy c-means Firefly Algorithm F-Firefly
下载PDF
基于K-means聚类及模糊判别的卷烟包灰性能综合评价方法
7
作者 楚文娟 郭丽霞 +5 位作者 程东旭 王红霞 崔廷 冯银龙 王建民 鲁平 《轻工学报》 CAS 北大核心 2024年第6期93-100,共8页
为实现卷烟包灰性能的综合评价和评价结果具象化,以49个卷烟的灰色、裂口率、缩灰率、碳线宽度、碳线整齐度测定结果为原始变量,先运用K-means聚类、模糊判别法将原始变量转换为具象化的得分数据,再运用Critic赋权法赋予各项指标权重,... 为实现卷烟包灰性能的综合评价和评价结果具象化,以49个卷烟的灰色、裂口率、缩灰率、碳线宽度、碳线整齐度测定结果为原始变量,先运用K-means聚类、模糊判别法将原始变量转换为具象化的得分数据,再运用Critic赋权法赋予各项指标权重,建立了一种卷烟包灰性能综合评价方法。结果表明:将原始变量转换成区间为60~100、平均值在80左右的得分,可使评价结果具象化且更加符合认知习惯;5项指标的权重由高到低依次为裂口率(0.27)>缩灰率(0.25)>灰色(0.18)>碳线整齐度(0.16)>碳线宽度(0.14);卷烟包灰性能可划分为优、良、差三档,各档得分区间依次为(85,100]、[75,85]、[60,75);不同档次代表性卷烟的灰柱视觉效果对比结果证明,综合得分可客观反映卷烟包灰性能的优劣。 展开更多
关键词 卷烟 包灰性能 K-means聚类 模糊判别 critic赋权法
下载PDF
Soil pore identification with the adaptive fuzzy C-means method based on computed tomography images 被引量:5
8
作者 Yue Zhao Qiaoling Han +1 位作者 Yandong Zhao Jinhao Liu 《Journal of Forestry Research》 SCIE CAS CSCD 2019年第3期1043-1052,共10页
The complex geometry and topology of soil is widely recognised as the key driver in many ecological processes. X-ray computed tomography (CT) provides insight into the internal structure of soil pores automatically an... The complex geometry and topology of soil is widely recognised as the key driver in many ecological processes. X-ray computed tomography (CT) provides insight into the internal structure of soil pores automatically and accurately. Until recently, there have not been methods to identify soil pore structures. This has restricted the development of soil science, particularly regarding pore geometry and spatial distribution. Through the adoption of the fuzzy clustering theory and the establishment of pore identification rules, a novel pore identification method is described to extract pore structures from CT soil images. The robustness of the adaptive fuzzy C-means method (AFCM), the adaptive threshold method, and Image-Pro Plus tools were compared on soil specimens under different conditions, such as frozen, saturated, and dry situations. The results demonstrate that the AFCM method is suitable for identifying pore clusters, especially tiny pores, under various soil conditions. The method would provide an optional technique for the study of soil micromorphology. 展开更多
关键词 cT soil IMAGES fuzzy c-means fuzzy clustering theory PORE IDENTIFIcATION rule
下载PDF
Kernel method-based fuzzy clustering algorithm 被引量:2
9
作者 WuZhongdong GaoXinbo +1 位作者 XieWeixin YuJianping 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2005年第1期160-166,共7页
The fuzzy C-means clustering algorithm(FCM) to the fuzzy kernel C-means clustering algorithm(FKCM) to effectively perform cluster analysis on the diversiform structures are extended, such as non-hyperspherical data, d... The fuzzy C-means clustering algorithm(FCM) to the fuzzy kernel C-means clustering algorithm(FKCM) to effectively perform cluster analysis on the diversiform structures are extended, such as non-hyperspherical data, data with noise, data with mixture of heterogeneous cluster prototypes, asymmetric data, etc. Based on the Mercer kernel, FKCM clustering algorithm is derived from FCM algorithm united with kernel method. The results of experiments with the synthetic and real data show that the FKCM clustering algorithm is universality and can effectively unsupervised analyze datasets with variform structures in contrast to FCM algorithm. It is can be imagined that kernel-based clustering algorithm is one of important research direction of fuzzy clustering analysis. 展开更多
关键词 fuzzy clustering analysis kernel method fuzzy c-means clustering.
下载PDF
一种改进的 Fuzzy c-means 聚类算法 被引量:4
10
作者 胡钟山 丁震 +2 位作者 杨静宇 唐振民 邬永革 《南京理工大学学报》 EI CAS CSCD 1997年第4期337-340,共4页
该文提出了一种改进的fuzzyc-means算法(MFCM)。此算法是将传统算法(FCM)直接对样本集聚类变为对特征集聚类,从而极大提高了fuzzyc-means的速度。证明了MFCM与FCM在分类效果上的等价性,且... 该文提出了一种改进的fuzzyc-means算法(MFCM)。此算法是将传统算法(FCM)直接对样本集聚类变为对特征集聚类,从而极大提高了fuzzyc-means的速度。证明了MFCM与FCM在分类效果上的等价性,且MFCM较FCM有较低的时间复杂性,讨论了MFCM与FCM空间复杂性的关系。最后数值实验证实了结论。 展开更多
关键词 模糊聚类 模式识别 聚类分析 MFcM
下载PDF
基于空间加权距离的自适应Fuzzy C-Means算法研究 被引量:2
11
作者 王海起 朱锦 王劲峰 《测绘与空间地理信息》 2014年第2期18-21,24,共5页
空间聚类不仅应考虑GIS对象属性特征的相似性,还应考虑对象的空间邻近性。不同属性、位置特征在聚类中起到的作用不同。采用信息熵方法计算空间距离中各属性距离、位置距离的权重,权值大小用于度量相应特征在fuzzy c-means隶属度计算时... 空间聚类不仅应考虑GIS对象属性特征的相似性,还应考虑对象的空间邻近性。不同属性、位置特征在聚类中起到的作用不同。采用信息熵方法计算空间距离中各属性距离、位置距离的权重,权值大小用于度量相应特征在fuzzy c-means隶属度计算时的作用大小,并引入相似性指标,当两个聚类之间的相似度高于某个合并阈值时,则对应的一对聚类进行合并,从而克服需预先设置聚类类数的问题。通过应用实例的聚类有效性分析,与普通空间距离相比,基于空间加权距离的FCM算法具有稳定性和有效性。 展开更多
关键词 fuzzy e—means 空间加权距离 信息熵 自适应聚类合并
下载PDF
Fault Pattern Recognition based on Kernel Method and Fuzzy C-means
12
作者 SUN Yebei ZHAO Rongzhen TANG Xiaobin 《International Journal of Plant Engineering and Management》 2016年第4期231-240,共10页
A method about fault identification is proposed to solve the relationship among fault features of large rotating machinery, which is extremely complicated and nonlinear. This paper studies the rotor test-rig and the c... A method about fault identification is proposed to solve the relationship among fault features of large rotating machinery, which is extremely complicated and nonlinear. This paper studies the rotor test-rig and the clustering of data sets and fault pattern recognitions. The present method firstly maps the data from their original space to a high dimensional Kernel space which makes the highly nonlinear data in low-dimensional space become linearly separable in Kernel space. It highlights the differences among the features of the data set. Then fuzzy C-means (FCM) is conducted in the Kernel space. Each data is assigned to the nearest class by computing the distance to the clustering center. Finally, test set is used to judge the results. The convergence rate and clustering accuracy are better than traditional FCM. The study shows that the method is effective for the accuracy of pattern recognition on rotating machinery. 展开更多
关键词 Kernel method fuzzy c-means FcM pattern recognition clustering
下载PDF
Study on the Development and Implementation of Different Big Data Clustering Methods
13
作者 Jean Pierre Ntayagabiri Jérémie Ndikumagenge +1 位作者 Longin Ndayisaba Boribo Kikunda Philippe 《Open Journal of Applied Sciences》 2023年第7期1163-1177,共15页
Clustering is an unsupervised learning method used to organize raw data in such a way that those with the same (similar) characteristics are found in the same class and those that are dissimilar are found in different... Clustering is an unsupervised learning method used to organize raw data in such a way that those with the same (similar) characteristics are found in the same class and those that are dissimilar are found in different classes. In this day and age, the very rapid increase in the amount of data being produced brings new challenges in the analysis and storage of this data. Recently, there is a growing interest in key areas such as real-time data mining, which reveal an urgent need to process very large data under strict performance constraints. The objective of this paper is to survey four algorithms including K-Means algorithm, FCM algorithm, EM algorithm and BIRCH, used for data clustering and then show their strengths and weaknesses. Another task is to compare the results obtained by applying each of these algorithms to the same data and to give a conclusion based on these results. 展开更多
关键词 clustering K-means fuzzy c-means Expectation Maximization BIRcH
下载PDF
基于GWO-FCM的输油泵故障诊断模型自学习框架
14
作者 郭俊霞 谢自力 +2 位作者 毛申申 魏聪聪 邢健 《北京化工大学学报(自然科学版)》 CAS CSCD 北大核心 2024年第6期79-86,共8页
随着输油泵场站无人化建设的发展,企业对输油泵故障诊断技术的要求也越来越高。目前,被广泛使用的利用机器学习算法进行输油泵故障诊断的方法都只能针对模型训练集中已包含的几类故障进行诊断,在企业的实际使用中,仍会出现其他不包含在... 随着输油泵场站无人化建设的发展,企业对输油泵故障诊断技术的要求也越来越高。目前,被广泛使用的利用机器学习算法进行输油泵故障诊断的方法都只能针对模型训练集中已包含的几类故障进行诊断,在企业的实际使用中,仍会出现其他不包含在训练集中的故障而不能被正确自动识别、诊断。针对上述问题,设计了一种输油泵故障诊断模型自学习框架,通过信号处理技术结合深度学习提取深层故障特征,提高工业现场数据的可分性;通过模糊C均值聚类结合相似度度量判别已知故障和未知故障,对出现的未知故障模式进行识别和记录;利用频繁出现的未知故障数据重训练模型,在原有诊断功能的基础上提高对未知故障的识别、诊断及学习能力。为验证方法的有效性,使用工业现场采集的输油泵数据进行实验,结果表明,现有诊断方法所提出的输油泵故障诊断模型自学习框架能够实现对未知故障的准确识别。 展开更多
关键词 输油泵 故障诊断 自学习 模糊c均值聚类
下载PDF
基于FCM和EO-SVM水轮机尾水管压力脉动特征识别 被引量:1
15
作者 刘茜媛 王利英 +1 位作者 张路遥 曹庆皎 《水电能源科学》 北大核心 2024年第1期162-165,共4页
为有效识别水轮机尾水管压力脉动特征,提出了一种基于模糊C均值聚类、平衡优化器算法与支持向量机的识别方法。该方法首先采用平衡优化器算法优化SVM的惩罚因子和核函数以获得更好的SVM参数组合,构建EO-SVM识别模型以实现其在水轮机尾... 为有效识别水轮机尾水管压力脉动特征,提出了一种基于模糊C均值聚类、平衡优化器算法与支持向量机的识别方法。该方法首先采用平衡优化器算法优化SVM的惩罚因子和核函数以获得更好的SVM参数组合,构建EO-SVM识别模型以实现其在水轮机尾水管压力脉动特征识别中的应用。然后采用模糊C均值聚类算法将待分类的压力脉动特征进行初始聚类,将其分为四类,并依据聚类结果选择最靠近每类中心的样本作为EO-SVM模型的训练样本。将SVM和EO-SVM两种模型的识别分类结果进行比较,验证了所提EO-SVM模型的有效性。 展开更多
关键词 压力脉动 小波包分析 模糊c均值聚类 平衡优化器算法 支持向量机
下载PDF
Partition region-based suppressed fuzzy C-means algorithm 被引量:1
16
作者 Kun Zhang Weiren Kong +4 位作者 Peipei Liu Jiao Shi Yu Lei Jie Zou Min Liu 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2017年第5期996-1008,共13页
Aimed at the problem that the traditional suppressed fuzzy C-means clustering algorithms ignore the real needs of different objects, applying the same suppressed parameter for modifying membership degrees of all the o... Aimed at the problem that the traditional suppressed fuzzy C-means clustering algorithms ignore the real needs of different objects, applying the same suppressed parameter for modifying membership degrees of all the objects, a novel partition region-based suppressed fuzzy C-means clustering algorithm with better capacity of adaptability and robustness is proposed in this paper. The model based on the real needs of different objects is built, making it clear to decide whether to proceed with further determination; in addition, the external user-defined suppressed parameter is automatically selected according to the intrinsic structural characteristic of each dataset, making the proposed method become robust to the fluctuations in the incoming dataset and initial conditions. Experimental results show that the proposed method is more robust than its counterparts and overcomes the weakness of the original suppressed clustering algorithm in most cases. 展开更多
关键词 shadowed set suppressed fuzzy c-means clustering automatically parameter selection soft computing techniques
下载PDF
基于FCM-LSTM的光热发电出力短期预测 被引量:1
17
作者 刘振路 郭军红 +2 位作者 李薇 贾宏涛 陈卓 《工程科学学报》 EI CSCD 北大核心 2024年第1期178-186,共9页
对光热电站的出力进行短期预测,可以有效应对太阳能随机性和波动性带来的影响,为电网调度做好准备.该文以青海某光热电站为例,首先使用模糊C均值聚类算法对预处理后的实验数据进行分类,然后通过分析不同聚类类型下出力和气象数据中各因... 对光热电站的出力进行短期预测,可以有效应对太阳能随机性和波动性带来的影响,为电网调度做好准备.该文以青海某光热电站为例,首先使用模糊C均值聚类算法对预处理后的实验数据进行分类,然后通过分析不同聚类类型下出力和气象数据中各因子间的关联程度,充分挖掘出数据间的关系,确定不同类型预测模型的输入变量,进而构建出不同类别下的长短期记忆神经网络预测模型.结果表明,与传统长短期记忆神经网络模型、BP神经网络模型、支持向量机模型和随机森林模型的预测结果相比,基于模糊C均值聚类的长短期记忆神经网络预测模型效果良好,大幅减少了预测误差,验证了该预测模型的有效性. 展开更多
关键词 光热电站 气象因素 短期出力预测 长短期记忆神经网络 模糊c均值聚类
下载PDF
基于模糊C-Means的改进型KNN分类算法 被引量:12
18
作者 朱付保 谢利杰 +1 位作者 汤萌萌 朱颢东 《华中师范大学学报(自然科学版)》 CAS 北大核心 2017年第6期754-759,共6页
KNN算法是一种思想简单且容易实现的分类算法,但在训练集较大以及特征属性较多时候,其效率低、时间开销大.针对这一问题,论文提出了基于模糊C-means的改进型KNN分类算法,该算法在传统的KNN分类算法基础上引入了模糊C-means理论,通过对... KNN算法是一种思想简单且容易实现的分类算法,但在训练集较大以及特征属性较多时候,其效率低、时间开销大.针对这一问题,论文提出了基于模糊C-means的改进型KNN分类算法,该算法在传统的KNN分类算法基础上引入了模糊C-means理论,通过对样本数据进行聚类处理,用形成的子簇代替该子簇所有的样本集,以减少训练集的数量,从而减少KNN分类过程的工作量、提高分类效率,使KNN算法更好地应用于数据挖掘.通过理论分析和实验结果表明,论文所提算法在面对较大数据时能有效提高算法的效率和精确性,满足处理数据的需求. 展开更多
关键词 模糊cmeans 聚类 KNN分类
下载PDF
基于改进的模糊C-Means航迹聚类方法研究 被引量:18
19
作者 王超 王明明 王飞 《中国民航大学学报》 CAS 2013年第3期14-18,共5页
为指导飞行程序的改善和发现管制员的指挥模式,在分析历史飞行航迹特征基础上,应用最小描绘长度(MDL)原理对航迹特征点进行划分,运用融合了遗传算法和模拟退火算法的改进的模糊C-Means算法对特征点进行聚类,通过最长公共子序列(LCS)算... 为指导飞行程序的改善和发现管制员的指挥模式,在分析历史飞行航迹特征基础上,应用最小描绘长度(MDL)原理对航迹特征点进行划分,运用融合了遗传算法和模拟退火算法的改进的模糊C-Means算法对特征点进行聚类,通过最长公共子序列(LCS)算法得到航迹相似性矩阵,利用矩阵得到航迹簇,最后形成中心航迹,算例仿真验证了新算法的有效性。 展开更多
关键词 航迹聚类 遗传模拟退火算法 模糊cmeans 最长公共子序列
下载PDF
Instance reduction for supervised learning using input-output clustering method
20
作者 YODJAIPHET Anusorn THEERA-UMPON Nipon AUEPHANWIRIYAKUL Sansanee 《Journal of Central South University》 SCIE EI CAS CSCD 2015年第12期4740-4748,共9页
A method that applies clustering technique to reduce the number of samples of large data sets using input-output clustering is proposed.The proposed method clusters the output data into groups and clusters the input d... A method that applies clustering technique to reduce the number of samples of large data sets using input-output clustering is proposed.The proposed method clusters the output data into groups and clusters the input data in accordance with the groups of output data.Then,a set of prototypes are selected from the clustered input data.The inessential data can be ultimately discarded from the data set.The proposed method can reduce the effect from outliers because only the prototypes are used.This method is applied to reduce the data set in regression problems.Two standard synthetic data sets and three standard real-world data sets are used for evaluation.The root-mean-square errors are compared from support vector regression models trained with the original data sets and the corresponding instance-reduced data sets.From the experiments,the proposed method provides good results on the reduction and the reconstruction of the standard synthetic and real-world data sets.The numbers of instances of the synthetic data sets are decreased by 25%-69%.The reduction rates for the real-world data sets of the automobile miles per gallon and the 1990 census in CA are 46% and 57%,respectively.The reduction rate of 96% is very good for the electrocardiogram(ECG) data set because of the redundant and periodic nature of ECG signals.For all of the data sets,the regression results are similar to those from the corresponding original data sets.Therefore,the regression performance of the proposed method is good while only a fraction of the data is needed in the training process. 展开更多
关键词 instance reduction input-output clustering fuzzy c-means clustering support vector regression supervised learning
下载PDF
上一页 1 2 111 下一页 到第
使用帮助 返回顶部