期刊文献+
共找到91篇文章
< 1 2 5 >
每页显示 20 50 100
Fast global kernel fuzzy c-means clustering algorithm for consonant/vowel segmentation of speech signal 被引量:2
1
作者 Xian ZANG Felipe P. VISTA IV Kil To CHONG 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2014年第7期551-563,共13页
We propose a novel clustering algorithm using fast global kernel fuzzy c-means-F(FGKFCM-F), where F refers to kernelized feature space. This algorithm proceeds in an incremental way to derive the near-optimal solution... We propose a novel clustering algorithm using fast global kernel fuzzy c-means-F(FGKFCM-F), where F refers to kernelized feature space. This algorithm proceeds in an incremental way to derive the near-optimal solution by solving all intermediate problems using kernel-based fuzzy c-means-F(KFCM-F) as a local search procedure. Due to the incremental nature and the nonlinear properties inherited from KFCM-F, this algorithm overcomes the two shortcomings of fuzzy c-means(FCM): sen- sitivity to initialization and inability to use nonlinear separable data. An accelerating scheme is developed to reduce the compu-tational complexity without significantly affecting the solution quality. Experiments are carried out to test the proposed algorithm on a nonlinear artificial dataset and a real-world dataset of speech signals for consonant/vowel segmentation. Simulation results demonstrate the effectiveness of the proposed algorithm in improving clustering performance on both types of datasets. 展开更多
关键词 fuzzy c-means clustering kernel method Global optimization Consonant/vowel segmentation
原文传递
A Fixed Suppressed Rate Selection Method for Suppressed Fuzzy C-Means Clustering Algorithm 被引量:2
2
作者 Jiulun Fan Jing Li 《Applied Mathematics》 2014年第8期1275-1283,共9页
Suppressed fuzzy c-means (S-FCM) clustering algorithm with the intention of combining the higher speed of hard c-means clustering algorithm and the better classification performance of fuzzy c-means clustering algorit... Suppressed fuzzy c-means (S-FCM) clustering algorithm with the intention of combining the higher speed of hard c-means clustering algorithm and the better classification performance of fuzzy c-means clustering algorithm had been studied by many researchers and applied in many fields. In the algorithm, how to select the suppressed rate is a key step. In this paper, we give a method to select the fixed suppressed rate by the structure of the data itself. The experimental results show that the proposed method is a suitable way to select the suppressed rate in suppressed fuzzy c-means clustering algorithm. 展开更多
关键词 HARD c-means clustering algorithm fuzzy c-means clustering algorithm Suppressed fuzzy c-means clustering algorithm Suppressed RATE
下载PDF
Hybrid Clustering Using Firefly Optimization and Fuzzy C-Means Algorithm
3
作者 Krishnamoorthi Murugasamy Kalamani Murugasamy 《Circuits and Systems》 2016年第9期2339-2348,共10页
Classifying the data into a meaningful group is one of the fundamental ways of understanding and learning the valuable information. High-quality clustering methods are necessary for the valuable and efficient analysis... Classifying the data into a meaningful group is one of the fundamental ways of understanding and learning the valuable information. High-quality clustering methods are necessary for the valuable and efficient analysis of the increasing data. The Firefly Algorithm (FA) is one of the bio-inspired algorithms and it is recently used to solve the clustering problems. In this paper, Hybrid F-Firefly algorithm is developed by combining the Fuzzy C-Means (FCM) with FA to improve the clustering accuracy with global optimum solution. The Hybrid F-Firefly algorithm is developed by incorporating FCM operator at the end of each iteration in FA algorithm. This proposed algorithm is designed to utilize the goodness of existing algorithm and to enhance the original FA algorithm by solving the shortcomings in the FCM algorithm like the trapping in local optima and sensitive to initial seed points. In this research work, the Hybrid F-Firefly algorithm is implemented and experimentally tested for various performance measures under six different benchmark datasets. From the experimental results, it is observed that the Hybrid F-Firefly algorithm significantly improves the intra-cluster distance when compared with the existing algorithms like K-means, FCM and FA algorithm. 展开更多
关键词 clustering OPTIMIZATION K-MEANS fuzzy c-means Firefly algorithm F-Firefly
下载PDF
Kernel method-based fuzzy clustering algorithm 被引量:2
4
作者 WuZhongdong GaoXinbo +1 位作者 XieWeixin YuJianping 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2005年第1期160-166,共7页
The fuzzy C-means clustering algorithm(FCM) to the fuzzy kernel C-means clustering algorithm(FKCM) to effectively perform cluster analysis on the diversiform structures are extended, such as non-hyperspherical data, d... The fuzzy C-means clustering algorithm(FCM) to the fuzzy kernel C-means clustering algorithm(FKCM) to effectively perform cluster analysis on the diversiform structures are extended, such as non-hyperspherical data, data with noise, data with mixture of heterogeneous cluster prototypes, asymmetric data, etc. Based on the Mercer kernel, FKCM clustering algorithm is derived from FCM algorithm united with kernel method. The results of experiments with the synthetic and real data show that the FKCM clustering algorithm is universality and can effectively unsupervised analyze datasets with variform structures in contrast to FCM algorithm. It is can be imagined that kernel-based clustering algorithm is one of important research direction of fuzzy clustering analysis. 展开更多
关键词 fuzzy clustering analysis kernel method fuzzy c-means clustering.
下载PDF
Gene Coding Sequence Identification Using Kernel Fuzzy C-Mean Clustering and Takagi-Sugeno Fuzzy Model
5
作者 Tianlei Zang Kai Liao +2 位作者 Zhongmin Sun Zhengyou He Qingquan Qian 《国际计算机前沿大会会议论文集》 2015年第1期78-79,共2页
Sequence analysis technology under big data provides unprecedented opportunities for modern life science. A novel gene coding sequence identification method is proposed in this paper. Firstly, an improved short-time F... Sequence analysis technology under big data provides unprecedented opportunities for modern life science. A novel gene coding sequence identification method is proposed in this paper. Firstly, an improved short-time Fourier transform algorithm based on Morlet wavelet is applied to extract the power spectrum of DNA sequence. Then, threshold value determination method based on kernel fuzzy C-mean clustering is used to combine Signal to Noise Ratio (SNR) data of exon and intron into a sequence, classify the sequence into two types, calculate the weighted sum of two SNR clustering centers obtained and the discrimination threshold value. Finally, exon interval endpoint identification algorithm based on Takagi-Sugeno fuzzy identification model is presented to train Takagi-Sugeno model, optimize model parameters with Levenberg-Marquardt least square method, complete model and determine fuzzy rule. To verify the effectiveness of the proposed method, example tests are conducted on typical gene sequence sample data. 展开更多
关键词 gene IDENTIFICATION power spectrum analysis THRESHOLD value determination kernel fuzzy c-mean clustering TAKAGI-SUGENO fuzzy IDENTIFICATION
下载PDF
Substation clustering based on improved KFCM algorithm with adaptive optimal clustering number selection 被引量:1
6
作者 Yanhui Xu Yihao Gao +4 位作者 Yundan Cheng Yuhang Sun Xuesong Li Xianxian Pan Hao Yu 《Global Energy Interconnection》 EI CSCD 2023年第4期505-516,共12页
The premise and basis of load modeling are substation load composition inquiries and cluster analyses.However,the traditional kernel fuzzy C-means(KFCM)algorithm is limited by artificial clustering number selection an... The premise and basis of load modeling are substation load composition inquiries and cluster analyses.However,the traditional kernel fuzzy C-means(KFCM)algorithm is limited by artificial clustering number selection and its convergence to local optimal solutions.To overcome these limitations,an improved KFCM algorithm with adaptive optimal clustering number selection is proposed in this paper.This algorithm optimizes the KFCM algorithm by combining the powerful global search ability of genetic algorithm and the robust local search ability of simulated annealing algorithm.The improved KFCM algorithm adaptively determines the ideal number of clusters using the clustering evaluation index ratio.Compared with the traditional KFCM algorithm,the enhanced KFCM algorithm has robust clustering and comprehensive abilities,enabling the efficient convergence to the global optimal solution. 展开更多
关键词 Load substation clustering Simulated annealing genetic algorithm kernel fuzzy c-means algorithm clustering evaluation
下载PDF
Fault Pattern Recognition based on Kernel Method and Fuzzy C-means
7
作者 SUN Yebei ZHAO Rongzhen TANG Xiaobin 《International Journal of Plant Engineering and Management》 2016年第4期231-240,共10页
A method about fault identification is proposed to solve the relationship among fault features of large rotating machinery, which is extremely complicated and nonlinear. This paper studies the rotor test-rig and the c... A method about fault identification is proposed to solve the relationship among fault features of large rotating machinery, which is extremely complicated and nonlinear. This paper studies the rotor test-rig and the clustering of data sets and fault pattern recognitions. The present method firstly maps the data from their original space to a high dimensional Kernel space which makes the highly nonlinear data in low-dimensional space become linearly separable in Kernel space. It highlights the differences among the features of the data set. Then fuzzy C-means (FCM) is conducted in the Kernel space. Each data is assigned to the nearest class by computing the distance to the clustering center. Finally, test set is used to judge the results. The convergence rate and clustering accuracy are better than traditional FCM. The study shows that the method is effective for the accuracy of pattern recognition on rotating machinery. 展开更多
关键词 kernel method fuzzy c-means FCM pattern recognition clustering
下载PDF
CONSIDERING NEIGHBORHOOD INFORMATION IN IMAGE FUZZY CLUSTERING 被引量:2
8
作者 Huang Ning Zhu Minhui Zhang Shourong(The Nat. Key Lab of Microwave Imaging Tech, Inst. of Electronics, CAS, Beijing 100080) 《Journal of Electronics(China)》 2002年第3期307-310,共4页
Fuzzy C-means clustering algorithm is a classical non-supervised classification method.For image classification, fuzzy C-means clustering algorithm makes decisions on a pixel-by-pixel basis and does not take advantage... Fuzzy C-means clustering algorithm is a classical non-supervised classification method.For image classification, fuzzy C-means clustering algorithm makes decisions on a pixel-by-pixel basis and does not take advantage of spatial information, regardless of the pixels' correlation. In this letter, a novel fuzzy C-means clustering algorithm is introduced, which is based on image's neighborhood system. During classification procedure, the novel algorithm regards all pixels'fuzzy membership as a random field. The neighboring pixels' fuzzy membership information is used for the algorithm's iteration procedure. As a result, the algorithm gives a more smooth classification result and cuts down the computation time. 展开更多
关键词 Remote sensing clustering fuzzy c-means clustering algorithm
下载PDF
Modified possibilistic clustering model based on kernel methods
9
作者 武小红 周建江 《Journal of Shanghai University(English Edition)》 CAS 2008年第2期136-140,共5页
A novel model of fuzzy clustering using kernel methods is proposed. This model is called kernel modified possibilistic c-means (KMPCM) model. The proposed model is an extension of the modified possibilistic c-means ... A novel model of fuzzy clustering using kernel methods is proposed. This model is called kernel modified possibilistic c-means (KMPCM) model. The proposed model is an extension of the modified possibilistic c-means (MPCM) algorithm by using kernel methods. Different from MPCM and fuzzy c-means (FCM) model which are based on Euclidean distance, the proposed model is based on kernel-induced distance. Furthermore, with kernel methods the input data can be mapped implicitly into a high-dimensional feature space where the nonlinear pattern now appears linear. It is unnecessary to do calculation in the high-dimensional feature space because the kernel function can do it. Numerical experiments show that KMPCM outperforms FCM and MPCM. 展开更多
关键词 fuzzy clustering kernel methods possibilistic c-means (PCM) kernel modified possibilistic c-means (KMPCM).
下载PDF
Agent Based Segmentation of the MRI Brain Using a Robust C-Means Algorithm
10
作者 Hanane Barrah Abdeljabbar Cherkaoui Driss Sarsri 《Journal of Computer and Communications》 2016年第10期13-21,共9页
In the last decade, the MRI (Magnetic Resonance Imaging) image segmentation has become one of the most active research fields in the medical imaging domain. Because of the fuzzy nature of the MRI images, many research... In the last decade, the MRI (Magnetic Resonance Imaging) image segmentation has become one of the most active research fields in the medical imaging domain. Because of the fuzzy nature of the MRI images, many researchers have adopted the fuzzy clustering approach to segment them. In this work, a fast and robust multi-agent system (MAS) for MRI segmentation of the brain is proposed. This system gets its robustness from a robust c-means algorithm (RFCM) and obtains its fastness from the beneficial properties of agents, such as autonomy, social ability and reactivity. To show the efficiency of the proposed method, we test it on a normal brain brought from the BrainWeb Simulated Brain Database. The experimental results are valuable in both robustness to noise and running times standpoints. 展开更多
关键词 Agents and MAS MR Images fuzzy clustering c-means algorithm Image Segmentation
下载PDF
Abnormal State Detection of OLTC Based on Improved Fuzzy C-means Clustering
11
作者 Hongwei Li Lilong Dou +3 位作者 Shuaibing Li Yongqiang Kang Xingzu Yang Haiying Dong 《Chinese Journal of Electrical Engineering》 CSCD 2023年第1期129-141,共13页
An accurate extraction of vibration signal characteristics of an on-load tap changer(OLTC)during contact switching can effectively help detect its abnormal state.Therefore,an improved fuzzy C-means clustering method f... An accurate extraction of vibration signal characteristics of an on-load tap changer(OLTC)during contact switching can effectively help detect its abnormal state.Therefore,an improved fuzzy C-means clustering method for abnormal state detection of the OLTC contact is proposed.First,the wavelet packet and singular spectrum analysis are used to denoise the vibration signal generated by the moving and static contacts of the OLTC.Then,the Hilbert-Huang transform that is optimized by the ensemble empirical mode decomposition(EEMD)is used to decompose the vibration signal and extract the boundary spectrum features.Finally,the gray wolf algorithm-based fuzzy C-means clustering is used to denoise the signal and determine the abnormal states of the OLTC contact.An analysis of the experimental data shows that the proposed secondary denoising method has a better denoising effect compared to the single denoising method.The EEMD can improve the modal aliasing effect,and the improved fuzzy C-means clustering can effectively identify the abnormal state of the OLTC contacts.The analysis results of field measured data further verify the effectiveness of the proposed method and provide a reference for the abnormal state detection of the OLTC. 展开更多
关键词 On-load tap changer singular spectrum analysis Hilbert-Huang transform gray wolf optimization algorithm fuzzy c-means clustering
原文传递
A NEW UNSUPERVISED CLASSIFICATION ALGORITHM FOR POLARIMETRIC SAR IMAGES BASED ON FUZZY SET THEORY 被引量:2
12
作者 Fu Yusheng Xie Yan Pi Yiming Hou Yinming 《Journal of Electronics(China)》 2006年第4期598-601,共4页
In this letter, a new method is proposed for unsupervised classification of terrain types and man-made objects using POLarimetric Synthetic Aperture Radar (POLSAR) data. This technique is a combi-nation of the usage o... In this letter, a new method is proposed for unsupervised classification of terrain types and man-made objects using POLarimetric Synthetic Aperture Radar (POLSAR) data. This technique is a combi-nation of the usage of polarimetric information of SAR images and the unsupervised classification method based on fuzzy set theory. Image quantization and image enhancement are used to preprocess the POLSAR data. Then the polarimetric information and Fuzzy C-Means (FCM) clustering algorithm are used to classify the preprocessed images. The advantages of this algorithm are the automated classification, its high classifica-tion accuracy, fast convergence and high stability. The effectiveness of this algorithm is demonstrated by ex-periments using SIR-C/X-SAR (Spaceborne Imaging Radar-C/X-band Synthetic Aperture Radar) data. 展开更多
关键词 Radar polarimetry Synthetic Aperture Radar (SAR) fuzzy set theory Unsupervised classification Image quantization Image enhancement fuzzy c-means (FCM) clustering algorithm Membership function
下载PDF
Semi-supervised kernel FCM algorithm for remote sensing image classification
13
作者 刘小芳 HeBinbin LiXiaowen 《High Technology Letters》 EI CAS 2011年第4期427-432,共6页
These problems of nonlinearity, fuzziness and few labeled data were rarely considered in traditional remote sensing image classification. A semi-supervised kernel fuzzy C-means (SSKFCM) algorithm is proposed to over... These problems of nonlinearity, fuzziness and few labeled data were rarely considered in traditional remote sensing image classification. A semi-supervised kernel fuzzy C-means (SSKFCM) algorithm is proposed to overcome these disadvantages of remote sensing image classification in this paper. The SSKFCM algorithm is achieved by introducing a kernel method and semi-supervised learning technique into the standard fuzzy C-means (FCM) algorithm. A set of Beijing-1 micro-satellite's multispectral images are adopted to be classified by several algorithms, such as FCM, kernel FCM (KFCM), semi-supervised FCM (SSFCM) and SSKFCM. The classification results are estimated by corresponding indexes. The results indicate that the SSKFCM algorithm significantly improves the classification accuracy of remote sensing images compared with the others. 展开更多
关键词 remote sensing image classification semi-supervised kernel fuzzy c-means (SSKFCM)algorithm Beijing-1 micro-satellite semi-supcrvisod learning tochnique kernel method
下载PDF
Clustering: from Clusters to Knowledge
14
作者 Peter Grabusts 《Computer Technology and Application》 2013年第6期284-290,共7页
Data analysis and automatic processing is often interpreted as knowledge acquisition. In many cases it is necessary to somehow classify data or find regularities in them. Results obtained in the search of regularities... Data analysis and automatic processing is often interpreted as knowledge acquisition. In many cases it is necessary to somehow classify data or find regularities in them. Results obtained in the search of regularities in intelligent data analyzing applications are mostly represented with the help of IF-THEN rules. With the help of these rules the following tasks are solved: prediction, classification, pattern recognition and others. Using different approaches---clustering algorithms, neural network methods, fuzzy rule processing methods--we can extract rules that in an understandable language characterize the data. This allows interpreting the data, finding relationships in the data and extracting new rules that characterize them. Knowledge acquisition in this paper is defined as the process of extracting knowledge from numerical data in the form of rules. Extraction of rules in this context is based on clustering methods K-means and fuzzy C-means. With the assistance of K-means, clustering algorithm rules are derived from trained neural networks. Fuzzy C-means is used in fuzzy rule based design method. Rule extraction methodology is demonstrated in the Fisher's Iris flower data set samples. The effectiveness of the extracted rules is evaluated. Clustering and rule extraction methodology can be widely used in evaluating and analyzing various economic and financial processes. 展开更多
关键词 Data analysis clustering algorithms K-MEANS fuzzy c-means rule extraction.
下载PDF
Interactive Protein Data Clustering
15
作者 Terje Kristensen Vemund Jakobsen 《Computer Technology and Application》 2011年第10期818-827,共10页
In this paper, the authors present three different algorithms for data clustering. These are Self-Organizing Map (SOM), Neural Gas (NG) and Fuzzy C-Means (FCM) algorithms. SOM and NG algorithms are based on comp... In this paper, the authors present three different algorithms for data clustering. These are Self-Organizing Map (SOM), Neural Gas (NG) and Fuzzy C-Means (FCM) algorithms. SOM and NG algorithms are based on competitive leaming. An important property of these algorithms is that they preserve the topological structure of data. This means that data that is close in input distribution is mapped to nearby locations in the network. The FCM algorithm is an algorithm based on soft clustering which means that the different clusters are not necessarily distinct, but may overlap. This clustering method may be very useful in many biological problems, for instance in genetics, where a gene may belong to different clusters. The different algorithms are compared in terms of their visualization of the clustering of proteomic data. 展开更多
关键词 DATAMINING self-organizing map neural gas fuzzy c-means algorithm and protein clustering.
下载PDF
Employment Quality EvaluationModel Based on Hybrid Intelligent Algorithm
16
作者 Xianhui Gu Xiaokan Wang Shuang Liang 《Computers, Materials & Continua》 SCIE EI 2023年第1期131-139,共9页
In order to solve the defect of large error in current employment quality evaluation,an employment quality evaluation model based on grey correlation degree method and fuzzy C-means(FCM)is proposed.Firstly,it analyzes... In order to solve the defect of large error in current employment quality evaluation,an employment quality evaluation model based on grey correlation degree method and fuzzy C-means(FCM)is proposed.Firstly,it analyzes the related research work of employment quality evaluation,establishes the employment quality evaluation index system,collects the index data,and normalizes the index data;Then,the weight value of employment quality evaluation index is determined by Grey relational analysis method,and some unimportant indexes are removed;Finally,the employment quality evaluation model is established by using fuzzy cluster analysis algorithm,and compared with other employment quality evaluation models.The test results show that the employment quality evaluation accuracy of the design model exceeds 93%,the employment quality evaluation error can meet the requirements of practical application,and the employment quality evaluation effect is much better than the comparison model.The comparison test verifies the superiority of the model. 展开更多
关键词 Employment quality fuzzy c-means clustering algorithm grey correlation analysis method evaluation model index system comparative test
下载PDF
基于改进FCM的冲压件缺陷图像分割算法
17
作者 张玉杰 高晗 《计算机工程》 CAS CSCD 北大核心 2024年第10期342-351,共10页
在工业质检过程中,冲压件缺陷图像分割作为缺陷检测的重要环节,直接影响缺陷检测效果。而传统的模糊C均值(FCM)聚类算法未考虑到空间邻域信息,对于噪声干扰较为敏感,导致分割精度较差,且其整体易受初始值的影响,造成收敛速度变慢。针对... 在工业质检过程中,冲压件缺陷图像分割作为缺陷检测的重要环节,直接影响缺陷检测效果。而传统的模糊C均值(FCM)聚类算法未考虑到空间邻域信息,对于噪声干扰较为敏感,导致分割精度较差,且其整体易受初始值的影响,造成收敛速度变慢。针对上述问题,提出一种改进的FCM算法。采用内核诱导距离中的简单两项代替传统的欧氏距离,将原有的空间像素映射到高维特征空间,提高线性可分概率和计算速度;利用图像像素之间的空间相关性,通过引入改进的马尔可夫随机场对FCM目标函数进行修正,提高算法的抗噪能力以及分割精度;采用秃鹰搜索(BES)算法确定FCM的初始聚类中心,提高算法的收敛速度,同时避免算法陷入局部极值的情况。为验证改进FCM算法的性能,选取划分熵、划分系数、Xie_Beni系数以及迭代次数作为评价指标,并与近年来先进的图像分割算法进行对比。实验结果表明,改进FCM算法具有更好的抗噪能力,能得到更好的缺陷分割效果,对工业生产中的冲压件缺陷检测有一定的应用价值。 展开更多
关键词 模糊C均值聚类 工业应用 冲压件缺陷 内核诱导距离 马尔可夫随机场 秃鹰搜索算法
下载PDF
Improved Kernel Possibilistic Fuzzy Clustering Algorithm Based on Invasive Weed Optimization 被引量:1
18
作者 赵小强 周金虎 《Journal of Shanghai Jiaotong university(Science)》 EI 2015年第2期164-170,共7页
Fuzzy c-means(FCM) clustering algorithm is sensitive to noise points and outlier data, and the possibilistic fuzzy c-means(PFCM) clustering algorithm overcomes the problem well, but PFCM clustering algorithm has some ... Fuzzy c-means(FCM) clustering algorithm is sensitive to noise points and outlier data, and the possibilistic fuzzy c-means(PFCM) clustering algorithm overcomes the problem well, but PFCM clustering algorithm has some problems: it is still sensitive to initial clustering centers and the clustering results are not good when the tested datasets with noise are very unequal. An improved kernel possibilistic fuzzy c-means algorithm based on invasive weed optimization(IWO-KPFCM) is proposed in this paper. This algorithm first uses invasive weed optimization(IWO) algorithm to seek the optimal solution as the initial clustering centers, and introduces kernel method to make the input data from the sample space map into the high-dimensional feature space. Then, the sample variance is introduced in the objection function to measure the compact degree of data. Finally, the improved algorithm is used to cluster data. The simulation results of the University of California-Irvine(UCI) data sets and artificial data sets show that the proposed algorithm has stronger ability to resist noise, higher cluster accuracy and faster convergence speed than the PFCM algorithm. 展开更多
关键词 data mining clustering algorithm possibilistic fuzzy c-means(PFCM) kernel possibilistic fuzzy c-means algorithm based on invasiv
原文传递
Enhancing Multicriteria-Based Recommendations by Alleviating Scalability and Sparsity Issues Using Collaborative Denoising Autoencoder
19
作者 S.Abinaya K.Uttej Kumar 《Computers, Materials & Continua》 SCIE EI 2024年第2期2269-2286,共18页
A Recommender System(RS)is a crucial part of several firms,particularly those involved in e-commerce.In conventional RS,a user may only offer a single rating for an item-that is insufficient to perceive consumer prefe... A Recommender System(RS)is a crucial part of several firms,particularly those involved in e-commerce.In conventional RS,a user may only offer a single rating for an item-that is insufficient to perceive consumer preferences.Nowadays,businesses in industries like e-learning and tourism enable customers to rate a product using a variety of factors to comprehend customers’preferences.On the other hand,the collaborative filtering(CF)algorithm utilizing AutoEncoder(AE)is seen to be effective in identifying user-interested items.However,the cost of these computations increases nonlinearly as the number of items and users increases.To triumph over the issues,a novel expanded stacked autoencoder(ESAE)with Kernel Fuzzy C-Means Clustering(KFCM)technique is proposed with two phases.In the first phase of offline,the sparse multicriteria rating matrix is smoothened to a complete matrix by predicting the users’intact rating by the ESAE approach and users are clustered using the KFCM approach.In the next phase of online,the top-N recommendation prediction is made by the ESAE approach involving only the most similar user from multiple clusters.Hence the ESAE_KFCM model upgrades the prediction accuracy of 98.2%in Top-N recommendation with a minimized recommendation generation time.An experimental check on the Yahoo!Movies(YM)movie dataset and TripAdvisor(TA)travel dataset confirmed that the ESAE_KFCM model constantly outperforms conventional RS algorithms on a variety of assessment measures. 展开更多
关键词 Recommender systems multicriteria rating collaborative filtering sparsity issue scalability issue stacked-autoencoder kernel fuzzy c-means clustering
下载PDF
基于高斯核函数的差分隐私模糊C均值聚类算法的构建与应用
20
作者 曹自雄 陈宇鲜 蒋秀梅 《中国医学装备》 2024年第8期106-112,共7页
目的:提出一种基于高斯核函数的差分隐私模糊C均值聚类算法(DPFCM_GF),旨在优化大数据背景下医疗数据分析和挖掘带来的数据隐私安全问题,为数据隐私保护提供理论基础。方法:针对随机初始化模糊C-均值隶属度矩阵降低算法精度问题,采用最... 目的:提出一种基于高斯核函数的差分隐私模糊C均值聚类算法(DPFCM_GF),旨在优化大数据背景下医疗数据分析和挖掘带来的数据隐私安全问题,为数据隐私保护提供理论基础。方法:针对随机初始化模糊C-均值隶属度矩阵降低算法精度问题,采用最大距离法确定初始中心点,使用聚类中心点的高斯值计算隐私预算分配比率,并添加拉普拉斯噪声以完成差分隐私保护,构建DPFCM_GF。收集整理美国加州大学欧文分校机器学习存储库的心脏病、乳腺癌、甲状腺疾病及糖尿病公开数据集对DPFCM_GF有效性进行验证,收集2019年1月1日至2022年12月31日淮安市第二人民医院收治的756例胃癌和肺癌患者病例数据集,对DPFCM_GF的可用性进行验证,并将分析结果与模糊C均值聚类算法(FCM)以及差分隐私模糊C均值聚类算法(DPFCM)进行对比分析。结果:对于心脏病、乳腺癌、甲状腺疾病及糖尿病公开数据集,DPFCM_GF和DPFCM的最优聚类效果与FCM聚类效果相当;相较于DPFCM,DPFCM_GF迭代时间更快,聚集速度显著,差异有统计学意义(t=4.01、4.71、4.01、12.38,P<0.05)。对于肺癌和胃癌数据集,随着隐私预算ε的增大,DPFCM_GF正确识别率逐渐聚集于91.9%和93.9%,受试者工作特征(ROC)曲线下面积(AUC)值分别为0.79和0.81;当隐私函数ε为0.1、0.5、1和2(ε<3)时,DPFCM_GF聚类效果显著优于DPFCM,且聚类效果更佳,差异有统计学意义(χ^(2)=12.25、87.12、68.58、7.76,P<0.05;χ^(2)=4.74、43.51、42.47、4.89,P<0.05)。结论:DPFCM_GF是一种有效保护医疗数据隐私的方法,同时也可进行数据分析和挖掘任务,具有一定的研究意义和研究前景。 展开更多
关键词 数据隐私 差分隐私 模糊C均值聚类算法 高斯核函数 数据挖掘 隐私预算
下载PDF
上一页 1 2 5 下一页 到第
使用帮助 返回顶部