期刊文献+

域间F-范数正则化迁移谱聚类方法 被引量:5

Transfer Spectral Clustering Based on Inter-Domain F-Norm Regularization
下载PDF
导出
摘要 传统聚类算法在目标数据集被噪声或异常数据大量污染的场景下聚类效果不佳。针对此问题,在经典谱聚类算法(spectral clustering,SC)基础上加入迁移学习知识,提出了新的域间F-范数正则化迁移谱聚类算法(transfer spectral clustering based on inter-domain F-norm regularization,TSC-IDFR)。该算法通过第K最近邻原则为目标域数据从源域(历史数据)获取等量的可参照数据样本,然后基于域间F范数正则化机制,迁移这些源域可参照数据样本的谱聚类特征矩阵,以辅助目标域数据集上的谱聚类过程,从而解决实际问题中由于目标域数据污染带来的聚类难题,最终提高谱聚类效果。通过在模拟数据集和真实数据集上的仿真实验,证明了该算法的有效性。 Traditional clustering algorithm usually has poor clustering performance in the cases where the target data are fairly distorted by noise.In order to address such challenge,based on the classic spectral clustering(SC)algorithm,and by using the strategy of transfer learning,this paper proposes the transfer spectral clustering algorithm based on inter-domain F-norm regularization(TSC-IDFR).For the data in the target domain,TSC-IDFR firstly selects the referenced examples,of which the sampling size is the same as the data size in the target domain,from the source domain(historical data)by means of the principle of the Kth nearest neighbor.Then,in terms of the mechanism of inter-domain F-norm regularization,the matrix composed of the spectral eigenvectors of the selected referenced examples from the source domain is used to assist the spectral clustering on the target data.As such,TSC-IDFR successfully resolves the clustering on the target data set(target domain)even if it contains much noise.The effectiveness of the proposed algorithm has been demonstrated by experimental studies on both synthetic and real data sets.
作者 魏彩娜 钱鹏江 奚臣 WEI Caina;QIAN Pengjiang;XI Chen(School of Digital Media, Jiangnan University, Wuxi, Jiangsu 214122, China)
出处 《计算机科学与探索》 CSCD 北大核心 2018年第3期472-483,共12页 Journal of Frontiers of Computer Science and Technology
基金 国家自然科学基金面上项目 No.61170122 新世纪优秀人才支持计划项目 No.NCET-120882 中央高校基本科研业务费专项资金 No.JUSRP51614A~~
关键词 迁移学习 谱聚类 正则化 transfer learning spectral clustering regularization
  • 相关文献

参考文献7

二级参考文献203

  • 1张敏,于剑.基于划分的模糊聚类算法[J].软件学报,2004,15(6):858-868. 被引量:176
  • 2邓赵红,王士同,吴锡生,胡德文.鲁棒的极大熵聚类算法RMEC及其例外点标识[J].中国工程科学,2004,6(9):38-45. 被引量:12
  • 3FINGER,CHRISTOPHER.A Methodology to Stress Correlations[J].Risk Metrics Monitor,1997(4):3-11.
  • 4LUENBERGER D G.Optimization by Vector Space Methods[M].New York:1969.
  • 5ROHN J.NP-hardness results for some linear and quadratic problems[N].Technical Report No.619.January,1995.
  • 6NICHOLAS J,HIGHAM.Computing the nearest correlation matrix-a problem from finance[J].IMA Journal of Numerical Analysis,2002(22):329-343.
  • 7HAN S P.A successive projection method[J].Math.Prog.,1988(40):1-14.
  • 8Hall L 0, Goldgof D B. Convergence of the Single-Pass and Online Fuzzy C-Means Algorithms. IEEE Trans on Fuzzy Systems, 2011, 19(4): 792-794.
  • 9WU K L, Yang M S. Alternative C-Means Clustering Algorithms. Pattern Recognition, 2002, 35(10): 2267-2278.
  • 10Yang M S. On a Class of Fuzzy Classification Maximum Likelihood Procedures. Fuzzy Sets and Systems, 1993, 57 (3) : 365-375.

共引文献484

同被引文献24

引证文献5

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部