期刊文献+

基于规范割的文本超图聚类 被引量:1

Text Hypergraph Clustering Based on Normalized Cut
下载PDF
导出
摘要 超图聚类方法是目前主流聚类方法之一.它的经典版本出现在超大规模集成电路研究领域.近年来,它的各种改进版本被提出并广泛应用于机器视觉领域.例如,在图像聚类和运动分割方面,它的各种版本常有较好的表现.本文将超图聚类方法引入文本聚类领域.首先,根据文本数据高度稀疏的特点,采用SVD(或PCA)对其进行降维;其次,采用基于大超边的超图规范割聚类对文本的低维投影进行聚类;最后,采用聚类准确率指标对聚类进行评价.在两个文本数据集的实验中,基于超图规范割聚类取得了比传统的k均值聚类和层次聚类更好的聚类表现. Hypergraph based clustering is one of the most popolar clustering methods at present.Its typical version is proposed firstly in the field of Very Large Scale Integration Circuit,while its various generalized versions are applied conprehensively in machine vision in recent years.For example,in image clustering and motion segmentation,its various versions often achieve good performance.In this paper,hypergraph baded clustering is introuduced to cluster text.Firstly,based on high sparsity of text data,we use SVD or PCA to decrease their dimensions,then we cluster the lower dimensional text using hypergraph normalized cut clustering in the large hyperedge case;finally,we evaluate the clustering perfomance using the index of accuracy.In the experiments on two text datasets,the method based on hypergrah normalized cut gets the best clustering accuracy comparing to the typical k means method and the hierarchical clustering method.
作者 檀敬东 TAN Jing-dong(School of Mathematics,Hefei University of Technology,Hefei 230009, China)
出处 《大学数学》 2017年第6期33-36,共4页 College Mathematics
基金 国家自然科学基金项目(61503115)
关键词 超图 规范割 文本聚类 随机聚类模型 hypergraph normalized cut text clustering random cluster models
  • 相关文献

参考文献1

二级参考文献12

  • 1Hughes N P,Tarassenko L. Novel Signal Shape Descriptors through Wavelet Transforms and Dimensionality Reduction[J].Proceedings of Spie,2003.763-773.
  • 2王珏;周志华;周傲英.机器学习及其应用[M]北京:清华大学出版社,2006.
  • 3Li Haifeng,Jiang Tao,Zhang Keshu. Efficient and Robust Feature Fusion by Maximum Margin Criterion[J].IEEE Transactions on Neural Networks,2006,(01):157-165.
  • 4Li Bo,Zheng Chunhou,Huang Deshuang. Locally Linear Discriminant Embedding:An Efficient Method for Face Recognition[J].Pattern Recognition,2008,(12):3813-3821.
  • 5Roweis S T. Nonlinear Dimensionality Reduction by Locally Linear Embedding[J].Science,2000,(5500):2323-2326.doi:10.1126/science.290.5500.2323.
  • 6Shawe-Taylor J,Cristianini N. Kernel Methods for Pattern Analysis[M].Cambridge:Cambridge University Press,2004.
  • 7Chatfield C,Collins A J. Introduction to Multivariate Analysis[M].London:Chapman and Hall,1980.
  • 8Zhang Huaping,Yu Hongkui,Xiong Deyi. HHMM-Based Chinese Lexical Analyzer ICTCLAS[A].Sapporo,Japan,2003.184-187.
  • 9Salton G,McGill M J. Introduction to Modem Information Retrieval[M].New York,USA:McGraw-Hill,1986.
  • 10Barshan E,Ghodsi A,Azimifar Z. Supervised Principal Component Analysis:Visualization,Classification and Regression on Subspaces and Submanifolds[J].Pattern Recognition,2011,(07):1357-1371.

共引文献3

同被引文献3

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部