期刊文献+

结合链接结构聚类的混沌粒子群网页分类规则抽取

A web document categorization rule extraction based on chaos particle swarm optimization combining linkage clustering
下载PDF
导出
摘要 网页分类器设计的核心是对原始分类数据集进行分类规则挖掘,本文提出了一种结合链接结构聚类的混沌粒子群网页分类规则获取算法.算法将聚类和分类结合起来进行分类规则提取:首先用基于K均值的聚类算法对一部分有代表性的链接结构数据聚类,进行类别自动标注,形成训练集;再用混沌粒子群算法对已标注类别的数据提取分类规则.实验结果表明,这种模式充分发挥了基于链接的分类方法受人为因素干扰最小的优点,减少了人工标注类别的工作量,同时提高分类的准确率和效率. The core of classifier is extracting web document categorization rule. An algorithm of web document categorization rule extraction based on chaos particle swarm optimization combining linkage clustering is proposed in this paper. Aiming at advantages of clustering and classifying, the algorithm gains categorization rule by combine them: firstly cluster one part of representative unlabeled linkage data to label category automatically. Then categorization rule is gained by using chaos particle swarm algorithm. The experiment results show this model not only can develop thoroughly the merit of linkage clustering least disturbance from human factor but also can avoid the fault of original data set and alleviate works of classifying by specialist as well as ratio of precision and recall have improved a lot.
作者 童亚拉
出处 《华中师范大学学报(自然科学版)》 CAS CSCD 2008年第4期535-538,共4页 Journal of Central China Normal University:Natural Sciences
基金 国家自然科学基金资助项目(60773009) 国家重点基础研究发展规则"973"基金资助项目(2007AA012290).
关键词 网页分类 规则抽取 混沌粒子群 链接结构聚类 web document categorization rule extraction chaos particle swarm optimization linkage clustering
  • 相关文献

参考文献5

二级参考文献21

  • 1陈展荣,曾毅平.Web汉语料的智能抽取与词汇切分[J].计算机工程与设计,2005,26(6):1422-1424. 被引量:4
  • 2王珏,苗夺谦,周育健.关于Rough Set理论与应用的综述[J].模式识别与人工智能,1996,9(4):337-344. 被引量:264
  • 3Hu X,Int J Computational Intelligence,1995年,11卷,2期,323页
  • 4Yiming Yang. An evaluation of statistical approaches to text categorizaiton. Information Retrieval, 1999,1 : 69-90.
  • 5Qiang Shen, Alexios Chouchoulas. A rough-fuzzy approach forgenerating classification rules. Pattern Recognition, 2002, 35 :2425 - 2438.
  • 6Lili Diao, Keyun Flu, Yuchan Lu, Chunyi Shi. Simple decision trees with Bayesian learning for text categorization.In: Proceedings of the 4th World Congress on Intelligent Control and Automation, Shanghai, China, 2002. Shanghai:IEEE Robotics and Automation Society, 2002. 321 - 325.
  • 7Yanqiu Chen, Nixon M.S., Damper R.I.. Implementing the k-nearest neighbour rule via a neural network. In: IEEE International Conference on Neural Networks, Perth, Western Australia, 1995. Springer Verlag,1995. vol.1, 136 - 140.
  • 8Pawlak Z,Graymala-Bausse J,Slowinskl R,et al.Rough sets.Communications of the ACM,1995,38(11):89-95.
  • 9苗夺谦,王珏.基于粗糙集的多变量决策树构造方法[J].软件学报,1997,8(6):425-431. 被引量:120
  • 10Zhang Yizhong,Zhao Mingsheng,Wu Youshou.The automatic classification of web pages based on neural networks[C].Neural Information Processing,ICONIP2001 Proceedings,2001.570-575.

共引文献413

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部