期刊文献+

基于最小闭包球的中文博客分类

Chinese Blog Classification Based on Minimum Enclosing Ball
下载PDF
导出
摘要 提出一种基于近似最小闭包球原理的中文博客(Blog)话题分类方法。根据近似最小闭包球原理,将支持向量机的优化求解转换为近似最小闭包球求解,使得只需选择大规模数据集的一个核心子集参与分类器的训练过程,以提高Blog话题分类中大规模训练集的处理能力。在较大规模的Blog数据集上进行中文Blog特征选择及话题分类实验。实验结果表明,该方法不仅准确率可达到支持向量机同等的效果,且可减少训练时间,获得较好的Blog话题分类效果。 A Chinese Blog topic classification method based on approximate minimum enclosing ball is proposed.By transforming the optimization problem of the Support Vector Machine(SVM) to the optimization problem of approximate minimum enclosing ball equivalently,the Blog topic classifier can be trained quickly by only selecting a core subset of the original large scale dataset.The feature selection experiments and topic classification experiments are executed on large scale Blog dataset.Experimental results show that the method can provide good classification precise and quick run-time speed.
出处 《计算机工程》 CAS CSCD 2012年第23期162-165,共4页 Computer Engineering
基金 国家自然科学基金资助项目(60903114 60973100) 广东省自然科学基金资助项目(7301329) 深圳市科技计划基金资助项目(JC201005280463A JC201105160498A)
关键词 博客分类 近似最小闭包球 支持向量机 核心向量机 数据挖掘 新兴媒体 Blog classification approximate minimum enclosing ball Support Vector Machine(SVM) core vector machine data mining new media
  • 相关文献

参考文献7

  • 1杨宇航,赵铁军,于浩,郑德权.Blog研究[J].软件学报,2008,19(4):912-924. 被引量:19
  • 2Cristianini N,Shawe-Taylor J.An Introduction to Support VectorMachines and Other Kernel-based Learning Methods[M].Cambridge,UK:Cambridge University Press,2000.
  • 3Tsang I W,Kwok J T,Cheung P M.Core Vector Machines:FastSVM Training on Very Large Data Sets[J].Journal of MachineLearning Research,2005,6:363-392.
  • 4Badoiu M,Clarkson K L.Optimal Core-sets for Balls[J].Computational Geometry Theory Applications,2008,40(1):14-22.
  • 5Tsang I W,Kocsor A,Kwok J T.Simpler Core Vector Machineswith Enclosing Balls[C]//Proc.of the 24th InternationalConference on Machine Learning.Corvalis,USA:ACM Press,2007:911-918.
  • 6Baeza-Yates R,Ribeiro-Neto B.Modern Information Retrieval[M].[S.l.]:ACM Press,1999.
  • 7Feldman R,Sanger J.The Text Mining Handbook:AdvancedApproaches in Analyzing Unstructured Data[M].Cambridge,UK:Cambridge University Press,2007.

二级参考文献2

共引文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部