期刊文献+

基于特征族群语义扩散核的半监督农业文本分类 被引量:2

Semi-supervised Agricultural Text Classification Based on Feature Cluster Semantic Diffusion Kernel
下载PDF
导出
摘要 农业文本分类旨在对主流的农业信息网抽取的文本数据集进行分类.在样本充足的情形下,经典的支持向量机方法能取得较好的效果,然而在样本较少或者样本矩阵很稀疏的情形下效果较差.提出了一种基于特征族群语义扩散核(它是语义扩散核的一种)和支持向量机的半监督农业文本分类方法.该方法在经典的支持向量机方法基础上结合特征族群语义扩散核,使得农业文本分类准确率得到一个显著的提升,在训练集样本数量只有原来一半的数量情况下预测原来的测试样本,预测准确率几乎与原来的相同. Agricultural text classification aims to classify the text datasets extracted from mainstream agricultural information networks. With sufficient samples ,the classical support vector machine (SVM) method can achieve good results ,but the classification performance is usually unsatisfactory when the number of the samples is small or the sample matrix is sparse. This paper presents a semi - supervised agricul- tural text classification method which is based on the feature - cluster semantic diffusion kernel and SVM. Combined the classical SVM with feature -cluster semantic diffusion kernel, this method yields a significantly better classification results than those obtained by the classical method. More specifically,when the number of the training samples is reduced to the half of that of the original training set,the prediction accuracy of the proposed method is almost the same as that of the classical method with the original training set.
作者 李伟 汪廷华 郑惠宁 LI Wei, WANG Tinghua, ZHENG Huining(School of Mathematics and Computer Scietwe, Gannan Normal University, C, anzhou 341000 ,P. R. Chin)
出处 《赣南师范大学学报》 2018年第3期66-71,共6页 Journal of Gannan Normal University
基金 国家自然科学基金项目(61562003) 江西省研究生创新专项资金项目(YC2016-S405)
关键词 文本分类 支持向量机 语义扩散核 半监督学习 农业文本 text classification support vector machine (SVM) feature - cluster semantic diffusion kernel agricultural text
  • 相关文献

参考文献9

二级参考文献221

共引文献191

同被引文献10

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部