期刊文献+

基于信息增益的中医体质多标记分类方法研究

Study on Multi-label Classification Method of TCM Constitutions Based on Information Gain
下载PDF
导出
摘要 目的为降低中医体质传统分类方法主观性误差,兼顾兼夹体质,提出基于信息增益的中医体质多标记分类方法。方法采用多标记方法进行中医体质分类。为解决多标记分类方法中不同特征对分类标签的影响不同的问题,通过体质分类数据计算各特征项的信息增益,计算体质分类特征对分类标签的权重,进而通过加权的多标签分类器,得出体质数据多标记分类。结果与传统判别分析法相比,基于信息增益的多标记分类方法在1-错误率(16.33%)、汉明损失(15.44%)、平均准确率(82.61%)方面均有一定优势。结论基于信息增益的多标记分类方法在保证准确率同时可兼顾兼夹体质,实现对体质特征差异性及趋同性的更好描述。 Objective To propose a multi-label classification method of TCM constitutions based on information gain;To reduce the subjective error of traditional classification methods of TCM constitutions and take into account the combination of constitutions. Methods The multi-label method was used to classify TCM constitutions. In order to solve the problem that different features of multi-label classification method had different influence on the classification label, the information gain of each feature item was calculated by the physique classification data, and the weight of classification features were calculated. Then multi-label classification of physique data was obtained by weighted multi-label classifier. Results Compared with the traditional discriminant analysis method, the multi-label classification method based on information gain had certain advantages in 1-error rate (16.33%), hamming loss (15.44%), and average accuracy (82.61%). Conclusion The multi-label classification method based on information gain can ensure the accuracy. Taking into account the combination of constitutions can realize the better description of the difference in constitution characteristics and convergence.
作者 吕庆莉 LYU Qingli(Basic Medical College,Shaanxi University of Chinese Medicine,Xianyang 712046,China)
出处 《中国中医药信息杂志》 CAS CSCD 2019年第6期97-100,共4页 Chinese Journal of Information on Traditional Chinese Medicine
基金 国家自然科学基金(81503195) 陕西省教育厅重点实验室项目(16JS025) 陕西省科技厅项目(2014k14-02-02)
关键词 中医体质分类 信息增益 多标记分类 TCM constitutions information gain multi-label classification
  • 相关文献

参考文献10

二级参考文献62

  • 1张云涛,龚玲,王永成.An improved TF-IDF approach for text classification[J].Journal of Zhejiang University-Science A(Applied Physics & Engineering),2005,6(1):49-55. 被引量:4
  • 2龚静,周经野.一种基于多重因子加权的文本特征项权值计算方法[J].计算技术与自动化,2007,26(1):81-83. 被引量:10
  • 3GUILLAUMIN M, MENSINK T, VERBEEK J, et al. TagProp : discrim- inative metric learning in nearest neighbor models for image auto-an- notation[ C ]//Proc of International Conference of Computer Vision. 2009:309-316.
  • 4CLARE A, KING R D. Knowledge discovery in multi-label phenotype data [ C]//kecture Notes in Computer Science,vol 2168. 2001:42-53.
  • 5ELISSEEFF A,WESTON J. A kernel method for multi-labeled classifica- tion [ C]//Proc of Annual ACM Conference on Research and I)evelop- ment in Information Retrieval. New York:ACM Press.2005:274-281.
  • 6COMITE F D, GILLERON R,TOMMASI M. Learning multi-label al- ternating decision tree from texts and data [ C ]//Lecture Notes in Computer Science, vol 2734. 2003:35-49.
  • 7SCHAPIRE R E, SINGER Y. BoosTexter:a boosting-based system for text categorization [ J ]. Machine Learning, 2000, 39 ( 2/3 ) : 135- 168.
  • 8ELISSEEFF A, WESTON J. A kernel method for multi-labeled classi- fication [ C ]//Advances in Neural Information Processing Systems. Cambridge : MIT Press,2002:681 -687.
  • 9ZHANG Min-ling, ZHOU Zhi-hua ML-KNN : a lazy learning approach to multi-label learning [ J ]. Parttam Recognition, 2007,40 ( 7 ) : 2038-2048.
  • 10CHEN M S,HAN J H,YU P S. Data mining:an overview from a data- base perspective [ J]. IEEE Trans on Knowledge and Data Engi-.

共引文献178

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部