期刊文献+

基于改进的RAKEL算法的心电图诊断分类

ECG diagnostic classification based on improved RAKEL algorithm
下载PDF
导出
摘要 心电图(ECG)数据通常包含多种病症,而ECG诊断是一个典型的多标签分类问题。在多标签分类方法中,RAKEL算法将标签集随机分解为若干个大小为k的子集,并建立LP分类器进行训练;然而由于没有充分考虑标签间的相关性,LP分类器中容易产生一些标签组合所对应样本稀少的情况,从而影响预测性能。为了充分考虑标签间的相关性,提出一种基于贝叶斯网络的RAKEL算法BN-RAKEL。首先利用贝叶斯网络找到标签间的相关性,确定候选标签子集;然后对每个标签采用基于信息增益的特征选择算法确定其最优特征空间,并针对每个候选标签子集利用最优特征空间相似性来检测其相关程度,以确定最终的具有强相关性的标签子集;最后在标签子集的最优特征空间上训练LP分类器。在实际的ECG数据集上,与多标签K近邻(ML-KNN)、RAKEL、CC和基于FP-Growth的RAKEL算法FI-RAKEL进行对比,结果显示所提算法在召回率和F-score上最少提高了3.6个百分点和2.3个百分点。实验结果表明,BN-RAKEL算法有较好的预测性能,能有效提升ECG诊断的准确性。 ElectroCardioGram(ECG)data usually contain many diseases,and ECG diagnosis is a typical multi-label classification problem.In RAndom k-labELsets(RAKEL)algorithm,one of multi-label classification methods,all labels are randomly decomposed into several labelsets of size k,and a Label Powerset(LP)classifier is established for training;however,the lack of sufficient consideration of correlation between labels makes the LP classifier obtain quite few samples corresponding to certain label combinations,which affects the prediction performance.To fully consider the correlation between labels,a Bayesian Network-based RAKEL(BN-RAKEL)algorithm was proposed.Firstly,the correlation between labels was found by Bayesian network to determine the candidate labelsets.Then,a feature selection method based on information gain was applied to construct the optimal feature space for each label,and the optimal feature space similarity was used for each candidate label subset to detect its correlation degree,determing the final labelsets with strong correlation.Finally,the LP classifiers were trained in the optimal feature space of the corresponding labelsets.A comparison with K-Nearest Neighbors for Multi-label Learning(ML-KNN),RAKEL,Classifier Chains(CC)and FP-Growth based RAKEL algorithm named FI-RAKEL on the real ECG dataset showed that the proposed algorithm achieved a minimum improvement of 3.6 percentage points and 2.3percentage points in recall and F-score,respectively.Experimental results show that BN-RAKEL algorithm has good prediction performance,and can effectively improve the ECG diagnosis accuracy.
作者 赵静 韩京宇 钱龙 毛毅 ZHAO Jing;HAN Jingyu;QIAN Long;MAO Yi(School of Computer Science,Nanjing University of Posts and Telecommunications,Nanjing Jiangsu 210023,China)
出处 《计算机应用》 CSCD 北大核心 2022年第6期1892-1897,共6页 journal of Computer Applications
基金 国家自然科学基金资助项目(62002174)。
关键词 心电图 多标签 标签相关性 贝叶斯网络 信息增益 特征选择 RAKEL算法 ElectroCardioGram(ECG) multi-label label correlation Bayesian network information gain feature selection RAndom k-labELsets(RAKEL)algorithm
  • 相关文献

参考文献9

二级参考文献127

  • 1苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859. 被引量:383
  • 2Matthew R B, Luo Jie-bo, Shen Xi-peng, et al. Learning multi-la- bel scene classification[J]. Pattern Recognition, 2004(37) : 1757- 1771.
  • 3Zhang Min-ling, Zhou Zhi-hua. MI-kNN: A lazy learning ap- proach to multi-label learning [J]. Pattern Recognition, 2007 (40) : 2038-2048.
  • 4Xu Xin-shun, Jiang Yuan, Peng Liang, et al. Ensemble approach based on conditional random field for multi-label image and video annotation[C]//Proceedings of the 19th ACM international conference on Multimedia. Scottsdale, Arizona, USA, 2011: 1377-1380.
  • 5Wang Jing-dong, Zhao Ying-hai, Wu Xiu-qing, et al. A transduc- tive multi-label learning approach for video concept detection [J]. Pattern Recognition, 2011(44) : 10-11.
  • 6Snoek C, Worring M, Gemert J V, et al. The challenge problem for automated detection of 101semantic concepts in multimedia [C]//Proceedings of the ACM Multimedia. Santa Barbara, CA, USA, 2006 : 421-430.
  • 7Clare A,King R. Knowledge discovery in multblabel phenotype data[C]//Proceedings of the 5th European Conference on Prin- ciples of Data Mining and Knowledge Discovery (PKDD). Freiburg, Germany, 2001 : 42-53.
  • 8Elisseeff A, Weston J. A kernel method for multi-labelled classifi cation[J]. Advances in Neural Information Processing Systems, 2001(14) : 681-687.
  • 9Blockeel H, Schietgat L, Struyf J, et al. Decision trees for hierar- chical multilabel classification: A case study in functional ge- nomics[C] // Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 4213 LNAI. 2006:18-29.
  • 10Cesa-Bianchi N, Gentile C, Zaniboni L. Hierarchical classifica- tion:combining bayes with svra[C] // Proceedings of the 23rd International Conference on Machine learning. 2006:177-184.

共引文献122

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部