Supervised topic models with weighted words: multi-label document classification (cited by: 1)



Abstract: Supervised topic modeling algorithms have been successfully applied to multi-label document classification tasks. Representative models include labeled latent Dirichlet allocation (L-LDA) and dependency-LDA. However, these models neglect the class frequency information of words (i.e., the number of classes in which a word occurs in the training data), which is significant for classification. To address this, we propose a method, namely the class frequency weight (CF-weight), that weights words using class frequency knowledge. The CF-weight is based on the intuition that a word with a higher (lower) class frequency is less (more) discriminative. In this study, the CF-weight is used to improve L-LDA and dependency-LDA. A number of experiments have been conducted on real-world multi-label datasets. Experimental results demonstrate that CF-weight based algorithms are competitive with the existing supervised topic models.
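The class frequency idea in the abstract can be illustrated with a small sketch. The abstract does not give the CF-weight formula, so the weighting below (`1 + log(num_classes / cf)`, an IDF-style form computed over classes) is an assumption made purely for illustration and is not the authors' definition. The function names `class_frequency` and `cf_weights`, the toy corpus, and the choice to credit a word to every label of a multi-label document are likewise hypothetical.

```python
import math
from collections import defaultdict


def class_frequency(docs):
    """Count, for each word, the number of distinct classes it occurs in.

    docs: iterable of (tokens, labels) pairs, where labels is a set of
    class names. Assumption: a word in a multi-label document is counted
    as occurring in every one of that document's classes.
    """
    classes_per_word = defaultdict(set)
    for tokens, labels in docs:
        for word in set(tokens):
            classes_per_word[word].update(labels)
    return {word: len(classes) for word, classes in classes_per_word.items()}


def cf_weights(docs, num_classes):
    """Illustrative weight: 1 + log(num_classes / class frequency).

    NOT the formula from the paper; it only encodes the stated intuition
    that a higher class frequency means a less discriminative word.
    """
    cf = class_frequency(docs)
    return {word: 1.0 + math.log(num_classes / n) for word, n in cf.items()}


# Toy training set with three classes: sports, politics, business.
train = [
    (["goal", "match", "team"], {"sports"}),
    (["election", "vote", "team"], {"politics"}),
    (["market", "vote", "team"], {"business", "politics"}),
]

cf = class_frequency(train)
weights = cf_weights(train, num_classes=3)
```

In this toy corpus, "team" appears under all three classes and so receives the smallest weight, while "goal" appears under one class only and receives the largest. How such weights enter L-LDA or dependency-LDA inference is not described in the abstract and is not sketched here.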

Authors: Yue-peng ZOU, Ji-hong OUYANG, Xi-ming LI

Affiliations: College of Computer Science and Technology; MOE Key Laboratory of Symbolic Computation and Knowledge Engineering

Source: Frontiers of Information Technology & Electronic Engineering (Chinese title: 信息与电子工程前沿（英文版）), indexed in SCIE, EI, and CSCD; 2018, No. 4, pp. 513-523 (11 pages)

Funding: Project supported by the National Natural Science Foundation of China (No. 61602204)

Keywords: Supervised topic model; Multi-label classification; Class frequency; Labeled latent Dirichlet allocation (L-LDA); Dependency-LDA

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]

