Semi-supervised stance detection based on category-aware curriculum learning


Abstract: Pseudo-label generation is an effective strategy for semi-supervised stance detection. In practice, the quality of generated pseudo-labels varies, yet existing work treats all pseudo-labels as equally reliable and does not fully consider how category imbalance affects pseudo-label quality. To address these two problems, a Semi-supervised stance Detection model based on Category-aware curriculum Learning (SDCL) is proposed. First, a pre-trained classification model generates pseudo-labels for unlabeled tweets. Second, the tweets are sorted within each category by pseudo-label quality, and the top k high-quality tweets of each category are selected. Finally, the tweets selected from all categories are merged and re-sorted, and the re-sorted tweets with their pseudo-labels are fed back into the classification model to further optimize its parameters. Experimental results show that, compared with SANDS (Stance Analysis via Network Distant Supervision), the best-performing baseline, the proposed model improves the macro-averaged F1 (Mac-F1) score on the StanceUS dataset by 2, 1, and 3 percentage points under three splits (500, 1000, and 1500 labeled tweets in total, respectively), and improves the Mac-F1 score on the StanceIN dataset by 1 percentage point under each of the three splits, validating the effectiveness of the proposed model.
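The selection step described in the abstract (sort within each category, keep the top k, merge, re-sort) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how pseudo-label quality is measured or in which direction the merged set is ordered, so the sketch assumes quality is the classifier's confidence score and that the merged tweets are ordered from most to least confident. The names `PseudoLabeled` and `select_curriculum`, the labels, and the sample data are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, NamedTuple


class PseudoLabeled(NamedTuple):
    text: str
    label: str         # pseudo-label predicted by the pre-trained classifier
    confidence: float  # ASSUMPTION: quality proxy, e.g. max class probability


def select_curriculum(samples: List[PseudoLabeled], k: int) -> List[PseudoLabeled]:
    """Keep the top-k most confident samples per category, then merge and re-sort."""
    # Group pseudo-labeled tweets by their predicted category.
    by_label: Dict[str, List[PseudoLabeled]] = defaultdict(list)
    for s in samples:
        by_label[s.label].append(s)

    # Within each category, rank by quality and keep at most k samples,
    # so a frequent category cannot crowd out a rare one.
    selected: List[PseudoLabeled] = []
    for group in by_label.values():
        group.sort(key=lambda s: s.confidence, reverse=True)
        selected.extend(group[:k])

    # ASSUMPTION: the merged set is ordered from high to low confidence.
    selected.sort(key=lambda s: s.confidence, reverse=True)
    return selected


# Hypothetical imbalanced pool: three "favor" tweets, two "against" tweets.
pool = [
    PseudoLabeled("t1", "favor", 0.95),
    PseudoLabeled("t2", "favor", 0.90),
    PseudoLabeled("t3", "favor", 0.85),
    PseudoLabeled("t4", "against", 0.70),
    PseudoLabeled("t5", "against", 0.60),
]
curriculum = select_curriculum(pool, k=2)
print([s.text for s in curriculum])  # ['t1', 't2', 't4', 't5']
```

In this toy pool, a category-blind top-4 selection would take t1, t2, t3, and t4, leaving the minority category with a single example. The per-category cut keeps two tweets from each category, which is the effect the category-aware selection is meant to achieve.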

Authors: GAO Zhaoze (高肇泽); ZHU Xiaofei (朱小飞); XIANG Nengqiang (项能强) (College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054, China)

Affiliation: College of Computer Science and Engineering, Chongqing University of Technology

Source: Journal of Computer Applications (《计算机应用》), CSCD, PKU Core, 2024, No. 10, pp. 3281-3287 (7 pages)

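Funding: Natural Science Foundation of Chongqing (CSTB2022NSCQ-MSX1672); Major Project of the Science and Technology Research Program of Chongqing Municipal Education Commission (KJZD-M202201102); Chongqing University of Technology University-level Joint Funding Project (gzlcx20233248).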

Keywords: semi-supervised stance detection; category imbalance; curriculum learning; pseudo-label generation

Classification code: TP391.1 [Automation and Computer Technology: Computer Application Technology]

