
Low Time Complexity Short Text Classification Based on Fusion of BERT and Broad Learning
Abstract: To address the low efficiency and limited accuracy of short text classification (STC), a high-efficiency, high-accuracy text classification model is proposed that combines bidirectional encoder representations from transformers with a broad learning classifier (BERT-BL). BERT is first fine-tuned to update its parameters; the fine-tuned BERT then maps each short text to its word vector matrix, which is fed into the broad learning (BL) classifier to complete the classification. Experimental results show that the BERT-BL model achieves the best accuracy on three public datasets, while requiring only a few tenths of the training time of the baseline models: support vector machine (SVM), long short-term memory (LSTM), minimum p-norm broad learning (p-BL), and BERT. Moreover, its training does not require a high-performance GPU. Comparative analysis shows that BERT-BL not only performs well on STC tasks but also saves substantial training time.
Authors: CHEN Xiaojiang; YANG Xiaoqi; CHEN Guanghao; LIU Wuying (Information Department, Jieyang Campus of Guangdong Open University, Jieyang 522095, Guangdong, China; School of Information Science and Technology, Guangdong University of Foreign Studies, Guangzhou 510006, Guangdong, China; Department of Software Engineering, Software Engineering Institute of Guangzhou, Guangzhou 510990, Guangdong, China; Shandong Key Laboratory of Language Resources Development and Application, Ludong University, Yantai 264025, Shandong, China; Center for Linguistics and Applied Linguistics, Guangdong University of Foreign Studies, Guangzhou 510420, Guangdong, China)
Source: Journal of Shandong University (Engineering Science), 2024, No. 4, pp. 51-58, 66 (9 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Funding: Ministry of Education New Liberal Arts Research and Reform Practice Project (2021060049); Ministry of Education Humanities and Social Sciences Research Youth Fund Project (20YJC740062); Ministry of Education Humanities and Social Sciences Research Planning Fund Project (20YJAZH069); Shandong Province Graduate Education and Teaching Reform Research Project (SDYJG21185); Shandong Province Undergraduate Teaching Reform Research Key Project (Z2021323); Shanghai Philosophy and Social Sciences "13th Five-Year" Planning Project (2019BYY028); Guangzhou Science and Technology Program Project (202201010061).
Keywords: short text classification; BERT-BL; BERT; broad learning; high accuracy
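The abstract attributes BERT-BL's speed and CPU-only training to the broad learning classifier, whose output weights are solved in closed form (ridge regression over random feature and enhancement nodes) rather than by gradient descent. Below is a minimal NumPy sketch of such a classifier, assuming sentence embeddings (e.g., BERT [CLS] vectors) are already available; the function names, node counts, and regularization value are illustrative assumptions, not the paper's actual configuration.

```python
import numpy as np

def fit_broad_learning(X, y, n_feature=64, n_enhance=128, lam=1e-3, seed=0):
    """Minimal broad-learning-style classifier (illustrative sketch).

    X: (n, d) array of sentence embeddings (e.g., BERT [CLS] vectors).
    y: (n,) array of integer class labels.
    """
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    Wf = rng.standard_normal((d, n_feature))
    Z = np.tanh(X @ Wf)                       # mapped feature nodes
    We = rng.standard_normal((n_feature, n_enhance))
    H = np.tanh(Z @ We)                       # enhancement nodes
    A = np.hstack([Z, H])                     # concatenated node outputs
    Y = np.eye(int(y.max()) + 1)[y]           # one-hot targets
    # Closed-form ridge solution for the output weights:
    # no gradient descent and no GPU needed, hence the low training cost.
    W = np.linalg.solve(A.T @ A + lam * np.eye(A.shape[1]), A.T @ Y)
    return Wf, We, W

def predict(params, X):
    """Classify embeddings with the fitted broad learning parameters."""
    Wf, We, W = params
    Z = np.tanh(X @ Wf)
    H = np.tanh(Z @ We)
    return np.argmax(np.hstack([Z, H]) @ W, axis=1)
```

Because the only trained quantity is the linear readout W, obtained from one regularized least-squares solve, training time is dominated by a single matrix factorization rather than many epochs of backpropagation.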