News Modeling and Retrieving Information: Data-Driven Approach


Abstract: This paper develops machine learning algorithms to classify electronic news articles related to COVID-19 through information retrieval and topic modelling. The methodology has three phases: the Text Classification Approach (TCA), the Proposed Algorithms Interpretation (PAI), and finally the Information Retrieval Approach (IRA). The TCA covers the text preprocessing pipeline, referred to as a clean corpus. For feature extraction, the study examines the pre-trained Global Vectors for Word Representation (GloVe) model, FastText, Term Frequency-Inverse Document Frequency (TF-IDF), and Bag-of-Words (BOW). The PAI presents the Bidirectional Long Short-Term Memory (Bi-LSTM) and Convolutional Neural Network (CNN) models used to classify COVID-19 news. The IRA gives the mathematical interpretation of Latent Dirichlet Allocation (LDA), which is used for topic modelling in Information Retrieval (IR). K-fold cross-validation of Bi-LSTM with GloVe yielded 99% accuracy. The study also compares Deep Learning and Machine Learning approaches in terms of feature extraction and computational complexity, and explores text analyses and the most influential aspects of each document. Bidirectional Encoder Representations from Transformers (BERT) was also used as a Deep Learning mechanism in model training, but its results were not satisfactory. The proposed system can nonetheless be adapted to real-time classification of COVID-19 news.
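As an illustration of the BOW and TF-IDF features named in the abstract, the following is a minimal sketch in plain Python. It is not the authors' pipeline: the toy corpus, the function names `bag_of_words` and `tf_idf`, and the unsmoothed weighting (term count divided by document length, times log(N/df)) are all assumptions chosen for brevity. Libraries commonly use smoothed variants.

```python
import math
from collections import Counter


def bag_of_words(tokens, vocabulary):
    """Raw term counts for one document, ordered by the vocabulary."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]


def tf_idf(corpus):
    """Return (vocabulary, TF-IDF rows) for a list of tokenized documents.

    Assumed variant: tf = count / document length, idf = log(N / df).
    Documents are assumed to be non-empty.
    """
    vocabulary = sorted({term for doc in corpus for term in doc})
    n_docs = len(corpus)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for doc in corpus for term in set(doc))
    rows = []
    for doc in corpus:
        counts = Counter(doc)
        rows.append([
            (counts[term] / len(doc)) * math.log(n_docs / doc_freq[term])
            for term in vocabulary
        ])
    return vocabulary, rows


# Hypothetical toy corpus, already lower-cased and tokenized.
corpus = [
    "covid vaccine trial results".split(),
    "covid cases rise again".split(),
    "stock market results rise".split(),
]
vocabulary, weights = tf_idf(corpus)
counts = bag_of_words(corpus[0], vocabulary)
```

In this sketch a term that appears in only one document, such as "vaccine", receives a higher weight than a term shared across documents, such as "covid", which is the property that makes TF-IDF more discriminative than raw counts for classification.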

Authors: Elias Hossain, Abdullah Alshahrani, Wahidur Rahman

Affiliations: Electrical & Computer Engineering; Department of Computer Science and Artificial Intelligence; Department of Computer Science and Engineering

Source: Intelligent Automation & Soft Computing, 2023, Issue 11, pp. 109-123 (15 pages)

Keywords: COVID-19; news retrieving; data-driven; machine learning; BERT; topic modelling

Classification code: TP3 [Automation and Computer Technology: Computer Science and Technology]



