A Novel Optimized Language-Independent Text Summarization Technique

下载PDF

导出

摘要 A substantial amount of textual data is present electronically in several languages.These texts directed the gear to information redundancy.It is essential to remove this redundancy and decrease the reading time of these data.Therefore,we need a computerized text summarization technique to extract relevant information from group of text documents with correlated subjects.This paper proposes a language-independent extractive summarization technique.The proposed technique presents a clustering-based optimization technique.The clustering technique determines the main subjects of the text,while the proposed optimization technique minimizes redundancy,and maximizes significance.Experiments are devised and evaluated using BillSum dataset for the English language,MLSUM for German and Russian and Mawdoo3 for the Arabic language.The experiments are evaluated using ROUGE metrics.The results showed the effectiveness of the proposed technique compared to other language-dependent and languageindependent summarization techniques.Our technique achieved better ROUGE metrics for all the utilized datasets.The technique accomplished an F-measure of 41.9%for Rouge-1,18.7%for Rouge-2,39.4%for Rouge-3,and 16.8%for Rouge-4 on average for all the dataset using all three objectives.Our system also exhibited an improvement of 26.6%,35.5%,34.65%,and 31.54%w.r.t.The recent model contributed in the summarization of BillSum in terms of ROUGE metric evaluation.Our model’s performance is higher than the comparedmodels,especially in themetric results ofROUGE_2which is bi-gram matching.

作者 Hanan A.Hosni Mahmoud Alaaeldin M.Hafez

机构地区 Department of Computer Sciences Department of Information Systems

出处《Computers, Materials & Continua》 SCIE EI 2022年第12期5121-5136,共16页 计算机、材料和连续体（英文）

基金 This research was funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2022R113) Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.

关键词 Text summarization:language-independent summarization ROUGE

分类号 H31 [语言文字—英语]

引文网络
相关文献

参考文献1

1Shiming He,Zhuozhou Li,Yangning Tang,Zhuofan Liao,Feng Li,Se-Jung Lim.Parameters Compressing in Deep Learning[J].Computers, Materials & Continua,2020(1):321-336. 被引量：9

共引文献8

1刘艳,王田,彭绍亮,王国军,贾维嘉.基于边缘的联邦学习模型清洗和设备聚类方法[J].计算机学报,2021,44(12):2515-2528. 被引量：14
2Linbo Deng,Jinsong Gui,Tian Wang,Jiawei Tan,Xiong Li.An intelligent hybrid MAC protocol for a sensor-based personalized healthcare system[J].Digital Communications and Networks,2022,8(2):174-185. 被引量：1
3Jianming Zhang,Kai Wang,Yaoqi He,Lidan Kuang.Visual Object Tracking via Cascaded RPN Fusion and Coordinate Attention[J].Computer Modeling in Engineering & Sciences,2022(9):909-927.
4Vasumathi Devi Majety,N.Sharmili,Chinmaya Ranjan Pattanaik,ELaxmi Lydia,Subhi R.M.Zeebaree,Sarmad Nozad Mahmood,Ali S.Abosinnee,Ahmed Alkhayyat.Ensemble of Handcrafted and Deep Learning Model for Histopathological Image Classification[J].Computers, Materials & Continua,2022(11):4393-4406.
5Chao-Lung Yang,Yulius Harjoseputro,Yu-Chen Hu,Yung-Yao Chen.An Improved Transfer-Learning for Image-Based Species Classification of Protected Indonesians Birds[J].Computers, Materials & Continua,2022(12):4577-4593.
6Tahir Alyas,Khalid Alissa,Abdul Salam Mohammad,Shazia Asif,Tauqeer Faiz,Gulzar Ahmed.Innovative Fungal Disease Diagnosis System Using Convolutional Neural Network[J].Computers, Materials & Continua,2022(12):4869-4883.
7Dalwinder Singh,Deepak Prashar,Jimmy Singla,Arfat Ahmad Khan,Mohammed Al-Sarem,Neesrin Ali Kurdi.Intelligent Medical Diagnostic System for Hepatitis B[J].Computers, Materials & Continua,2022(12):6047-6068.
8Manar Ahmed Hamza,Aisha Hassan Abdalla Hashim,Heba G.Mohamed,Saud S.Alotaibi,Hany Mahgoub,Amal S.Mehanna,Abdelwahed Motwakel.Hyperparameter Tuned Deep Learning Enabled Intrusion Detection on Internet of Everything Environment[J].Computers, Materials & Continua,2022(12):6579-6594. 被引量：1

1Neeraj Kumar Sirohi,Mamta Bansal,S.N.Rajan.RETRACTED:Recent Approaches for Text Summarization Using Machine Learning&LSTM0[J].Journal on Big Data,2021,3(1):35-47.
2Ebrahim Heidary,Hamïd Parvïn,Samad Nejatian,Karamollah Bagherifard,Vahideh Rezaie.Automatic Persian Text Summarization Using Linguistic Features from Text Structure Analysis[J].Computers, Materials & Continua,2021(12):2845-2861. 被引量：1
3Ebrahim Heidary,Hamïd Parvïn,Samad Nejatian,Karamollah Bagherifard,Vahideh Rezaie,Zulkefli Mansor,Kim-Hung Pho.Automatic Text Summarization Using Genetic Algorithm and Repetitive Patterns[J].Computers, Materials & Continua,2021(4):1085-1101. 被引量：2
4Neeraj Kumar Sirohi,Mamta Bansal,S.N.Rajan.Retraction Notice to:Recent Approaches for Text Summarization Using Machine Learning&LSTM0[J].Journal on Big Data,2021,3(2):97-97.
5Sunqiang Hu,Xiaoyu Li,Yu Deng,Yu Peng,Bin Lin,Shan Yang.A Semantic Supervision Method for Abstractive Summarization[J].Computers, Materials & Continua,2021(10):145-158. 被引量：1
6Muhammad Yahya Saeed,Muhammad Awais,Muhammad Younas,Muhammad Arif Shah,Atif Khan,M.Irfan Uddin,Marwan Mahmoud.An Abstractive Summarization Technique with Variable Length Keywords as per Document Diversity[J].Computers, Materials & Continua,2021(3):2409-2423. 被引量：1
7Ahmad Hussein Ababneh.Investigating the Relevance of Arabic Text Classification Datasets Based on Supervised Learning[J].Journal of Electronic Science and Technology,2022,20(2):187-208. 被引量：1
8沈同平,金力,黄方亮,许欢庆.隐马尔可夫模型的优化及其用于多文本实体识别[J].安庆师范大学学报（自然科学版）,2022,28(2):31-35. 被引量：1
9Omar Badr.As-Salamu Alaykum From Beijing![J].Beijing Review,2021,64(49):48-48.
10黄兵(文/图).突破游戏与现实的界限明基莫比乌斯EX3415R电竞显示器[J].微型计算机,2022(12):76-79.

Computers, Materials & Continua

2022年第12期

浏览历史

内容加载中请稍等...

A Novel Optimized Language-Independent Text Summarization Technique

参考文献1

共引文献8

相关作者

相关机构

相关主题

浏览历史