Question-Answering Pair Matching Based on Question Classification and Ensemble Sentence Embedding

Abstract: Question-answering (QA) models find answers to a given question. The need to find answers automatically is growing, because doing so over large-scale QA data sets is both important and challenging. In this paper, we deal with the QA pair matching approach in QA models, which finds the most relevant stored question, and its recommended answer, for a given question. Existing studies of this approach search either the entire data set or only the data within a category that the question writer specifies manually. In contrast, we aim to find the category to which the question belongs automatically, by employing a text classification model, and then to find the answer corresponding to the question within that category. The text classification model effectively reduces the search space for finding the answer to a given question. As a result, the proposed model improves the accuracy of the QA matching model and significantly reduces the model inference time. Furthermore, to improve the performance of finding similar sentences in each category, we present an ensemble embedding model for sentences, which performs better than the individual embedding models. We evaluate the performance of the proposed QA matching model using real-world QA data sets. The accuracy of our final ensemble embedding model based on the text classification model is 81.18%, which outperforms the existing models by 9.81 to 14.16 percentage points. Moreover, in terms of inference speed, our model is 2.61 to 5.07 times faster than the existing models, owing to the effective reduction of the search space by the text classification model.
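The two-stage idea described in the abstract (classify the question, then match only within the predicted category using an ensemble of sentence embeddings) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the abstract does not name the classifier, the embedding models, or how the ensemble combines them. The toy data, the nearest-centroid `classify`, the two bag-of-features embedders, and the similarity averaging in `ensemble_similarity` are hypothetical stand-ins, not the authors' method.

```python
import math
from collections import Counter

# Toy QA pairs grouped by category (illustrative data, not from the paper).
QA_PAIRS = {
    "billing": [
        ("how do i get a refund for my order", "Request a refund from the Orders page."),
        ("why was my card charged twice", "Duplicate charges are reversed within five days."),
    ],
    "account": [
        ("how do i reset my password", "Use the 'Forgot password' link on the sign-in page."),
        ("how can i change my email address", "Edit your email under Account settings."),
    ],
}


def word_embed(text):
    """Stand-in embedder 1: bag of lowercase words."""
    return Counter(text.lower().split())


def char_trigram_embed(text):
    """Stand-in embedder 2: bag of character trigrams."""
    padded = f"  {text.lower()} "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(v * b[k] for k, v in a.items())  # Counter yields 0 for missing keys
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


EMBEDDERS = (word_embed, char_trigram_embed)


def ensemble_similarity(q1, q2):
    """Assumed ensemble rule: average the cosine similarity of each embedder."""
    return sum(cosine(e(q1), e(q2)) for e in EMBEDDERS) / len(EMBEDDERS)


def classify(question):
    """Stand-in classifier: pick the category whose word centroid is closest."""
    centroids = {
        cat: sum((word_embed(q) for q, _ in pairs), Counter())
        for cat, pairs in QA_PAIRS.items()
    }
    query = word_embed(question)
    return max(centroids, key=lambda cat: cosine(query, centroids[cat]))


def match(question):
    """Classify first, then search only the predicted category's QA pairs."""
    category = classify(question)
    best_q, best_a = max(
        QA_PAIRS[category], key=lambda pair: ensemble_similarity(question, pair[0])
    )
    return category, best_q, best_a


if __name__ == "__main__":
    print(match("i forgot my password how do i reset it"))
```

Restricting the candidate set to one category is what shrinks the search space: `match` compares the query against the pairs of a single category rather than every stored pair, which is the source of the speed-up the abstract reports. A misclassified question cannot be recovered in this sketch, so the classifier's accuracy bounds the matching accuracy.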

Authors: Jae-Seok Jang, Hyuk-Yoon Kwon

Affiliations: Department of Computer Science and Engineering; Department of Industrial Engineering/Graduate School of Data Science

Source: Computer Systems Science & Engineering (SCIE, EI), 2023, No. 9, pp. 3471-3489 (19 pages)

Funding: This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2022R1F1A1067008) and by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (No. 2019R1A6A1A03032119).

Keywords: question answering; text classification model; data augmentation; text embedding

Classification: H31 [Language and Linguistics: English]
