摘要
为研究面向大规模网络数据的社会化问答系统(Social Question and Answer System,SocialQA).分别描述了问答系统的各个组成技术:1)问句预处理:问句分析和问句扩展.2)问句匹配.本文在1500万个网络问答数据集上,进行了问句匹配的实验.实验表明:在封闭测试中,问句匹配的准确率,达到了90%以上,在开放测试中,问句匹配的准确率达到了70%以上,很好地满足了系统的精度和实时性的要求.
To study the social question and answer system (Social QA) for large-scale web data, we discuss some component techniques for the social QA system, such as question pre-processing (question analysis and question semantic expansion) and question matching. Experiments on a set of 15 million question and answer pairs from the web were conducted. Results show that our system achieves the question matching accuracy over 90% in the closed test and that over 70% in the open test. It can be concluded that this system is efficient and can basically meet the real-time requirements.
出处
《哈尔滨工业大学学报》
EI
CAS
CSCD
北大核心
2008年第12期2011-2015,共5页
Journal of Harbin Institute of Technology
关键词
社会化问答系统
问句分析
问句匹配
social question and answer system
question analysis
question matching