期刊文献+

基于句法结构特征分析及分类技术的答案提取算法 被引量:24

An Answer Extraction Algorithm Based on Syntax Structure Feature Parsing and Classification
下载PDF
导出
摘要 由于中文自然语言处理的特点和困难以及相应的语言处理基础资源的相对缺乏,使得国外一些成熟技术和研究成果不能直接应用到中文问答系统中.为此,针对中文事实型问答系统,提出一种新的基于句法结构特征分析及分类技术的答案提取算法,该方法将答案提取问题看成是候选答案的分类问题,即将候选答案分类为正确和错误两类.首先,该方法根据与问题类型所对应的候选答案的类型信息,从文本片断中提取出候选答案及其在句子中的简单特征和句法结构特征;然后利用这些特征训练分类器;最后用训练得到的分类器判别候选答案是否为正确答案.针对中文事实性问题,该方法与目前典型的基于模式匹配的中文答案提取算法相比,准确率提升6.2%,MRR提升9.7%. Due to the feature and difficulty of Chinese natural language processing and the lack of related resources, some foreign mature techniques can not be applied in Chinese Question Answering (QA) system. For the Chinese factoid QA system, a new answer extraction method based on syntax structure feature parsing and classification is presented in this paper. With the method, the answer extraction is regarded as candidate answer classification problem,i, e. candidate answers are classified into correct and incorrect answer. According to the part-of-speech information of candidate answers corresponding to question types, the candidate answers and their features (both simple and syntactic) in sentences from snippets are firstly extracted. Then these features are used to train the classifier. Finally, the trained classifier is used to distinguish whether the candidate answer is correct or not. For Chinese factoid questions, comparing to currently typical pattern matching based answer extraction algorithm, the new method improves precision by 6.2% and MRR by 9.7%.
出处 《计算机学报》 EI CSCD 北大核心 2008年第4期662-676,共15页 Chinese Journal of Computers
基金 国家自然科学基金(60573090)资助~~
关键词 句法依存分析 分类 答案提取 中文问答系统 事实性问题 syntax dependency parsing classification answer extraction Chinese Question Answering(QA) system factoid questions
  • 相关文献

参考文献15

  • 1Moldovan D I, Pasca M, Harabagiu S M, Surdeanu M. Performance issues and error analysis in an open-domain question answering system. ACM Transactions on Information Systems (TOIS), 2003, 21(2): 133-154
  • 2吴友政,赵军,徐波.基于无监督学习的问答模式抽取技术[J].中文信息学报,2007,21(2):69-76. 被引量:9
  • 3Sun Ang, Jiang Ming-Hu, Ma Yan-Jun. A maximum entropy model based answer extraction for Chinese question answering. Fuzzy Systems and Knowledge Discovery, 2006, 4223:1239-1248
  • 4Sun Ang, Jiang Ming-Hu, Ma Yan-Jun. An instance-based approach for pinpointing answers in Chinese question answering//Proceedings of the 8th International Conference on Signal Processing (ICSP2006). Beijing, China. 2006
  • 5Ittycheriah, Roukos S. IBM's statistical question answering system -TREC 11//Voorhees E M, Buckland Lori P. NIST Special Publication: SP 500-251. Department of Commerce, National Institute of Standards and Technology, Gaithersburg, Maryland, 2002
  • 6Yang Hui, Chua Tat-Seng. The integration of lexical knowledge and external resources for question answering//Voorhees E M, Buckland Lori P. NIST Special Publication: SP 500- 251. Department of Commerce, National Institute of Standards and Technology. Gaithersburg, Maryland, 2002
  • 7Li Xiao-Yan, Croft W Bruce. Evaluating question-answering techniques in Chinese//Proceedings of the 1st International Conference on Human Language Technology Research. Morristown, NJ, USA, 2001:1-6
  • 8Lin De-Kang, Pantel Patrick. Discovery of inference rules for question-answering. Natural Language Engineering, 2001, 7 (4): 343-360
  • 9Zhang Dell, Lee Wee Sun. Web based pattern mining and matching approach to question answering//Voorhees E M, Buckland Lori P. NIST Special Publication: SP 500-251. Department of Commerce, National Institute of Standards and Technology, Gaithersburg, Maryland, 2002
  • 10Brill Eric, Lin Jimmy, Banko Michele, Dumais Susan, Ng Andrew. Data-intensive question answering//Voorhees E M, Harman D K. NIST Special Publication 500-250. Department of Commerce, National Institute of Standards and Technology, Gaithersburg, Maryland, 2001:363-370

二级参考文献14

  • 1Deepak Ravichandran, Eduard Hovy. Learning Surface Text Patterns for a Question Answering[A]. In:Proceeding of the ACL2002 Conference[C]. Philadelphia, PA, July, 2002.
  • 2Dekang Lin, Patrick Pantel. Discovery of Inference Rules for Question Answering[J]. In: Natural Language Engineering, volume 7, 343-360.
  • 3Hui Yang, Tat-Seng Chua. The Integration of Lexical Knowledge and EXternal Resources for Question Answering[A]. In: the Eleventh Text REtrieval Conference[C]. Maryland: USA, 2002. 155-161.
  • 4M.M. Soubbotin, S.M. Soubbotin. Use of Patterns for Detection of Likely Answer Strings: A Systematic Approach[A]. In: the Eleventh Text Retrieval Conference [ C ]. Gaithersburg, Maryland: November 2002.
  • 5Moldovan, D., Harabagio, S., Girju, R., et al. LCC Tools for Question Answering[A]. NIST Special Publication: SP 500-251 The Eleventh Text Retrieval Conference[C].
  • 6Yongping Du, Xuanjing Huang, Xin Li, Lide Wu. A Novel Pattern Learning Method for Open Domain Question Answering[A]. In: the Proceedings of IJCNLP2004[C]. Sanya: China.
  • 7Susan Dumais, Michele Banko, Eric Brill, Jimmy Lina nd Andrew Ng. Web Question Answering: Is More Always Better? [A] In: the Proceeding of 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval[C]. Tampere,Finland, 2002.
  • 8Dell Zhang, Wee Sun Lee. Web Based Pattern Mining and Matching Approach to Question Answering[A].In:the Proceeding of TREC-11 [C]. Gaithersburg,MD, 2002.
  • 9Regina Barzilay and Noemie Elhadad. Sentence Alignment for Monolingual Comparable Corpora [A]. In:the Proceedings of EMNLP2003[C]. Sapporo, Japan.25-32.
  • 10Y. Shinyama, S. Sekine, K. Sudo, R. Grishman.Automatic Paraphrase Acquisition from News Articles[A]. In: the Proceedings of Human Language Technology Conference[C]. San Diego, USA, 2002.

共引文献12

同被引文献210

引证文献24

二级引证文献167

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部