期刊文献+

基于互联网和self-training的中文问答模式学习 被引量:2

Chinese question answering pattern learning based on self-training mechanism and Web
下载PDF
导出
摘要 在已有的问答模式学习中,模式定义和候选答案评分偏于简单,而且学习过程依赖于人工标定语料。通过挖掘W eb文本中动、名词序列的骨架模式,用以扩充模式定义;将self-train ing学习机制引入问答模式学习:用一对训练语料进行初始学习,通过互联网搜索,自动选择可靠程度较高的问答对,重新训练;扩充了启发规则,改进候选答案的评分方法。实验结果表明:所提出的问答模式学习方法能有效地提高中文问答系统的性能。 In the past, the learning for QA pattern relies on the labeled data, and the definition of pattern and the scoring method for the candidate answers are over simplified. The verb and noun sequence was extracted as the skeleton pattern to expand definition of QA pattern. In the learning process, a learning mechanism was established based on self-training. At first, the initial study was completed on a labeled QA pair, then the system would automatically select the reliable data for self training through searching in the Web while the system was running. The scoring method of the candidate answers was also improved by applying several heuristic rules. The experimental results show that the performance of Chinese QA system based on our method is improved significantly.
出处 《计算机应用》 CSCD 北大核心 2008年第6期1575-1577,1581,共4页 journal of Computer Applications
基金 国家自然科学基金资助项目(60603027) 天津市应用基础研究计划资助项目(05YFJMJC11700)
关键词 互联网 问答模式 SELF-TRAINING 机器学习 Web QA pattern self-training machine learning
  • 相关文献

参考文献10

  • 1吴友政,赵军,徐波.基于无监督学习的问答模式抽取技术[J].中文信息学报,2007,21(2):69-76. 被引量:9
  • 2SOUBBOTIN M M, SOUBBOTIN S M. Use of patterns for detection of likely answer strings: A systematic approach [ C]// Proceedings of the 11th Text Retrieval Conference (TREC-11). Gaithersburg, Maryland: NIST Special Publication, 2002:325 - 331.
  • 3RAVICHANDRAN D, HOVY E. Learning surface text patterns for a question answering [ C]// Proceedings of the 40th Annual Meeting on Association for Computational Linguistics ( ACL 2002). Philadelphia, PA: Association for Computational Linguistics, 2002:41 - 47.
  • 4ZHANG D, LEE W S. Web based pattern mining and matching approach to question answering [ C]// Proceedings of the 11th Text Retrieval Conference ( TREC-11). Gaithersburg, Maryland: NIST, 02:505-512.
  • 5ROUSSINOV D, ROBLES J. Web question answering through automatically learned patterns [ C]// Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries (JCDL). New York:ACM Press, 2004:347-348.
  • 6郑逢斌,陈志国,姜保庆,乔保军.语义校对系统中的句子语义骨架模糊匹配算法[J].电子学报,2003,31(8):1138-1140. 被引量:7
  • 7ZHU XIAO - JUN. Semi - supervised learning literature survey, TR 1530 [ R]. University of Wisconsin-Madison: Department of Computer Sciences, 2006.
  • 8AGICHTEIN E, GRAVANO L. Snowball: Extracting relations from large plain-text collections [ C]//Proceedings of the 5th ACM International Conference on Digital Libraries. New York: ACM Press, 2000:85 - 94.
  • 9RILOFF E, WIEBE J, WILSON T. Learning subjective nouns using extraction pattern bootstrapping [ C]// Proceedings of the 7th Conference on Natural Language Learning at HLT-NAACL 2003 ( CONLL). Morristown, N J: Association for Computational Linguistics, 2003, 4:25-32.
  • 10PONTE J M, CROFT W B. A language modeling approach to information retrieval [ C] // Proceedings of the 21 st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, 1998:275 -281.

二级参考文献20

  • 1Christopher D Manning, Hinrich Schutze. Foundations of Statistical Natural Language Processing[M] .nassachusetts:MTT Press, 1999.
  • 2Dekang Lin. Extracting Collocations from Text Corpora[ D ]. Canada:Department of Computer Science University of Manitoba, 1998.
  • 3R Garside, G Leech, T McEnery. Corpus Annotation:Linguistic Information from Computer Text Corpora [ C].London:Longman,1997.
  • 4Deepak Ravichandran, Eduard Hovy. Learning Surface Text Patterns for a Question Answering[A]. In:Proceeding of the ACL2002 Conference[C]. Philadelphia, PA, July, 2002.
  • 5Dekang Lin, Patrick Pantel. Discovery of Inference Rules for Question Answering[J]. In: Natural Language Engineering, volume 7, 343-360.
  • 6Hui Yang, Tat-Seng Chua. The Integration of Lexical Knowledge and EXternal Resources for Question Answering[A]. In: the Eleventh Text REtrieval Conference[C]. Maryland: USA, 2002. 155-161.
  • 7M.M. Soubbotin, S.M. Soubbotin. Use of Patterns for Detection of Likely Answer Strings: A Systematic Approach[A]. In: the Eleventh Text Retrieval Conference [ C ]. Gaithersburg, Maryland: November 2002.
  • 8Moldovan, D., Harabagio, S., Girju, R., et al. LCC Tools for Question Answering[A]. NIST Special Publication: SP 500-251 The Eleventh Text Retrieval Conference[C].
  • 9Yongping Du, Xuanjing Huang, Xin Li, Lide Wu. A Novel Pattern Learning Method for Open Domain Question Answering[A]. In: the Proceedings of IJCNLP2004[C]. Sanya: China.
  • 10Susan Dumais, Michele Banko, Eric Brill, Jimmy Lina nd Andrew Ng. Web Question Answering: Is More Always Better? [A] In: the Proceeding of 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval[C]. Tampere,Finland, 2002.

共引文献14

同被引文献9

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部