期刊文献+

农作物信息抽取系统的设计与实现 被引量:5

Design and Realization of the System of Farm Crop Information Extraction
下载PDF
导出
摘要 研究了特定领域的文本的信息抽取,主要考虑了文本分布的观点。首先从未标注的语料中学习主题和主题间的关系,然后把它应用在同领域的文本信息抽取。经测试,其信息抽取的效果有所提高。 This paper studies information extraction of special domain and mainly considers the view of text distribution. First, it studies topic and relation of topic from un-annotated corpus, then applies it in text information extraction of the same domain. The experiment indicates that the result of the method improves a lot than previous ones to some extent.
出处 《计算机工程》 CAS CSCD 北大核心 2006年第7期197-198,220,共3页 Computer Engineering
基金 国家"863"计划基金资助项目(2001AA4031) 国家自然科学基金资助项目(60473139) 山西省自然科学基金资助项目(20051034)
关键词 主题 信息抽取 聚类 K近邻 Topic Information extraction Clustering K-means
  • 相关文献

参考文献4

  • 1Barzilay R,Lee L.Catching the Drift:Probabilistic Content Models,with Applications to Generation and Summarization[C].HLT-NAACL 2004:Proceedings of the Main Conference,2004:113-120.
  • 2张宇,刘挺.基于改进贝叶斯模型的问题分类[C].第一届全国信息检索与内容安全学术会议,2004.
  • 3孙即祥.现代模式识别[M].长沙:国防科技大学出版社,2003..
  • 4郑家恒,王兴义,李飞.信息抽取模式自动生成方法的研究[J].中文信息学报,2004,18(1):48-54. 被引量:22

二级参考文献5

  • 1[1]Ellen Riloff. Automatically Constructing a Dictionary for Information Extraction Tasks[C]. In: Proceedings of the Eleventh National Conference on Artificial Intelligence, 811-816. AAAI Press/ The MIT Press, 1993.
  • 2[2]Stephen Soderland, David Fisher, Jonathan Aseltine, and Wendy Lehnert. CRYSTAL: Inducing a conceptual dictionary[C]. In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1314-1319, 1995.
  • 3[3]Ellen Riloff. Automatically Generating Extraction Patterns from Untagged Text[C]. In: Proceedings of Thirteenth National Conference on Artificial Intelligence (AAAI-96), 1044-1049. 1996.
  • 4[4]Ellen Riloff, Rosie Jones. Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping[C]. In: Proceedings of the Sixteenth National Conference on Artificial Intelligence (AAAI-99), Orlando FL. 1999.
  • 5[5]Roman Yangarber, Ralph Grishman, Pasi Tapanainen and Silja Huttunen. Unsupervised Discovery of Scenario-Level Patterns for Information Extraction[C]. In: Proceedings of Sixth Applied Natural Language Processing Conference (ANLP-2000), 282-289, Seattle WA. 2000.

共引文献28

同被引文献55

  • 1罗贝,吴洁,曹存根,邵志清.从文本中获取植物知识方法的研究[J].计算机科学,2005,32(10):6-13. 被引量:13
  • 2向阳,王敏,马强.基于Jena的本体构建方法研究[J].计算机工程,2007,33(14):59-61. 被引量:33
  • 3中国植物志编辑委员会.中国植物志[M].北京:科学出版社,1959.
  • 4Taylor A. Extracting Knowledge from Biological Descriptions[ C ]. In: Proceedings of the 2nd International Conference on Building and Sharing Very Large - Scale Knowledge Bases. 1995 : 114 - 119.
  • 5Vanel J M. Worldwide Botanical Knowledge Base [ EB/OL]. [2011 - 10 - 11]. http://wwbota, free. fr/.
  • 6Wood M M, Lydon S J, Tablan V, et al. Using Parallel Texts to Improve Recall in IE [ C ]. In: Proceedings of lnternational Confer- ence on Recent Advances in Natural Language Processing (RAN- LP). Amsterdam : John Benjamins, 2004:70 - 77.
  • 7沙丽华.面向领域文档的语义标注方法研究[D].长春:吉林大学,2009.
  • 8Sautter G, Bohm K, Agosti D. A Combining Approach to Find all Taxon Names [ J ]. Biodiversity lnformatics, 2006 ( 3 ) :46 - 58.
  • 9Tang X Y, Heidorn P B. Using Automatically Extracted Information in Species Page Retrieval[ EB/OL ]. [ 2011 - 08 - 10 ]. ht- tp ://www. tdwg. org/proceedings/article/view/195/.
  • 10Soderland S. Learning Information Extraction Rules for Semi - Structured and Free Text[ J]. Machine Learning, 1999, 34 (1 - 3 ) : 233 - 272.

引证文献5

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部