期刊文献+

中文文本的信息自动抽取和相似检索机制 被引量:3

Mechanism of Automatic Extraction and Similar Retrieval for Chinese Texts
下载PDF
导出
摘要 目前信息抽取成为提供高质量信息服务的重要手段,提出面向中文文本信息的自动抽取和相似检索机制,其基本思想是将用户兴趣表示为语义模板,对关键字进行概念扩充,通过搜索引擎获得初步的候选文本集合,在概念触发机制和部分分析技术基础上,利用语义关系到模板槽的映射机制,填充文本语义模板,形成结构化文本数据库.基于文本数据表述的模糊性,给出用户查询与文本语义模板的相似关系,实现了相似检索,可以更加全面地满足用户的信息需求. The mechanism of information extraction and similar retrieval for Chinese texts is presented in this paper. Users' information interests are represented as semantic Template. The relevant texts are obtained by search engine under conceptual expansion of keywords. Based on conceptual trigger and sentences parser,the text semantic templates are filled in term of the mapping rules between semantic relationship and slots ,so the textual database is built. Considering the fuzzy information from natural language texts, the similarity measure between user's queries and text semantic templates are put forward. Moreover, the digital feature of text can be expanded by fuzzy mathematics and calculated about similarity. It is shows that the mechanism of extraction and retrieval can improve the efficiency of users' query and meet the more and more information demands.
出处 《小型微型计算机系统》 CSCD 北大核心 2007年第11期2074-2079,共6页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(6037309560673039)资助.
关键词 信息抽取语义模板概念扩充模糊语义 information extraction semantic templates conceptual expansion fuzzy semantic
  • 相关文献

参考文献5

二级参考文献34

  • 1[1]Nicholas Kushmerick. Wrapper induction: Efficiency and expressiveness. Artifical Intelligence 118 (2000): 15~68
  • 2[2]Ling Liu, Calton Pu, Wei Han. An XML-enabled data extraction toolkit for web sources. Information Systems 26 (2001): 563~583
  • 3[3]Armaud Sahuguet, Fabien Azavant. Building intelligent Web applications using lightweight wrappers. Data & knowledge Engineering 36 (2001): 283~286
  • 4[16]Hobbs J,Appelt D,Bear J et al.FASTUS:A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text[C].In:Roche,Schabes eds. Finite State Devices for Natural Language Processing, MIT Press,Cambridge MA, 1996
  • 5[17]Appelt D E.Introduction to Information Extraction[J].AI COMMUNICATIONS, 1999; 12(3)
  • 6[18]Yangarber R.Scenario Customization for Information Extraction[D].Ph D Thesis.New York University,2001-01
  • 7[19]Cowie J, Lehnert W.Information Extraction[J].Communications of the ACM, 1996;39(1)
  • 8[20]Grishman R Adaptive information extraction and sublangu age analysis[C].In:Proceedings of IJCAI-2001 Workshop on Adaptive Text Extraction and Mining,2001
  • 9[1]Applet D E,Israel D J.Introduction to Information Extraction Technology. A Tutorial for IJCAI-99,1999
  • 10[2]Gaizauskas R,Wilks Y.Information Extraction:Beyond Document Retrieval[J].Journal of Documentation, 1997

共引文献232

同被引文献63

引证文献3

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部