期刊文献+

互联网环境下的英文同义术语自动发现研究与系统实现 被引量:4

Study on Automatic English Synonym Terms Discovery from Web and the System Implementation
原文传递
导出
摘要 以英文同义术语为例,提出三种有效的自动获取互联网术语资源的技术手段,包括语法模式的自学习,在线同义词典的抽取,静态同义术语分类的爬取。在此基础上,设计并实现互联网同义术语检索原型系统(WebSynonym Searcher)。实验测试表明,从互联网中自动获取同义术语是一种非常有前景的途径。 There are extremely abundant synonym term resources in the Web. Three effective approaches have been proposed in this paper, which are syntactical pattern learning, online synonym dictionary extraction, and static synonym category crawling. On this basis, a prototype system, Web Synonym Term Searcher, has been implemented. The experimental results show it is a promising way to automatically obtain synonym terms from the Web.
出处 《图书情报工作》 CSSCI 北大核心 2012年第22期26-31,共6页 Library and Information Service
基金 国家社会科学基金资助项目“基于知识组织的术语服务研究”(项目编号:11CTQ018) 国家高技术研究发展计划(863计划)“以科技文献服务为主的搜索引擎研制”(项目编号:2011AA01A206)子课题“资源整合及知识组织技术研究”研究成果之一
关键词 同义术语 互联网 语法模式 在线词典 系统实现 synonym term Web syntactical pattern online dictionary system implementation
  • 相关文献

参考文献14

  • 1Pantel P, Lin Dekang. Discovering word senses from text [ C ]// Proceedings of SIGKDD Conference on Knowledge Discovery and Data Mining. Edmonton: ACM Press, 2002 : 613 - 619.
  • 2陈建超,郑启伦,李庆阳,严桂夺.基于特征词关联性的同义词集挖掘算法[J].计算机应用研究,2009,26(7):2517-2519. 被引量:10
  • 3van der Plas L, Tiedemann J. Finding synonyms using automatic word alignment and measures of distributional similarity[ C ]//Pro- ceedings of 44th Annual Meeting of the Association for Computa- tional Linguistics. Sydney: Association for Computer Linguistics Press. 2006 : 866 - 873.
  • 4张书娟,董喜双,关毅.基于电子商务用户行为的同义词识别[J].中文信息学报,2012,26(3):79-85. 被引量:2
  • 5Tao Cheng, Lauw H W, Paparizos S. Entity synonyms for struc- tured Web search[J]. IEEE Transactions on Knowledge and Data Engineering, 2012,24(10) : 1862 - 1875.
  • 6陆勇,侯汉清.基于模式匹配的汉语同义词自动识别[J].情报学报,2006,25(6):720-724. 被引量:20
  • 7吴云芳,石静,金澎.基于图的同义词集自动获取方法[J].计算机研究与发展,2011,48(4):610-616. 被引量:13
  • 8Masato H, Yasuhiro O, Katsuhiko T. Supervised synonym acquisi- tion using distributional features and syntactic patterns [ J ]. Infor- mation and Media Technologies, 2009, 4(2) : 558 -582.
  • 9Kaji N, Kitsuregawa M. Using hidden markov random fields to combine distributional and pattern - based word clustering [ C ]// Proceedings of the 22nd International Conference on Computational Linguistics. Stroudsburg: Association for Computational Linguistics Press, 2008:401 -408.
  • 10Snow R, Jurafsky D, Ng A. Learning syntactic patterns for auto- matic 'hypemym discovery [ C ]//Proceedings of 17th International Conference on Neural Information Processing Systems. Vancouver: MIT Press, 2004 : 1297 - 1304.

二级参考文献51

  • 1陆勇,侯汉清.用于信息检索的同义词自动识别及其进展[J].南京农业大学学报(社会科学版),2004,4(3):87-93. 被引量:25
  • 2刘华梅,侯汉清.基于情报检索的汉语同义词识别初探[J].情报理论与实践,2005,28(4):373-375. 被引量:11
  • 3刘磊,曹存根,王海涛,陈威.一种基于“是一个”模式的下位概念获取方法[J].计算机科学,2006,33(9):146-151. 被引量:18
  • 4宋明亮.汉语词汇字面相似性原理与后控制词表动态维护研究[J].情报学报,1996,15(4):261-271. 被引量:19
  • 5FELLBARUM C. WordNet: an electronic lexical database[ M]. Massachusetts: MIT Press, 1998.
  • 6PANTERL P,LIN D. Discovering word senses from text[ C]//Proc of the 8th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. New York: ACM Press, 2002:613- 619.
  • 7OLIVIER F. Discovering word senses from a network of lexical cooccurences[ C ]//Proc of the 20th International Conference on Computational Linguistics. 2004 : 1326- 1332.
  • 8WU Jie, LUO Bei, CAO Cun-gen, et al. Acquisition and verification of mereological knowledge from Web page texts[ J]. Journal of East China University of Science and Technology: Natural Science Edition, 2006,32( 11) :1310- 1317.
  • 9HAN Jia-wei, KAMBER M. Data mining: concepts and techniques [ M ]. [ S. l. ] :Morgan Kaufmann Publishers, 2001.
  • 10AGRAWAL R, SRIKANT R. Fast algorithms for mining association rules[ C ]//Proc of the 20th International Conference on Very Large Databases. Santiago: [ s. n. ] , 1994:487- 499.

共引文献77

同被引文献39

  • 1林正军,杨忠.一词多义现象的历时和认知解析[J].外语教学与研究,2005,37(5):362-367. 被引量:123
  • 2陆勇,侯汉清.基于模式匹配的汉语同义词自动识别[J].情报学报,2006,25(6):720-724. 被引量:20
  • 3化柏林.知识抽取中的停用词处理技术[J].现代图书情报技术,2007(8):48-51. 被引量:39
  • 4Oliveira H G, Gomes P. Automatic Discovery of Fuzzy Synsets from Dictionary Definitions [ C] //Proceedings of the Twenty -Second international joint conference on Artificial Intelligence -Volume Volume Three. AAAI Press, 2011 : 1801-1806.
  • 5Jacob Perkins. Python Text Processing with NLTK 2. 0 Cook-bookfM]. Birmingham: Packt Publishing,2010:25-30.
  • 6Lexical Tools[EB/OL]. [2014-02-23]. http://lexsrv3. nlm, nih. gov/LexSysGroup/Projects/lvg/current/web/index, html.
  • 7Lexical Programs[ EB/OL]. [2014-02-23 ]. http://lexsrv3. nlm. nih. gov/LexSysGroup/Projects/lvg/current/docs/user-Doc/references/greenbook. 4.8. html.
  • 8Lexical Tools[EB/OL]. [2014-02-23]. http://lexsrv3.nlm. nih. gov/LexSysGroup/Projects/lvg/2014/docs/userDoc/tools/ norm. html.
  • 9王世清,吴雯娜,常春.叙词表编制中等同关系获取方法 [C]//戴维民,赵建华,汪东波,贺德方.网络环境下信息组 织的创新与发展:全国第五次情报检索语言发展方向研讨会 论文集.北京:国家图书馆出版社,2009: 114-119.
  • 10同义关系抽取结果评测[EB/OL]l[2014-12-29].http://tcci.ccf.org.cn/conference/2012/dldoc/2012语义关系评测结果.pdf.

引证文献4

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部