期刊文献+

基于统计的词语相关度网络自动构建方法研究 被引量:2

Research on Automatic Building of Word Correlation Net Based on Statistic
下载PDF
导出
摘要 词语语义知识库对于扩大自然语言理解的深度具有重要的意义。目前较为成熟的WordNet、HowNet、同义词词林等均为人工开发,对知识的描述较为准确,但开发的工作量巨大,实际应用存在很多困难。为了更加自动化、实证性地获取中文词语相互关联状况的知识,该文提出词语相关度的概念以及基于统计的词语相关度计算方法,并以此为基础构建一个基于强领域特性中文词语的词语相关度网络,设计数组分割的硬盘存储方法,使该任务涉及到的海量数据的分析处理可以在目前的个人PC上完成。最终获得的词语语义知识具备经验主义方法的优点,准确性、泛化性较强,可以在文本分类、检索、过滤等领域发挥重要作用。 Semantic knowledge-base has important meaning for increasing the deepness of NLU.Some comparatively mature Semantic knowledge-base such as WordNet,HowNet and Tongyicicilin was developed by manpower,and has many difficulties on actual application.In order to capture Chinese word knowledge of relating status more automatically and demonstrably,this paper presented the concept of word correlation and a calculation method of word correlation based on statistic.Then a correlation net based on Chinese words which have strong domain characteristic was built.In order to resolve the difficulty of processing the huge amount of data,a hard disk storing method of array segmentation was designed.The semantic knowledge gained by the experiment had the advantage of empiricism.It is veracity and generalization is strong so it can play an important role in many fields such as text categorization,text retrieval,text filtering,etc.
出处 《计算机与数字工程》 2012年第2期15-18,86,共5页 Computer & Digital Engineering
基金 海军工程大学自然科学基金引导项目(编号:HGDYDJJ10008)资助
关键词 词语相关度 词语相关度网络 语义词典 word correlation word correlation net semantic knowledge-base
  • 相关文献

参考文献3

二级参考文献33

共引文献94

同被引文献28

  • 1董振东,董强.知网和汉语研究[J].当代语言学,2001,3(1):33-44. 被引量:56
  • 2金博,史彦军,滕弘飞.基于语义理解的文本相似度算法[J].大连理工大学学报,2005,45(2):291-297. 被引量:79
  • 3朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:325
  • 4章志凌,虞立群,陈奕秋,罗海飞,邵晓敏.基于Corpus库的词语相似度计算方法[J].计算机应用,2006,26(3):638-640. 被引量:17
  • 5董振东,董强.知网[DB/OL].[2009-03-15].http://www.keenage.com.
  • 6周学广,任延珍,孙艳,等.信息内容安全[M].武汉:武汉大学出版社,2012.
  • 7Fellbaum C. WordNet : An Electronic Lexical Data- base [ M]. Cambridge & MIT Press, 1998.
  • 8Richardson S D, Dolan W B, Vanderwende L. Mind- Net: Acquiring and structuring semantic information from text[DB/OL]. [2013-04-11]. http://acl, ldc. upenn, edu/P/P98/P98-2180, pd f.
  • 9Baker C F, Fillmore C J, Lowe J B. The Berkeley FrameNet Project[DB/OL]. [2013404-11]. http:acl. ldc. upenn, edu/C/C98/C98-1013, pd f.
  • 10王惠,詹卫东,俞士汶.现代汉语语义词典规格说明书[J/OL].[2013-04-02].http://ccl.pku.edu.cn/doubtfir8/papers/2003_semdict_specification_wang-huizwd.pdf.

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部