Abstract
Based on an analysis of current research on term semantic similarity, this paper classifies similarity measures for scientific and technical terms into two groups: measures based on text corpora and measures based on open knowledge resources. It then reviews algorithms that integrate multiple similarity measures. It also summarizes applications of term semantic similarity in natural language processing (NLP) and knowledge mining (KM), and discusses directions for future research, as a reference for building more efficient term similarity computation systems.
Source
《现代图书情报技术》
CSSCI
Peking University Core Journals (北大核心)
2010, No. 7, pp. 51-57 (7 pages)
New Technology of Library and Information Service
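Funding
One of the research outcomes of the project "Mining Term Similarity from Scientific and Technical Literature and Its Application in Knowledge Discovery" (Grant No. 09YJC870031), funded by the Humanities and Social Sciences Research Program of the Ministry of Education of China.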
Keywords
Term semantic similarity
Similarity measure
Phrase similarity