期刊文献+

基于通用词与术语部件的专利术语抽取 被引量:14

Patent Term Extraction Based on Generic Words and Term Components
下载PDF
导出
摘要 针对目前专利术语抽取中不能有效地过滤一些高频非术语词串和无法正确抽取低频术语的问题,本文提出基于通用词与术语部件的专利术语抽取方法。该方法首先使用通用词作为切分符选取候选术语;再利用与候选术语有相同术语部件的相似候选术语信息,评估候选术语成为术语的可能性。实验结果表明,与传统的方法相比,提出的方法能够有效地提高专利术语抽取的准确度。 Aiming at the problems that some high-frequency non-term strings cannot be effectively filtered and that low-frequency terms cannot be correctly extracted in patent term extraction, this paper proposes a patent term extraction method based on generic words and term components. The proposed method first takes advantage of generic words to select candidate terms. Then, candidate terms with the same term component as the target candidate term are used to evaluate the target candidate term. Experimental results show that the proposed method can effectively improve the accuracy of patent term extraction, when compared with the traditional methods.
作者 俞琰 赵乃瑄 Yu Yan;Zhao Naixuan(Information Service Department,Nanjing Tech University,Nanjing 210009;Computer Science Department,Southeast University Chengxian College,Nanjing 211816)
出处 《情报学报》 CSSCI CSCD 北大核心 2018年第7期742-752,共11页 Journal of the China Society for Scientific and Technical Information
基金 国家社会科学基金一般规划项目"大数据时代支持创新设计的多维度多层次专利文本挖掘研究"(17BTQ059)
关键词 专利文献分析 术语抽取 通用词 术语部件 patent literature analysis term extraction general word term component
  • 相关文献

参考文献30

二级参考文献301

共引文献275

同被引文献219

引证文献14

二级引证文献88

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部