摘要
【目的】通过考虑专利权利要求特征,提高专利关键词抽取准确性。【方法】挖掘出专利权利要求中技术特征间的限定关系,将限定关系融入基于图的专利关键词抽取方法中,以抽取专利关键词。【结果】在USPTO专利数据集和Baiten专利数据集上进行实验,实验结果表明所提方法的MRR指标较传统的TextRank方法分别相对提升了31.79%(USPTO)和33.81%(Baiten)。【局限】实验分析的数据需要进一步扩大。【结论】融入专利权利要求的限定关系信息能够显著提高专利关键词抽取的准确性。
[Objective]This paper tries to improve the accuracy of patent keyword extraction with the characteristics of patent claims.[Methods]We examined the restriction relationship between technical features of patent claims.Then,we integrated these relationship into the patent keyword extraction method based on graph.[Results]We examined our model with the USPTO and Baiten data sets for patents.The MRR index of our method was 31.79%(USPTO)and 33.81%(Baiten)higher than the traditional Text Rank method.[Limitations]The data of our experimental analysis need to be further expanded.[Conclusions]The proposed method could significantly improve the accuracy of patent keyword extraction.
作者
俞琰
朱晟忱
Yu Yan;Zhu Shengchen(Institute of the Information Management and Technology,Nanjing Tech University,Nanjing 210009,China)
出处
《数据分析与知识发现》
CSSCI
CSCD
北大核心
2022年第10期57-67,共11页
Data Analysis and Knowledge Discovery
基金
国家社会科学基金项目(项目编号:17BTQ059)的研究成果之一。
关键词
抽取
限定关系
权利要求
TextRank
Patent Keyword Extraction
Restriction Relationship
Claim
Text Rank