Research on the Method of Judging Reference Document in Patent Invalidation Using GBDT (专利无效对比文件判定方法研究)
Cited by: 1
Abstract [Purpose/significance] Reference documents are key evidence for deciding whether a patent can be granted or should be declared invalid. Traditional information retrieval methods have shortcomings for this task, and few studies have applied machine learning to reference document retrieval. This paper therefore incorporates reference document information to build a patent relevance determination model. [Method/process] Experiments use target patents and reference documents drawn from patent invalidation decisions as the data set. Text similarity, co-occurring words, and co-word count features are extracted, and a GBDT model recasts reference document retrieval as a classification problem: deciding whether a candidate document is relevant. [Result/conclusion] Different fields contribute differently to classification performance. For the description field, precision, recall, and F1 are 79%, 48%, and 59%, respectively, and classification with the combined features is significantly better than with text similarity alone. Finally, the paper analyzes the misclassified cases and points out directions for future research.
Authors Guo Shiqi (郭诗琪); Yun Qiang (贠强); Chen Liang (陈亮); Zhou Jie (周杰) (Institute of Medical Information/Medical Library, CAMS & PUMC, Beijing 100020; Institute of Scientific and Technical Information of China, Beijing 100038)
Source Library and Information Service (《图书情报工作》), CSSCI, PKU Core, 2021, No. 2, pp. 117-125 (9 pages)
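Funding One of the research outputs of the National Key R&D Program of China project topic "Research and Application Demonstration of Intelligent Collection and Deep Processing Technology for Intellectual Property Information" (Grant No. 2017YFB1401902) and of the ISTIC key work program "Frontier Tracking and In-Depth Research in Key Science and Technology Fields" (Grant No. ZD2020-02).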
Keywords patent invalidity; prior art; feature selection; machine learning
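
The abstract names three kinds of pair features (text similarity, co-occurring words, and co-word count) that feed a GBDT classifier. The following is a minimal sketch of how such features could be computed for one target patent and one candidate reference document. It is not the authors' implementation: this record does not state the similarity measure, the word segmentation, or the GBDT settings, so the cosine similarity over raw term counts, the simple English tokenizer, and the names `tokenize`, `cosine_similarity`, and `pair_features` are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Assumption: lowercase alphanumeric tokens. Chinese patent text would
    # need a word segmenter instead; the record does not say which was used.
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(counts_a, counts_b):
    # Cosine similarity between two term-count vectors (assumed measure).
    dot = sum(counts_a[t] * counts_b[t] for t in counts_a.keys() & counts_b.keys())
    norm_a = math.sqrt(sum(v * v for v in counts_a.values()))
    norm_b = math.sqrt(sum(v * v for v in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def pair_features(target_text, reference_text):
    # Build the three feature types for one (target patent, candidate) pair.
    counts_a = Counter(tokenize(target_text))
    counts_b = Counter(tokenize(reference_text))
    shared = sorted(counts_a.keys() & counts_b.keys())
    return {
        "similarity": cosine_similarity(counts_a, counts_b),
        "shared_terms": shared,        # co-occurring words
        "shared_count": len(shared),   # co-word count
    }


features = pair_features("a valve with a spring", "a spring loaded valve")
print(features["shared_terms"], features["shared_count"])
```

In a setup like the one the abstract describes, one such feature row would be computed per field (for example, the description) and per patent pair, then passed with a relevant/not-relevant label to a gradient-boosted tree classifier such as scikit-learn's `GradientBoostingClassifier`.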