The Design and Implementation of a Distributed Patent Information Extraction System

Cited by: 2
Abstract: As an important source of patent information, the Derwent database offers researchers rich resources, but its export format is quite limited and contains only abstract-level information, which hinders deeper analysis. This article designs and implements a distributed Derwent patent information extraction system based on a multi-agent platform. The system imports patent information into a local database and automatically retrieves detailed patent information from the USPTO database. Its extraction efficiency is relatively high, and it gives patent researchers a better way to obtain information.
Source: New Technology of Library and Information Service (《现代图书情报技术》), indexed in CSSCI and the Peking University Core Journals list; 2013, No. 7, pp. 114-121 (8 pages)
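Funding: One of the research outcomes of the National Key Technology R&D Program project "Construction and Integrated Application Demonstration of a Knowledge Management System for the Enterprise Innovation Application Chain" (Grant No. 2012BAH34F00), the National Social Science Foundation of China major project "Research on Future-Oriented Analysis Theories and Methods for Emerging Technologies and Industrial Innovation" (Grant No. 11&ZD140), and the Beijing Natural Science Foundation project "Research on Theories, Methods, and Key Technologies for Chinese Patent Infringement Detection and Analysis" (Grant No. 9132005).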
Keywords: Patent extraction; Load balance; Multi-Agent system; Derwent patent database
References (10)

  • 1. Derwent Innovations Index [DB/OL]. [2013-04-02]. http://www.thomsonscientific.com.cn/productsservices/diimedia/.
  • 2. USPTO Patent Full-Text and Image Database [DB/OL]. [2013-04-02]. http://patft.uspto.gov/netahtml/PTO/search-bool.html.
  • 3. Bedi P, Chawla S. Agent Based Information Retrieval System Using Information Scent [J]. Journal of Artificial Intelligence, 2010, 3(4): 220-238.
  • 4. Pavlin G, de Oude P, Maris M, et al. A Multi-Agent Systems Approach to Distributed Bayesian Information Fusion [J]. Agent-Based Information Fusion, 2010, 11(3): 267-282.
  • 5. Jumadinova J, Dasgupta P. A Multi-Agent System for Analyzing the Effect of Information on Prediction Markets [J]. International Journal of Intelligent Systems, 2011, 26(5): 383-409.
  • 6. Zhang Jun, Chen Honggang. Research on a Multi-Agent-Based Real-Time ETL System Model [J]. Information Technology, 2010, 34(2): 71-73. (in Chinese) Cited by: 5
  • 7. Zhai Dongsheng, Yang Yang. A USPTO Patent Extraction System Based on XML Technology [J]. Journal of Beijing University of Technology, 2011, 37(4): 628-633. (in Chinese) Cited by: 1
  • 8. Kunz T. The Influence of Different Workload Descriptions on a Heuristic Load Balancing Scheme [J]. IEEE Transactions on Software Engineering, 1991, 17(7): 725-730.
  • 9. Bahi J M. Dynamic Load Balancing and Efficient Load Estimators for Asynchronous Iterative Algorithms [J]. IEEE Transactions on Parallel and Distributed Systems, 2005, 16(4): 289-299.
  • 10. Wang Chunjuan. Analysis and Research of Web Cluster Load-Balancing Algorithms [J]. Computer Knowledge and Technology, 2008, 3(26): 1623-1624. (in Chinese)
