期刊文献+

开放知识资源的元数据自动采集策略研究 被引量:11

A Research on Metadata's Auto Collecting Strategy of Open Knowledge Resources
原文传递
导出
摘要 在对开放知识资源的内容和特点进行调研分析的基础上,研究开放知识资源的采集需求。并以专家遴选出的种子数据源为实证,总结分析不同数据源的特点,最终研究形成三种元数据自动采集策略:基于OAI标准元数据收割协议的策略、基于抽取普通动态网页的策略、基于解析RSS源接口的策略。 The paper is aimed at studying collecting needs of open knowledge resources based on investigating its content and features. Then, the authors experiment with the seed data sources selected by experts so as to summarize and analyze the characteristics of different data sources. Finally, three kinds of metadata's collecting strategy are put forward as follows: the strategy based on OAI standard metadata harvesting protocol, the strategy based on extracting common dynamic web pages, the strategy based on parsing RSS Feeds interface.
出处 《图书馆学研究》 CSSCI 北大核心 2013年第12期47-51,共5页 Research on Library Science
基金 中国科学院文献情报能力专项项目"开放知识资源登记系统(二期)"的研究成果之一
关键词 开放知识资源 元数据 采集策略 网页抽取 OAI协议 RSS源 open knowledge resources metadata collecting strategy web pages extraction OAI protocol RSS feeds
  • 相关文献

参考文献14

  • 1综合科技资源集成登记系统(IRSR)[EB/OL].[2013-03-20].http://irsr.Ilas.ac.cn.
  • 2OpenDOAR [ EB/OL]. [2013 -02 -01 ]. http: //opendoar. org/.
  • 3ROAR [ EB/OL]. [2013 -02-01 ]. http: //roar. eprints, org/.
  • 4OpenAIRE [ EB/OL]. [2013 -02 -01 ]. http: //www. openaire, eu/.
  • 5The World Bank OKR [ EB/OL]. [2013 -02 -05]. https: //openknowledge. worldbank, org/.
  • 6中国科学院机构知识库网格(CASIRGRID)[EB/OL].[2013-03-20].http://www.irgrid.ac.cn/.
  • 7IESR [ EB/OL] . [2013 -02 -01 ]. http: //iesr. ac. uk/.
  • 8NARCIS [ EB/OL]. [2013 -02-20]. http: //www. narcis, nl/? Language = en.
  • 9SciELO [ EB/OL]. [2013 -02 -20]. http: //www. scielo, org/php/index, php? lang = en.
  • 10刘兰,吴振新,张智雄,徐麒.Web Archive的采集策略研究[J].现代图书情报技术,2009(1):10-15. 被引量:26

二级参考文献30

  • 1宛玲,张晓林.数字资源长期保存过程中的知识产权问题分析[J].中国图书馆学报,2005,31(3):65-69. 被引量:57
  • 2孟涛,闫宏飞,王继民.Web网页信息变化的时间局部性规律及其验证[J].情报学报,2005,24(4):398-406. 被引量:8
  • 3Kelly B. Approaches to the Preservation of Web Sites[ EB/OL]. [ 2008 -06 - 11 ]. http ://www. ukoln. ac. uk/.
  • 4Online Australian Publications: Selection Guidelines for Archiving and Preservation by the National Library of Australia [ EB/OL ]. [2008 -06 - 11 ]. http://pandora.nla. gov. au/selectionguidelines. html.
  • 5Michael Day. Collecting and Preserving the World Wide Web: A Feasibility Study Undertaken for the JISC and Wellcome Trust [ J/OL]. [2008 -06 - 11]. http://www. jisc. ac. uk/uploaded_documents/archiving_feasibility. pdf.
  • 6The Internet Archive Web Archive [ EB/OL]. [ 2008 -06 - 11 ]. http ://wa. archive, org/aroundtheworld/index. new. html.
  • 7WebArchiv-Archive of the Czech Web [ EB/OL ]. [ 2008 - 06 - 11 ]. http ://en. webarchiv. cz/thematic_collections.
  • 8MINERVA [ EB/OL]. [ 2008 - 06 - 11 ]. http ://www.loc. gov/MINERVA/presentations. html.
  • 9The Australian Web Domain Harvests: A Preliminary Quantitative Analysis of the Archive Data [ J/OL ]. [ 2008 - 05 - 16 ]. http :// pandora. nla. gov. au/documents/auscrawls. pdf.
  • 10Junghoo Cho, Alexandros Ntoulas. Effective Change Detection Using Sampling [ C ]. Proceedings of 28th /ntemationa/ Confer- ence on Very Large Database, Hongkong, China: Morgan Kauf- mann, August 2002.

共引文献27

同被引文献146

引证文献11

二级引证文献62

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部