期刊文献+

网上数字档案大数据分析中的知识挖掘技术研究 被引量:36

下载PDF
导出
摘要 促进情报和档案信息交流、共享已成为当前开展深度信息服务的发展趋势,网上各类档案大数据信息正成为开发和利用的新型资源。大数据时代的到来,给传统的数据分析技术带来了极大的挑战,归纳、比较在大数据背景下知识挖掘技术的发展趋势和特点,探讨大数据知识挖掘处理中的数据集成、数据存储、数据分析、语义处理与可视化数据挖掘问题,研究深层次知识挖掘的方法和技术,结合档案网站知识服务功能研究,为开展网上数字档案大数据的分析挖掘提供了启示与参考。 Information exchange and sharing of information and archive has become the ongoing trends in depth information services. All kinds of online data of file are becoming the development and utilization of new resources. Big Data era,it has brought great challenges to the traditional data analysis techniques. It is very important to summarize, compare knowledge in the context of big data mining technology development trends and characteristics. The data mining problems of data processing with knowledge mining,data integration,data storage,data analysis,semantic processing and visualization. Combined with the knowledge to carry out the archives website services research,knowledge mining methods and techniques has been researched. It is to provide inspiration and reference to carry out online digital archive of the analysis of big data mining.
出处 《浙江档案》 北大核心 2013年第10期14-19,共6页 Zhejiang Archives
关键词 大数据 档案信息服务 知识挖掘 信息技术 Big Data Archival Information Services Knowledge Mining IT
  • 相关文献

参考文献17

二级参考文献314

  • 1[OL].<http://hadoop.apache.org.>.
  • 2WinterCorp: 2005 TopTen Program Summary. http:// www. wintercorp, com/WhitePapers/WC TopTenWP. pdf.
  • 3TDWI Checklist Report: Big Data Analytics. http://tdwi. org/research/2010/08/Big-Data-Analytics, aspx.
  • 4Chaudhuri S, Dayal U. An overview of data warehousing and OLAP technology. SIGMOD Rec, 1997,26(1): 65-74.
  • 5Madden S, DeWitt D J, Stonebraker M. Database parallelism choices greatly impact scalability. DatabaseColumn Blog. http://www, databasecolumn, com/2007/10/database-parallelism-choices, html.
  • 6Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters//Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI ' 04). San Francisco, California, USA, 2004: 137-150.
  • 7DeWitt D J, Gerber R H, Graefe G, Heytens M L, Kumar K B, Muralikrishna M. GAMMA--A high performance dataflow database machine//Proceedings of the 12th International Conference on Very Large Data Bases (VLDB' 86). Kyoto, Japan, 1986:228-237.
  • 8Fushimi S, Kitsuregawa M, Tanaka H. An overview of the system software of a parallel relational database machine// Proceedings of the 12th International Conference on Very Large DataBases(VLDB'86). Kyoto, Japan, 1986:209-219.
  • 9Brewer E A. Towards robust distributed systems//Proceedings of the 19th Annual ACM Symposium on Principles of Distributed Computing (PODC' 00). Portland, Oregon, USA, 2000:7.
  • 10http: //www. dbms2, com/2008/08/26/known-applications of mapreduce/.

共引文献3109

同被引文献251

引证文献36

二级引证文献159

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部