期刊文献+

大数据研究综述 被引量:344

Overview of Big Data Research
下载PDF
导出
摘要 2010年,全球数据量跨入了ZB时代,据IDC预测,至2020年全球将拥有35ZB的数据量,大量数据实时地影响我们工作、生活,甚至国家经济、社会发展,大数据时代已经到来。大数据具有数据量巨大、数据类型多样、流动速度快和价值密度低的特点,大数据技术为我们分析问题和解决问题提供了新的思路和方法,其研究渐渐成为热点。阐述了大数据的相关概念、特点、大数据技术特别是在数据挖掘方面国内外发展状况以及我们在大数据时代面临的挑战。通过综述,对大数据有一个全面的认识,为下一步研究打下基础。 In 2010,the quantity of data reached ZB level.According to IDC,there will be at least 35 zettabytes of stored data in 2020.Massive data are affecting our life,even the economy and the development of the society.The Big Data era has already come.There are four defining characteristics of Big Data: volume,variety,velocity and value.It is often referred to them as 'the 4Vs'.The Big Data technology will offer new ideas and methods,which is becoming popular.Introductions to Big Data and Big Data technology with particular emphasis on Data Mining were given.There will be a comprehensive understanding of Big Data and lay a foundation for further study.
出处 《系统仿真学报》 CAS CSCD 北大核心 2013年第S1期142-146,共5页 Journal of System Simulation
基金 国家自然科学基金(61174156 61174035 61273189)
关键词 大数据 大数据技术 数据挖掘 挑战 big data big data technology data mining challenge
  • 相关文献

参考文献3

二级参考文献42

  • 1[OL].<http://hadoop.apache.org.>.
  • 2WinterCorp: 2005 TopTen Program Summary. http:// www. wintercorp, com/WhitePapers/WC TopTenWP. pdf.
  • 3TDWI Checklist Report: Big Data Analytics. http://tdwi. org/research/2010/08/Big-Data-Analytics, aspx.
  • 4Chaudhuri S, Dayal U. An overview of data warehousing and OLAP technology. SIGMOD Rec, 1997,26(1): 65-74.
  • 5Madden S, DeWitt D J, Stonebraker M. Database parallelism choices greatly impact scalability. DatabaseColumn Blog. http://www, databasecolumn, com/2007/10/database-parallelism-choices, html.
  • 6Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters//Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI ' 04). San Francisco, California, USA, 2004: 137-150.
  • 7DeWitt D J, Gerber R H, Graefe G, Heytens M L, Kumar K B, Muralikrishna M. GAMMA--A high performance dataflow database machine//Proceedings of the 12th International Conference on Very Large Data Bases (VLDB' 86). Kyoto, Japan, 1986:228-237.
  • 8Fushimi S, Kitsuregawa M, Tanaka H. An overview of the system software of a parallel relational database machine// Proceedings of the 12th International Conference on Very Large DataBases(VLDB'86). Kyoto, Japan, 1986:209-219.
  • 9Brewer E A. Towards robust distributed systems//Proceedings of the 19th Annual ACM Symposium on Principles of Distributed Computing (PODC' 00). Portland, Oregon, USA, 2000:7.
  • 10http: //www. dbms2, com/2008/08/26/known-applications of mapreduce/.

共引文献668

同被引文献2738

引证文献344

二级引证文献2752

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部