期刊文献+

大数据的概念、特征及其应用 被引量:385

The Concept,Characteristics and Application of Big Data
下载PDF
导出
摘要 随着互联网的飞速发展,特别是近年来随着社交网络、物联网、云计算以及多种传感器的广泛应用,以数量庞大,种类众多,时效性强为特征的非结构化数据不断涌现,数据的重要性愈发凸显,传统的数据存储、分析技术难以实时处理大量的非结构化信息,大数据的概念应运而生。如何获取、聚集、分析大数据成为广泛关注的热点问题。介绍大数据的概念与特点,分别讨论大数据的典型的特征,分析大数据要解决的相关性分析、实时处理等核心问题,最后讨论大数据可能要面临的多种挑战。 With the rapid development of the Internet,especially the wide application ofsocial networking,the Internet of Things,cloud computingas well as a variety of sensorsin recent years,unstructured data,which have large numbers,varieties and also timeliness,continue to emerge.The importance of the data becomes more prominent.It is difficultto use the traditional data storage and analysis technology to handle large volumes of unstructured information in a real-time manner,andthat's how the concept of big data came into being.How to obtain,aggregate and analyzebig data becomesa hot issue.This paper introduces the concept and characteristicsof big data,analyzes the core issues,such as the correlation analysis,real-time processing,etc.,and finally discussesmany challengeslarge data may face.
作者 马建光 姜巍
出处 《国防科技》 2013年第2期10-17,共8页 National Defense Technology
关键词 大数据 非结构化信息 解决核心问题 未来挑战 big data unstructured information resolve of the core issues future challenges
  • 相关文献

参考文献16

二级参考文献309

  • 1[OL].<http://hadoop.apache.org.>.
  • 2WinterCorp: 2005 TopTen Program Summary. http:// www. wintercorp, com/WhitePapers/WC TopTenWP. pdf.
  • 3TDWI Checklist Report: Big Data Analytics. http://tdwi. org/research/2010/08/Big-Data-Analytics, aspx.
  • 4Chaudhuri S, Dayal U. An overview of data warehousing and OLAP technology. SIGMOD Rec, 1997,26(1): 65-74.
  • 5Madden S, DeWitt D J, Stonebraker M. Database parallelism choices greatly impact scalability. DatabaseColumn Blog. http://www, databasecolumn, com/2007/10/database-parallelism-choices, html.
  • 6Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters//Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI ' 04). San Francisco, California, USA, 2004: 137-150.
  • 7DeWitt D J, Gerber R H, Graefe G, Heytens M L, Kumar K B, Muralikrishna M. GAMMA--A high performance dataflow database machine//Proceedings of the 12th International Conference on Very Large Data Bases (VLDB' 86). Kyoto, Japan, 1986:228-237.
  • 8Fushimi S, Kitsuregawa M, Tanaka H. An overview of the system software of a parallel relational database machine// Proceedings of the 12th International Conference on Very Large DataBases(VLDB'86). Kyoto, Japan, 1986:209-219.
  • 9Brewer E A. Towards robust distributed systems//Proceedings of the 19th Annual ACM Symposium on Principles of Distributed Computing (PODC' 00). Portland, Oregon, USA, 2000:7.
  • 10http: //www. dbms2, com/2008/08/26/known-applications of mapreduce/.

共引文献4333

同被引文献2517

引证文献385

二级引证文献2561

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部