期刊文献+

基于NoSQL数据库的大数据查询技术的研究与应用 被引量:28

Research and Application of Large Data Query Technology Based on NoSQL Database
下载PDF
导出
摘要 基于NoSQL数据库理论,根据应用场景的不同,将NoSQL数据库分为面向高性能读写、面向文档和面向分布式计算的3种类型。对比分析这3种类型数据库的6种代表产品的优缺点,结合铁路客票实名制售票信息综合分析系统中的大数据操作的需求,选用NoSQL数据库中的面向分布式计算的Cassandra数据库。基于Cassandra数据库,提出铁路客票实名制信息综合分析系统的技术架构,并设计反向索引以构建客票实名制乘车信息的查询策略和查询流程。通过性能测试,验证了NoSQL数据库技术在处理大数据查询和分析中的高可用性,可突破传统关系型数据库和数据仓库在应用中所遇到的查询性能、扩展性以及投资成本的瓶颈。 Based NoSQL database theory and different application scenarios,NoSQL database can be divided into three types for high-performance read and write,for documents and for distributed computing.According to the comparative analyses of the advantages and disadvantages of six representative products for these three types of databases,and combining with the demands for large data manipulation in the integrated railway real-name ticketing information analysis system,Cassandra database is chosen as NoSQL database for distributed computing.The technical architecture of integrated railway real-name ticketing information analysis system is proposed based on Cassandra database,and inverted indices are designed to build the query strategies and query processes of travel information for ticket real-name system.The high availability of NoSQL database technology in handling and analyzing large data queries has been verified through performance tests.The bottlenecks of query performance,scalability and investment cost of traditional relational database and data warehouse in applications can be broken through.
出处 《中国铁道科学》 EI CAS CSCD 北大核心 2014年第1期135-141,共7页 China Railway Science
基金 中国铁道科学研究院行业服务技术创新项目(1151DZ1003)
关键词 NOSQL数据库 Cassandra数据库 大数据处理 反向索引 数据查询 NoSQL database Cassandra database Large data processing Inverted index Data query
  • 相关文献

参考文献10

  • 1陈勇.地铁自动售检票系统[J].铁道通信信号,2002,38(3):17-19. 被引量:12
  • 2GILBERT S, LYNCH N. Brewer's Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services [J]. Sigact Newsletter, 2002, 33 (2): 51- 59.
  • 3VOGELS W. Eventually Consistent [J].. Queue, 2008, 6 (6) : 14-19.
  • 4FAY Chang, JEFFREY Dean, SANJAY Ghemawat, et al. A Distributed Storage System for Structured Data [C] //In Proceedings of the 7^th USENIX Symposium on Operating Systems Design and Implementation (OSDI'06). USA: USENIX Association, 2006: 205-218.
  • 5姜龙翔,王鑫,李旭,冯志勇.一种大规模RDF语义数据的分布式存储方案[J].计算机应用与软件,2011,28(11):30-32. 被引量:6
  • 6DAVID Karger, ERIC Lehman, TOM Leighton, et al. Consistent Hashing and Random Trees: Distributed Caching Protocols for Relieving Hot Spots on the World Wide Web [C] //Proceedings of the Twenty-Ninth Annual ACM Symposium on Theory of Computing. New York: ACM, 1997: 654-663.
  • 7Wikipedia: NoSQL [EB/OL]. [2010-06-20]. http: //en. Wikipedia. Org/Wiki/NoSQL.
  • 8NoSQL Database [EB/OL]. [2010-06-20], http: //NoSQLNoSQL. Database. Org/.
  • 9Apache Cassandra [EB/OL]. [2010-06-20]. http: //Cassandra. Apache. Org/.
  • 10LAKSHMAN A, MALIK P. Cassandra: Structured Storage System on a P2P Network [C] //Proceedings of the 28th ACM Symposium on Principles of Distributed Computing. Canada: ACM, 2009: 5-5.

二级参考文献10

  • 1Klyne G, Carroll J J, McBride B. Resource description framework (RDF): concepts and abstract syntax [ S ]. W3C recommendation. World Wide Web Consortium(W3C), 2004.
  • 2Berners-Lee T, Hendler J, Lassila O. The Semantic Web - A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities [ J ]. Science American, 2001.
  • 3Linked Data[EB/OL]. 2011 - 05 - 15. http://linkeddata. org/.
  • 4Fay C, Jeffrey D, Sanjay G. Bigtable: A Distributed Storage System for Structured Data[C]//Proc. of the 7th OSDI, 2006.
  • 5Sanjay G, Howard G, Shun-Tak L. The Google File System[ C]// Proc. of the 19th ACM SOSP,2003:29 -43.
  • 6Hyunsik C, Jihoon S, YongHyun C. SPIDER : A System for Scalable, Parallel/Distributed Evaluation of Large-scale RDF Data [ C ]//Proc. of 18th CIKM, 2009:2057 -2088.
  • 7Giuseppe D, Deniz H, Madan J. Dynamo: Amazon' s Highly Available Key-value Store[ C]//Proc. of 21st SOSP, 2007:205 -220.
  • 8Min C, Martin F. RDFPeers: A Scalable Distributed RDF Repository Based on A Structured PeertoPeer Network[ C ]. 2004.
  • 9Apache Cassandra[EB/OL]. 2011-05-16. http ://cassandra. apache. org/ .
  • 10Oren E, Kotoulas S, Anadiotis G, et al. MARVIN: A platform for large-scale analysis of Semantic Web data [ C ]//Proceeding of the WebSci'09 : Society On-Line, 2009.

共引文献16

同被引文献157

引证文献28

二级引证文献128

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部