
A study of the design of a big-data storage and index model for teaching based on HBase/Spark
Cited by: 1
Abstract: To process and manage massive teaching data efficiently, this paper proposes a storage and index model for teaching big data based on HBase/Spark. First, an HBase table is built on combined row keys and pre-split according to the teaching course classification, and a cost-scoring function is constructed to detect and migrate load, which addresses write hot spots and load balancing during data storage. Efficient queries over teaching big data are then realized through semantic parsing, index queries on the combined row key, and parallel filtering on attribute conditions in Spark. Experiments show that the HBase/Spark-based model achieves more efficient storage and retrieval management of teaching big data.
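The abstract names three mechanisms without giving their details: a combined row key, pre-splitting by course classification, and a cost score that triggers load migration. The following is a minimal Python sketch of how those pieces could fit together. The field order in the row key, the two-digit category codes, the cost weights, and the overload threshold are all assumptions made for illustration, not the paper's actual design, and the code models the logic in plain Python rather than calling HBase.

```python
from bisect import bisect_left, bisect_right

# Hypothetical two-digit course-category codes used as pre-split boundaries.
# Three split points give four regions: [ ,"02"), ["02","03"), ["03","04"), ["04", ).
SPLIT_POINTS = ["02", "03", "04"]


def make_row_key(category: str, course_id: str, student_id: str, ts_ms: int) -> str:
    """Combine fields into one row key; the category prefix decides the region."""
    return f"{category}|{course_id}|{student_id}|{ts_ms:013d}"


def region_for(row_key: str) -> int:
    """Index of the pre-split region whose [start, end) range contains row_key."""
    return bisect_right(SPLIT_POINTS, row_key)


def cost(requests: int, size_mb: float, w_req: float = 0.7, w_size: float = 0.3) -> float:
    """Illustrative weighted load score for one region (weights are assumptions)."""
    return w_req * requests + w_size * size_mb


def plan_migration(servers: dict, threshold: float = 1.2):
    """Return (region, source, target) if the busiest server is overloaded, else None.

    servers maps a server name to a dict of {region name: cost score}.
    """
    totals = {name: sum(regions.values()) for name, regions in servers.items()}
    mean = sum(totals.values()) / len(totals)
    source = max(totals, key=totals.get)
    target = min(totals, key=totals.get)
    if totals[source] <= threshold * mean or not servers[source]:
        return None
    # Simplest possible policy: move the single hottest region to the idlest server.
    hottest = max(servers[source], key=servers[source].get)
    return hottest, source, target


def prefix_scan(sorted_keys: list, prefix: str) -> list:
    """Row-key range scan: all keys starting with prefix, located by binary search."""
    lo = bisect_left(sorted_keys, prefix)
    hi = bisect_left(sorted_keys, prefix + "\uffff")
    return sorted_keys[lo:hi]
```

Because the category code leads the key, writes for different course categories land in different regions, and a query for one course becomes a contiguous range scan over the sorted keys. Filtering the scanned rows on attribute conditions, which the abstract assigns to Spark, is left out of this sketch.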
Authors: TANG Li (唐立); LI Ya-ping (李亚平); QU Jinshuai (曲金帅) (Department of Information Engineering, Anhui College of Economic Management, Hefei 230031, China; Teaching Affairs Office, Anhui College of Economic Management, Hefei 230031, China; School of Management, Hefei University of Technology, Hefei 230009, China; University of Key Laboratory of Information and Communication on Security Backup and Recovery in Yunnan Province, Yunnan Minzu University, Kunming 650500, China)
Source: Journal of Yunnan Minzu University: Natural Sciences Edition (《云南民族大学学报(自然科学版)》), CAS, 2020, No. 5, pp. 486-492, 507 (8 pages in total)
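Funding: Key Project of Natural Science Research of Anhui Higher Education Institutions (KJ2019A0965); Anhui Continuing Education Reform Project (2018jxjygg008); Yunnan Applied Basic Research Program Project (2018FD055); Teaching Research Project of the Anhui Provincial Department of Education (2019jyxm0910); Anhui High-Level Teaching Team Project (2018jxtd044).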
Keywords: HBase; Spark; combined row keys