期刊文献+

并行化的Apriori算法在海量医疗文档数据挖掘中的应用及优化 被引量:7

Optimization and application of Apriori algorithm based on MapReduce in medical big data
下载PDF
导出
摘要 针对海量医疗文档数据中巨大潜在价值难以有效挖掘的现状,构建了基于NoSQL和MapReduce的存储与挖掘系统MSPM.通过以键值对形式存储,使复杂异构的医疗文档数据归结为统一的且适于被经典Apriori算法利用的事务数据格式,并通过挖掘MapReduce过程化,一次性全局扫描和兴趣集规约计数等优化策略,有效解决了Apriori算法在医疗大数据应用中开销大、执行速度慢和有效性差的问题. To solve the problem that values hidden in big medical data cannot be properly mined, an MSPM system based on NoSQL and MapReduce is proposed. By key-value storage, complex and heterogeneous data are summed up in a unified and convenient format of transaction for Apriori. With MapReduce, complete global scanning and interest set counting solved the problem of low speed, high overhead and poor effectiveness of Apriori algorithm in its application to medical data mining.
出处 《北京师范大学学报(自然科学版)》 CAS CSCD 北大核心 2016年第4期420-424,共5页 Journal of Beijing Normal University(Natural Science)
基金 国家发改委高技术服务业基金资助项目(2014648)
关键词 医疗文档大数据 非关系型数据库 MAPREDUCE 数据挖掘 APRIORI 算法优化 medical big data NoSQL MapReduce data mining Apriori optimization
  • 相关文献

参考文献13

  • 1Murdoch T B, Detsky A S. The inevitable application of big data to health care[J]. JAMA, 2013, 309(13):1351.
  • 2Miller R H, Sim I. Physicians~ use of electronic medical records~ barriers and solutions [J]. Health Affairs, 2004, 23(2):116.
  • 3Gyorodi C, Gyorodi R, Pecherle G, et al. A comparative study~ MongoDB vs. MySQL [ C] /// The 13th International Conference on Engineering of Modern Electric Systems, Oradea: IEEE, 2015:1-6.
  • 4Wu X D, Zhu X Q, Wu G Q, et al. Data mining with big data[J]. IEEE Transactions on Knowledge and Data Engineering, 2014, 26(1).. 97.
  • 5LinC H, Huang L C, Chou S C T, et al. Temporal event tracing on big healthcare data analyties[-C~//IEEE International Conference on Cloud Computing. [S. 1. ]: IEEE, 2014:281-287.
  • 6段季芳,梁雪芳,别荣芳,林定移.基于免疫算法的频繁项集挖掘[J].北京师范大学学报(自然科学版),2009,45(2):161-163. 被引量:1
  • 7Li C. Apriori algorithm optimization study based on MapReduce [C] /// 2015 International Conference on Automation, Mechanical Control and Computational Engineering. [S. 1. ] : Atlantis Press, 2015:261.
  • 8Dhanya S, Vysaakan M, Mahesh A S. An enhancement of the MapReduce Apriori algorithm using vertical data layout and set theory concept of intersection[M] /// Intelligent systems technologies and applications. [S. 1.] : Springer International Publishing, 2016:225-233.
  • 9林长方,吴扬扬,黄仲开,曾少俊.基于MapReduce的Apriori算法并行化[J].江南大学学报(自然科学版),2014,13(4):411-415. 被引量:13
  • 10程波,孙锁柱,吴西钊,王新允.AP-1及其相关基因uPA、uPAR在肺癌中的表达及其意义[J].北京师范大学学报(自然科学版),2010,46(4):492-496. 被引量:2

二级参考文献37

  • 1彭银香,何小东,朱志勇.基于免疫算法的多维关联规则挖掘方法[J].微计算机信息,2007,23(3):171-173. 被引量:4
  • 2刘芳,孙杨军.基于多克隆选择的多维关联规则挖掘算法[J].复旦学报(自然科学版),2004,43(5):742-745. 被引量:9
  • 3王评,陈国龙.一种基于人工免疫的新的频繁项挖掘算法[J].计算机科学,2005,32(8):155-157. 被引量:1
  • 4胡春峰,张迎春,刘慧,李绍东,荣玉涛,汪秀玲,徐凯.小细胞肺癌的CT征象和血管生成(VEGF、MVD)相关性研究[J].实用放射学杂志,2007,23(3):318-320. 被引量:15
  • 5de Castro L N, von Zuben F J. The clonal selection algorithm with engineering applications[C] // Proceedings of GECCO ' 00, Workshop on Artificial Immune Systems and Their Applications. London, UK: Springer, 2000 : 36
  • 6Dao-I Lin, Kedem Z M. Pincer-search: an efficient algorithm for discovering the maximum frequent set[J]. Knowledge and Data Engineering, IEEE Transactions:2002,14 (3) : 553
  • 7莫宏伟.人工免疫原理[M].哈尔滨:哈尔滨工业大学出版社,2003.
  • 8Dominique A G, Schlegel W. Mechanisms of transcriptional regulation underlying temporal intergration of signals[M]. Oxford: Oxford University Press, 2006: 5175-5188.
  • 9Rupp B, Lorenz U, Schmidt J, et al. Discordant effects of activator protein-1 transcription factor on gene regulation, invasion, and metastasis in spontaneous, radiation-induced, and los-induced osteqsarcomas [J]. Mol Carcinog, 1998, 23(2):69.
  • 10Vaiopoulos A G, Papachroni K K, Papavassiliou A G. Colon carcinogenesis: Learning from NF-kappaB and AP- 1[J]. Int J Biochem Cell Biol,2010, 42(7) :1061.

共引文献31

同被引文献58

引证文献7

二级引证文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部