摘要
通过研究基于Hadoop平台的map/reduce思想,针对关联规则算法Apriori算法提出其在分布式平台下的改进算法,利用分布式化的Apriori算法对居民体检中发现的乙肝患者疾病数据进行分析挖掘,主要建立乙肝阳性和其他健康指标间的关联规则。实验结果证明关联规则算法Apriori在医疗数据挖掘中的有效性和高效性。
By researching on the map/reduce theory based on Hadoop distributed system, for Apriori algorithm, which is a kind of association rules algorithm, puts forward the improved algorithm in a distributed platform. Uses distributed data Apriori algorithm to analyze data of disease patients with hepatitis B which is found in healthy examination. The purpose is to establish association rules between HBV-positive and other health indicators. The results prove that the association rule mining algorithms on medical data is effectiveness and efficiency.
基金
国家重大科技专项(No.2012ZX10004-901)
四川省科技支撑计划项目(No.2013SZ0002
No.2014SZ0109)