摘要
目的提出一种在功能模块化水平上组织大量基因表达谱关联规则的新方法。方法以高频出现在关联规则后件中的基因为基础分类关联规则,再选择显著聚集前件基因的GO基因功能类作为功能模块关联规则的前件,获得功能模块表达关联规则。结果根据NC I60数据,发现了一些与癌症相关的基因及相应的基因功能模块表达关联规则。结论利用功能模块表达关联规则可以有效地挖掘疾病基因表达谱。
Objective To apply apriori algorithm to mine the gene expression profiles will produce a huge number of association rules. A new approach to organize the mined association rules on the modularized gene functional category level is proposed. Methods The association rules containing the genes frequently appearing in the head parts of all the association rules were selected and grouped together. The gene functional categories, which were non-randomly enriched with the genes in the body parts of each grouped association rules, were found with statistical test and were used to form the body parts of the functional modular level association rules. Results By applying this new method to NCI60 data set, some cancer associated genes and their related functional category level expression association rules were discovered. Conclusion Gene expression profiles of diseases can be efficiently mined by association rules on the modularized gene functional category level.
出处
《第三军医大学学报》
CAS
CSCD
北大核心
2006年第13期1370-1373,共4页
Journal of Third Military Medical University
基金
国家自然科学基金资助项目(30370798
30170515
30370388)
国家高技术研究发展计划("863")资助项目(2003AA2Z20512002AA2Z2052)
黑龙江科技攻关资助重点项目(GB03C602-4)
哈尔滨市科技攻关资助项目(2003AA3CS113)
黑龙江自然科学基金资助项目(F0177)~~