期刊文献+

Bayesian optimization algorithm-based methods searching for risk/protective factors

Bayesian optimization algorithm-based methods searching for risk/protective factors
原文传递
导出
摘要 The risks of developing complex diseases are likely to be determined by single nucleotide polymorphisms (SNPs), which are the most common form of DNA variations. Rapidly developing genotyping technologies have made it possible to assess the influence of SNPs on a particular disease. The aim of this paper is to identify the risk/protective factors of a disease, which are modeled as a subset of SNPs (with specified alleles) with the maximum odds ratio. On the basis of risk/protective factor and the relationship between nucleotides and amino acids, two novel risk/protective factors (called k-relaxed risk/protective factors and weighted-relaxed risk/protective factors) are proposed to consider more complex disease-associated SNPs. However, the enormous amount of possible SNPs interactions presents a mathematical and computational challenge. In this paper, we use the Bayesian Optimization Algorithm (BOA) to search for the risk/protective factors of a particular disease. Determining the Bayesian network (BN) structure is NP-hard; therefore, the binary particle swarm optimization was used to determine the BN structure. The proposed algorithm was tested on four datasets. Experimental results showed that the algorithm proposed in this paper is a promising method for discovering SNPs interactions that cause/prevent diseases. The risks of developing complex diseases are likely to be determined by single nucleotide polymorphisms (SNPs), which are the most common form of DNA variations. Rapidly developing genotyping technologies have made it possible to assess the influence of SNPs on a particular disease. The aim of this paper is to identify the risk/protective factors of a disease, which are modeled as a subset of SNPs (with specified alleles) with the maximum odds ratio. On the basis of risk/protective factor and the relationship between nucleotides and amino acids, two novel risk/protective factors (called k-relaxed risk/protective factors and weighted-relaxed risk/protective factors) are proposed to consider more complex disease-associated SNPs. However, the enor- mous amount of possible SNPs interactions presents a mathematical and computational challenge. In this paper, we use the Bayesian Optimization Algorithm (BOA) to search for the risk/protective factors of a particular disease. Determining the Bayesi- an network (BN) structure is NP-hard; therefore, the binary particle swarm optimization was used to determine the BN structure. The proposed algorithm was tested on four datasets. Experimental results showed that the algorithm proposed in this paper is a promising method for discovering SNPs interactions that cause/prevent diseases.
出处 《Chinese Science Bulletin》 SCIE EI CAS 2013年第23期2828-2835,共8页
基金 supported by the National Natural Science Foundation of China(60774086 and 61173111) Ph.D.Program Foundation of Ministry of Education of China(20090201110027)
关键词 贝叶斯算法 保护因子 风险 单核苷酸多态性 基础 贝叶斯优化算法 NP-HARD 粒子群优化算法 risk/protective factors single nucleotide polymorphisms Bayesian optimization algorithm binary particle swarm optimi-zation
  • 相关文献

参考文献1

二级参考文献24

  • 1Shutao Li,Xixian Wu,Mingkui Tan.Gene selection using hybrid particle swarm optimization and genetic algorithm[J]. Soft Computing . 2008 (11)
  • 2Jochen Hampe,Stefan Schreiber,Michael Krawczak.Entropy-based SNP selection for genetic association studies[J]. Human Genetics . 2003 (1)
  • 3Steven L. Salzberg.On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach[J]. Data Mining and Knowledge Discovery . 1997 (3)
  • 4Kraft P,Hunter D J.Genetic risk prediction--are we there yet. The New England Journal of Medicine . 2009
  • 5Moffatt M F,Kabesch M,Liang L M,et al.Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma. Nature . 2007
  • 6Wan X,Yang C,Yang Q,et al.MegaSNP Hunter: A learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study. BMC Bioinformatics . 2009
  • 7Xie Q,Ratnasinghe L D,Hong H,et al.Decision forest analysis of 61 single nucleotide polymorphisms in a case-control study of esophageal cancer; a novel method. BMC Bioinformatics . 2005
  • 8Dumitru B,Alexander Z.Design and validation of methods searching for risk factor in genotype case-control studies. Journal of Computational Biology . 2008
  • 9Uhmn S,Kim D H,Ko Y W,et al.A study on application of single nucleotide polymorphism and machine learning techniques to diagnosis of chronic hepatitis. Expert Systems . 2009
  • 10Wang T H,Wang H S.A genome-wide association study primer for clinicians. Taiwan J Obstet Gynecol . 2009

共引文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部