期刊文献+

群体基因组学方法:从经典统计学到有监督学习 被引量:3

Population genomics:From classical statistics to supervised learning
原文传递
导出
摘要 群体遗传学的一个主要研究目标是理解突变、自然选择、遗传漂变、群体结构和数量变化等进化力量如何共同影响基因组中的遗传变异.通过分析DNA序列多态数据,可以推测曾经作用于基因组的各种力量,进而探讨生物演化的过程.近年来,随着第二代DNA测序技术的快速革新,群体遗传学进入了基因组学时代,相关的方法在不断发展,并可将群体基因组学方法分为经典统计学方法和新兴的机器学习方法.前者包括经典群体遗传学统计量、单一统计量或多统计量联合检测自然选择、群体历史与自然选择的联合估计以及基于溯祖树和祖先重组图的方法.后者主要基于有监督学习,为群体基因组时代的大数据分析带来了全新范式.本文从理论基础出发,全面回顾了群体基因组学方法发展变化的历程,着重介绍了该领域的最新进展,并就未来的发展方向进行了展望. It is essential to understand how the patterns of genetic variation in organisms have been shaped by different evolutionary forces,such as mutation,natural selection,genetic drift,population structure,and population size change.In recent years,with the rapid innovation of next-generation sequencing technology,we are facing the new era of population genomics.The relevant population genomics methods can be classified as classical statistics and supervised learning.The classical statistics methods include many popular ones for detecting natural selection and inferring the parameters of demography,which are based on single or multiple combined statistics.The supervised learning methods may promise a new paradigm to make sense of large datasets in the genomic era.Here a brief introduction was first given on the important theory in population genomics.Then we overviewed the recent research progress in population genomics and shared our perspectives on its future development.
作者 施怿 李海鹏 SHI Yi;LI Hai Peng(Key Laboratory of Computational Biology,CAS-MPG Partner Institute for Computational Biology,Shanghai Institute of Nutrition and Health,Shanghai Institutes for Biological Sciences,Chinese Academy of Sciences,Shanghai 200031,China;Center for Excellence in Animal Evolution and Genetics,Chinese Academy of Sciences,Kunming 650223,China;University of Chinese Academy of Sciences,Beijing 100049,China)
出处 《中国科学:生命科学》 CSCD 北大核心 2019年第4期445-455,共11页 Scientia Sinica(Vitae)
基金 中国科学院战略性先导科技专项(批准号:XDB13040800) 国家自然科学基金(批准号:91531306 91731304)资助
关键词 群体基因组学 自然选择 重组率 经典统计学 有监督学习 population genomics natural selection recombination rate classical statistics supervised learning
  • 相关文献

参考文献6

二级参考文献166

  • 1Kimura M.Evolutionary rate at the molecular level.Nature,1968,217(5129):624-626.
  • 2Sabeti PC,Schaffner SF,Fry B,Lohmueller J,Varilly P,Shamovsky O,Palma A,Mikkelsen TS,Altshuler D,Lander ES.Positive natural selection in the human lineage.Science,2006,312(5780):1614-1620.
  • 3Biswas S,Akey JM.Genomic insights into positive selection.Trends Genet,2006,22(8):437-446.
  • 4Li WH,Wu CI,Luo CC.A new method for estimating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes.Mol Biol Evol,1985,2(2):150-174.
  • 5McDonald JH,Kreitman M.Adaptive protein evolution at the Adh locus in Drosophila.Nature,1991,351(6328):652-654.
  • 6Sabeti PC,Reich DE,Higgins JM,Levine HZ,Richter D J,Schaffner SF,Gabriel SB,Platko JV,Patterson N J,McDonald GJ,Ackerman HC,Campbell S J,Altshuler D,Cooper R,Kwiatkowski D,Ward R,Lander ES.Detecting recent positive selection in the human genome from haplotype structure.Nature,2002,419(6909):832-837.
  • 7Tajima E The effect of change in population size on DNA polymorphism.Genetics,1989,123(3):597-601.
  • 8Fu YX.Statistical tests of neutrality of mutations against population growth,hitchhiking and background selection.Genetics,1997,147(2):915-925.
  • 9Fay JC,Wu CI.Hitchhiking under positive Darwinian selection.Genetics,2000,155(3):1405-1413.
  • 10Otto SP.Detecting the form of selection from DNA sequence data.Trends Genet,2000,16(12):526-529.

共引文献51

同被引文献32

引证文献3

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部