期刊文献+

人类群体遗传结构的图论主成分分析方法 被引量:2

The Methodology of Graph Theory Principle Components Analysis on Human Population Genetic Structure
下载PDF
导出
摘要 目的提出基因频率矩阵的图论主成分分析方法,探讨该法在人类群体遗传结构研究中的应用。方法从分析基因频率矩阵的结构特征入手,将主成分分析与图论中的最小生成树有机结合,构建人类群体遗传结构的图论主成分模型,并以中国26个汉族人群HLA-A基因座遗传结构分析为例,验证图论主成分分析的科学性和适用性。结果图论主成分分析的基本步骤可概括为:①对中心化基因频率的协方差矩阵进行主成分分析;②按图论原理求过m维空间n个点的最小生成树;③利用求“颈”法分割最小生成树;④将最小生成树整合到二维主成分散点图中构建图论主成分分类图。根据此步骤,对中国26个汉族人群HLA-A基因座遗传结构进行了图论主成分分析,分析结果符合中华民族源与流的客观规律。结论图论主成分分类图既可显示各群体的遗传结构特性,又可利用最小生成树的链接关系揭示各群体间的内在联系;图论主成分分析是分析人类群体遗传结构的一种较好方法。 Objective This paper presents the methodology of graph theory principle components analysis, and explores its applicability for studying human population genetic structure. Methods Based on the structure of gene frequency matrix, we combine the method of principle components analysis with the minimal spanning tree of graph theory, to set up the model of graph theory principle components analysis. As an example, the genetic structure of HLA-A locus in 26 Hun populations is analyzed by using graph theory principle components analysis, to show its rationality for studying human population genetic structure. Results The step by step of graph theory principle components analysis can be summarized as follows: ① Carry out principle components analysis to the centred gene frequency covariance matrix; ②Calculate the minimal spanning tree, which passes the points in dimensions space; ③Decompose the minimal spanning tree into several parts using the method of necks found; ④Set up the graph theory principal components classification graph in the principal components classification scallergram. According to these steps, the genetic structure of HLA- A locus in 26 Han populations is analyzed, which indicates that the results accord with the population genetic mechanism of Chinese Han population. Conclusion The graph theory principal components classification scallergram can show not only the genetic structure of populations, but also the intrinsic relationship among the populations. The graph theory principle components analysis is ideality method for studying the human population genetic structure.
出处 《中国卫生统计》 CSCD 北大核心 2006年第1期19-23,共5页 Chinese Journal of Health Statistics
基金 国家自然科学基金资助项目(30170527)
关键词 人类群体遗传结构 图论主成分分析 HLA-A Human population genetic structure, Graph thcory principle components analysis, HLA-A
  • 相关文献

参考文献15

二级参考文献26

  • 1[9]J Aitchison.The statistical analysis of compositional data.Chapman and Hall,1986.
  • 2[10]Butler J C.The effects of closure on the moments of a distribution.J Math Geol,1979a,11(1):75~84.
  • 3[11]Butler J C.Effects of closure on the measures of similarity between samples.J Math Geol,1979b,11(4):75~84.
  • 4[12]Chayes F,Trochimczyk J.An effect of closure on the structure of principal components.J Math Geol,1978,10(4):323~333.
  • 5[16]Aitchison J.The statistical analysis of compositional data (with discussion).J Roy Stat Soc,Ser B,1982,44:140~177.
  • 6[17]Aitchison J.Logratios and natural in compositional data analysis.Mathematical Geology,1999,31(5):563~580.
  • 7[18]Aitchison J.Logratio transformation of compositional data a resolution of the constant sum con20.straint.Marine Micropaleontology,1998,34:117~120.
  • 8[19]Aitchison J.Logratio Analysis and Compositional Distance.Mathematical Geology,2000,32(3):271~275.
  • 9[20]Reyment R. Multidimensional palaeobiology. Pergamon Press, Ox-ford,1991.
  • 10赵桐茂,人类学学报,1987年,6卷,1期,1页

共引文献139

同被引文献27

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部