Multiple Nonlinear Statistical Method of Population Genetic Structure Based on the Allelic Polymorphism Data

Cited by: 8
Abstract: Multivariate statistical analysis of multidimensional gene polymorphism data has long relied on classical multivariate linear methods designed for unconstrained data: cluster analysis for computing genetic distances, and principal component analysis, factor analysis, and canonical correlation analysis for studying population genetic structure. The problems caused by the "closure effect" in such data have gone unnoticed. Starting from the distribution and structure of allelic polymorphism data, this paper points out that the data have the characteristics of closed data (also called compositional data or constant-sum data). Because every row sums to a constant, the correlation structure contains null correlations introduced by closure and the statistical distribution is not normal, which creates great difficulties when the data are analyzed with classical multivariate linear methods. Based on the theory of compositional data analysis proposed by Aitchison in 1982, a multivariate nonlinear statistical method derived from the "logratio" approach is put forward. Taking principal component analysis as an example, the logratio method was used to analyze the genetic structure of the TH01 polymorphic locus in the Chinese population, and the results were compared with those of classical linear principal component analysis. The authors conclude that logratio nonlinear principal component analysis is a better method for analyzing population genetic structure from allelic polymorphism data, with the advantages of specificity and sensitivity.
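The comparison described in the abstract can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: the abstract does not say which logratio transform was used, so the centered logratio (clr) is an assumption here, the function names `clr` and `pca` are illustrative, and the allele frequencies are hypothetical values chosen only so that each row sums to 1 (they are not TH01 data). All frequencies are assumed strictly positive, since the logarithm of zero is undefined.

```python
import numpy as np


def clr(freqs):
    """Centered logratio transform of strictly positive rows that sum to 1."""
    logs = np.log(freqs)
    # Subtract each row's mean log, i.e. divide by the row's geometric mean.
    return logs - logs.mean(axis=1, keepdims=True)


def pca(data):
    """PCA via SVD of column-centered data.

    Returns the component scores and the share of variance per component.
    """
    centered = data - data.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    scores = u * s
    variances = s ** 2
    return scores, variances / variances.sum()


# Hypothetical allele frequencies (rows: populations, columns: alleles).
# Assumption for illustration only; not taken from the paper.
freqs = np.array([
    [0.10, 0.25, 0.30, 0.35],
    [0.12, 0.20, 0.33, 0.35],
    [0.05, 0.30, 0.25, 0.40],
    [0.20, 0.15, 0.40, 0.25],
    [0.08, 0.28, 0.29, 0.35],
])

# Classical linear PCA applied directly to the closed (constant-sum) data.
linear_scores, linear_ratio = pca(freqs)

# Logratio PCA: transform first, then apply the same PCA.
logratio_scores, logratio_ratio = pca(clr(freqs))

print("linear PCA variance shares:  ", np.round(linear_ratio, 4))
print("logratio PCA variance shares:", np.round(logratio_ratio, 4))
```

In both cases the last component carries essentially no variance, because the constant-sum constraint removes one degree of freedom; the two analyses differ in how the remaining variation is measured, with the logratio version working on relative rather than absolute differences between allele frequencies.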
Source: Acta Genetica Sinica (indexed in SCIE, CAS, CSCD, Peking University Core Journals), 2004, No. 2, pp. 202-211 (10 pages)
Funding: National Natural Science Foundation of China (No. 30170527)
Keywords: allelic polymorphism; genetic structure of populations; multiple nonlinear statistical method
References (9)

  • [9] Aitchison J. The statistical analysis of compositional data. Chapman and Hall, 1986.
  • [10] Butler J C. The effects of closure on the moments of a distribution. J Math Geol, 1979a, 11(1): 75-84.
  • [11] Butler J C. Effects of closure on the measures of similarity between samples. J Math Geol, 1979b, 11(4): 75-84.
  • [12] Chayes F, Trochimczyk J. An effect of closure on the structure of principal components. J Math Geol, 1978, 10(4): 323-333.
  • [16] Aitchison J. The statistical analysis of compositional data (with discussion). J Roy Stat Soc, Ser B, 1982, 44: 140-177.
  • [17] Aitchison J. Logratios and natural laws in compositional data analysis. Mathematical Geology, 1999, 31(5): 563-580.
  • [18] Aitchison J. Logratio transformation of compositional data: a resolution of the constant sum constraint. Marine Micropaleontology, 1998, 34: 117-120.
  • [19] Aitchison J. Logratio analysis and compositional distance. Mathematical Geology, 2000, 32(3): 271-275.
  • [20] Reyment R. Multidimensional palaeobiology. Pergamon Press, Oxford, 1991.
