期刊文献+

基因序列图形表达及聚类分析应用研究 被引量:4

Graphical representation of DNA sequences and the application research of clustering analysis
下载PDF
导出
摘要 在基因序列图形表达模型研究的基础上,提出了一种新的非退化的基因图形三维表示方法。该表达方法不仅避免了图形的重叠和交叉,同时还保留了序列的生物学特征。利用该表达方法对H5N1病毒基因序列进行数字特征的提取并引入基于多维PFS判别函数进行模糊聚类分析应用。在聚类分析过程中直接利用数字特征矩阵作为分析数据,分析结果表明:利用文中所给图形表达建立基因序列数字特征矩阵进行的聚类分析具有一定的合理性。 This article proposed a new 3D graphic representation with non-degeneration on the basis of studying gene sequences representation model. This representation can avoid the overlap or cross without losing biological information. This method was used to abstract character of I-ISN1 virus gene and propose a clustering analysis based on multi-dimension Pseudo F-Statistios (PFS) parameter. In the procedure of clustering, the numerical characterization matrix was used as analytic data source. The result shows that it is rational to make clustering analysis on the gene character abstract from the new graphic representation.
出处 《计算机应用》 CSCD 北大核心 2007年第9期2330-2333,共4页 journal of Computer Applications
基金 湖南省自然科学重点基金资助项目(06JJ4076) 湖南省财政厅项目基金资助项目([2005]90)
关键词 基因序列 图形表达 聚类分析 伪F统计量 基因数字特征 gene sequences graphic representation e.lustering analysis Pseudo F-Statistios gene numerical characterization
  • 相关文献

参考文献18

  • 1ZHANG C T,ZHANG R.Analysis of distribution of bases in the coding sequences by a diagrammatic technique[J].Nucleic Acids Research,1991,19:6313-6317.
  • 2ZHANG R,ZHANG C T,CURVES Z.an intuitive tool for visualizing and analyzing DNA sequences[J].Journal Biomolec Struct Dyn,1994,11:767-782.
  • 3GUO F B,OU H Y,ZHANG C T.ZCURVE:a new system for recognizing protein coding genes in bacterial and archaeal genomes[J].Nucleic Acids Research,2003,31:1780-1789.
  • 4ZHENG W X,CHEN L L,OU H Y,et al.Coronavirus phylogenybased on a geometric approach[J].Molecular Phylogenetics and Evolution,2005,36(2):224-232.
  • 5YUAN C X,LIAO B,WANG T M.New 3D graphical representation of DNA sequences and their numerical characterization[J].Chemical Physics Letters,2003,379:412-417.
  • 6RANDIC M,ZUPAN J,NOVIC M.On 3-D Graphical representation of proteomics maps and their numerical characterization[J].Journal of Chemical Information and Computer Sciences,2001,41(5):1339-1344.
  • 7RANDIC M,VRACKO M,NANDY A,et al.On 3-D Graphical representation of DNA primary sequences and their numerical characterization[J].Journal of Chemical Information and Computer Sciences,2000,40(5):1235-1244.
  • 8LIAO B.A 2D graphical representation of DNA sequence[J].Chemical Physics Letters,2005,401:196-199.
  • 9LIAO B,TAN M,DING K.Application of 2-D graphical representation of DNA sequence[J].Chemical Physics Letters,2005,414:296-300.
  • 10LIAO B,WANG T M.Analysis of similarity/dis-similarity of DNA sequences based on 3-D graphical representation[J].Chemical Physics Letters,2004,388:195-200.

二级参考文献14

  • 1R Sharan, R Elkon, R Shamir. Cluster Analysis and its Application to Gene Expression Data[C]//In Proceedings of the 38th Ernst Schering workshop on Bioinformatics and Genome Analysis. Japan: Springer Verlag, 2002:83-108.
  • 2Einav U. Class Discovery in Acute Lymphoblastic Leukemia using gene expression analysis[D]. M.Sc Thesis, USA: Kluwer Academic,2003.
  • 3Alon U, Barkai N, Notter man D A, et al. Broad pattems of gene expression revealed by clustering analysis of rumor and normal colon tissues probed by oligonucleotide arrays[C]// Proc. Natl. Acad. Sci USA, 1999,96:6745-6750.
  • 4Eisen M B, Spellman PT, Brown P O. Cluster analysis and display of genome-wide expression patterns [C]//Proc. Natl. Acad. Sci, USA,1998,95:14863-14868.
  • 5Sharan R, Shamir R. CLICK: A Clustering Algorithm with Applications to Gene Expression Analysis[C]//. In Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology (ISMB). New York: ACM Press, 2000:307-316.
  • 6Eran Segal, Daphne Koller. Probabilistic Hierarchical Clustering for Biological Data[C]//In Proceedings of the sixth annual international conference on Computational biology. New York: ACM Press, 2002:273-280.
  • 7Kohonen T. Self- OrganizingMaps[M]. New York: Springer- Verlag,1997.
  • 8Brian S Everitt, Graham Dunn. Applied Multivariate Data Analysis[M]. UK: Oxford University Press, 2001.
  • 9Theresa M. Culley, Lisa E. Wallace. Calculating F-Statistics[EB/OL].(2001)[2004]. Http://ib.Berkeley.edu/courses/ib160/h13a.html.
  • 10马振华.现代应用数学手册-概率论与随机过程卷[K].北京:清华大学出版社,2002.

共引文献10

同被引文献32

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部