期刊文献+

基于图形表达的基因序列模糊聚类应用研究

Application Research of Fuzzy Clustering Analysis Based on Graphical Representation of DNA Sequences
下载PDF
导出
摘要 给出了一种新的非退化的基因图形三维表达方法,这一表达方法不仅避免了图形的重叠和交叉,同时还保留了序列的生物学特征。利用这种表达方法对H5N1病毒基因序列进行数字特征的提取,并引入基于多维PFS判别函数进行模糊聚类分析应用。在聚类分析过程中直接利用数字特征作为源数据,分析结果表明,利用该图形表达建立基因序列数字特征矩阵进行的聚类分析具有一定的合理性。 A novel 3 - D graphic representation with nondegeneration was presented. This representation has the virtue of avoiding the overlap or cross without losing biological information. The proposed method was used to abstract the numeral character of HSN1 virus gene sequence. A clustering analysis based on the multi - dimension PFS parameter was proposed. In the procedure of clustering,the character was used as the data source. The result shows that it is rational to make clustering analysis on the gene character abstract from the new graphic representation.
出处 《武汉理工大学学报(信息与管理工程版)》 CAS 2009年第1期25-29,共5页 Journal of Wuhan University of Technology:Information & Management Engineering
基金 湖南省自然科学重点基金资助项目(06JJ4076) 湖南省财政厅基金资助项目(200590)
关键词 基因图形表达 聚类分析 伪F统计量 gene graphic representation clustering analysis Pseudo F -Statistics
  • 相关文献

参考文献17

  • 1ZHANG C T,ZHANG R. Analysis of distribution of bases in the coding sequences by a diagrammatic technique[J]. Nucleic Acids Res. ,1991 (19) :6313 -6317.
  • 2ZHANG R,ZHANG C T. Z curves, an intuitive tool for visualizing and analyzing DNA sequences [ J ]. J. Biomol. Struct. Dyn. ,1994( 11 ) :767 -782.
  • 3GUO F B,OU H Y,ZHANG C T. Z curve:a new system for recognizing protein coding genes in bacterial and archaeal genomes [ J ]. Nucleic Acids Res. , 2003 (31) :1780 - 1789.
  • 4RANDIC M,ZUPAN J, NOVIC M. On 3 - D graphical representation of proteomics maps and their numerical characterization[ J ]. J. Chem. Inf. Comput. Sci. , 2001 (41) :1339 - 1344.
  • 5YUAN C X, LIAO B,WANG T M. New 3D graphical representation of DNA sequences and their numerical characterization [ J ]. Chemical Physics Letters, 2003 (379) :412 -417.
  • 6ZHENG W X,CHEN L L. Coronavirus phylogeny based on a geometric approach [J].Molecular Phylogenetics and Evolution,2005 (36) :224 -232.
  • 7RANDIC M,VRACKO M,NANDY A,et,al. On 3 - D graphical representation of DNA primary sequences and their numerical characterization [ J ]. J. Chem. Inf. Comput. Sci. , 2000 ( 40 ) : 1235 - 1244.
  • 8LIAO B. A 2D graphical representation of DNA sequence [ J ]. Chemical Physics Letters,2005 ( 401 ) : 196 - 199.
  • 9LIAO B,TAN M S,DING K Q. Application of 2 -D graphical representation of DNA sequence [ J ]. Chemical Physics Letters,2005 ( 414 ) : 296 - 300.
  • 10LIAO B,WANG T M. Analysis of similarity/dissimilarity of DNA sequences based on 3 - D graphical representation [ J ]. Chemical Physics Letters, 2004 (388) :195 -200.

二级参考文献14

  • 1R Sharan, R Elkon, R Shamir. Cluster Analysis and its Application to Gene Expression Data[C]//In Proceedings of the 38th Ernst Schering workshop on Bioinformatics and Genome Analysis. Japan: Springer Verlag, 2002:83-108.
  • 2Einav U. Class Discovery in Acute Lymphoblastic Leukemia using gene expression analysis[D]. M.Sc Thesis, USA: Kluwer Academic,2003.
  • 3Alon U, Barkai N, Notter man D A, et al. Broad pattems of gene expression revealed by clustering analysis of rumor and normal colon tissues probed by oligonucleotide arrays[C]// Proc. Natl. Acad. Sci USA, 1999,96:6745-6750.
  • 4Eisen M B, Spellman PT, Brown P O. Cluster analysis and display of genome-wide expression patterns [C]//Proc. Natl. Acad. Sci, USA,1998,95:14863-14868.
  • 5Sharan R, Shamir R. CLICK: A Clustering Algorithm with Applications to Gene Expression Analysis[C]//. In Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology (ISMB). New York: ACM Press, 2000:307-316.
  • 6Eran Segal, Daphne Koller. Probabilistic Hierarchical Clustering for Biological Data[C]//In Proceedings of the sixth annual international conference on Computational biology. New York: ACM Press, 2002:273-280.
  • 7Kohonen T. Self- OrganizingMaps[M]. New York: Springer- Verlag,1997.
  • 8Brian S Everitt, Graham Dunn. Applied Multivariate Data Analysis[M]. UK: Oxford University Press, 2001.
  • 9Theresa M. Culley, Lisa E. Wallace. Calculating F-Statistics[EB/OL].(2001)[2004]. Http://ib.Berkeley.edu/courses/ib160/h13a.html.
  • 10马振华.现代应用数学手册-概率论与随机过程卷[K].北京:清华大学出版社,2002.

共引文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部