期刊文献+

BioCluster:Tool for Identification and Clustering of Enterobacteriaceae Based on Biochemical Data

BioCluster:Tool for Identification and Clustering of Enterobacteriaceae Based on Biochemical Data
原文传递
导出
摘要 Presumptive identifcation of different Enterobaeteriaeeae species is routinely achieved based on biochemical properties. Traditional practice includes manual comparison of each biochem- ical property of the unknown sample with known reference samples and inference of its identity based on the maximum similarity pattern with the known samples. This process is labor- intensive, time-consuming, error-prone, and subjective. Therefore, automation of sorting and sim- ilarity in calculation would be advantageous. Here we present a MATLAB-based graphical user interface (GUI) tool named BioCluster. This tool was designed for automated clustering and iden- tification of Enterobacteriaceae based on biochemical test results. In this tool, we used two types of algorithms, i.e., traditional hierarchical clustering (HC) and the Improved Hierarchical Clustering (IHC), a modified algorithm that was developed specifically for the clustering and identification of within this species. IHC takes into account the variability in result of 1-47 biochemical tests family. This tool also provides different options to optimize the clus- tering in a user-friendly way. Using computer-generated synthetic data and some real data, we have demonstrated that BioCluster has high accuracy in clustering and identifying enterobacterial species based on biochemical test data. This tool can be freely downloaded at http://microbialgen.du.ac.bd/ biocluster/. Presumptive identifcation of different Enterobaeteriaeeae species is routinely achieved based on biochemical properties. Traditional practice includes manual comparison of each biochem- ical property of the unknown sample with known reference samples and inference of its identity based on the maximum similarity pattern with the known samples. This process is labor- intensive, time-consuming, error-prone, and subjective. Therefore, automation of sorting and sim- ilarity in calculation would be advantageous. Here we present a MATLAB-based graphical user interface (GUI) tool named BioCluster. This tool was designed for automated clustering and iden- tification of Enterobacteriaceae based on biochemical test results. In this tool, we used two types of algorithms, i.e., traditional hierarchical clustering (HC) and the Improved Hierarchical Clustering (IHC), a modified algorithm that was developed specifically for the clustering and identification of within this species. IHC takes into account the variability in result of 1-47 biochemical tests family. This tool also provides different options to optimize the clus- tering in a user-friendly way. Using computer-generated synthetic data and some real data, we have demonstrated that BioCluster has high accuracy in clustering and identifying enterobacterial species based on biochemical test data. This tool can be freely downloaded at http://microbialgen.du.ac.bd/ biocluster/.
出处 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2015年第3期192-199,共8页 基因组蛋白质组与生物信息学报(英文版)
基金 supported by the grants from the Ministry of Science and Technology (S&T) of Bangladesh (Grant No.HEQEP CP236) the University Grants Commission (UGC).
关键词 Bacterial identification ENTEROBACTERIACEAE Biochemical properties Clustering tool Identification tool Hierarchy algorithm Bacterial identification Enterobacteriaceae Biochemical properties Clustering tool Identification tool Hierarchy algorithm
  • 相关文献

参考文献15

  • 1Garrity G, Brenner D J, Krieg NR, Staley JRE. Bergey's Manual of Systematic Bacteriology. Vol 2: The Proteobacteria, Part B: The Gammaproteobacteria. Springer, US; 2005.
  • 2Linton AH, Hinton MH. Enterobacteriaceae associated with animals in health and disease. Soc Appl Bacteriol Symp Ser 1988;17:71S-85S.
  • 3Brenner DJ. Introduction to the family Enterobacteriaceae. In: Balows A, Trfiper HG, Dworkin M, Harder W, Schleifer KH, editors. The prokaryotes. New York: Springer-Verlag; 1992. p. 2673-95.
  • 4Traub WH, Raymond EA, Linehan J. Identification of Enter- obacteriaceae in the clinical microbiology laboratory. Appl Microbiol 1970;20:303-8.
  • 5Carpenter KP, Lapage SP, Steel KJ. Biochemical identification of Enterobacteriaceae. In: Gibbs BM, Skinner FA, editors. Identi- fication methods for microbiologists. London: Academic press; 1966. p. 21-3.
  • 6Carpenter KP, Lapage SP, Steel KJ. Biochemical dentticaton of Enterobacteriaceae. In: Gibbs BM, Skinner FA, editors. Identi- fication methods for microbiologists. London: Academic press; 1966. p. 21-3.
  • 7Minka T. Distance measures as prior probabilities. Technical report, Carnegie Mellon University, 2000.
  • 8Jordan MI, Kearns M J, Solla SA. Advances in neural information processing systems 10. Proceedings of the 1997 Conference. 1998.
  • 9Johnson SC. Hierarchical clustering schemes. Psychometrika 1967;32:241-54.
  • 10Tan P-N, Steinbach M, Kumar V. Introduction to data mining. 1st ed. Pearson Education India, 2007.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部