Identifying cancer genes from cancer mutation profiles by cancer functions

Abstract: Identifying new cancer genes from large-scale genome screens of gene mutations in cancers is of great importance. Because alterations of certain essential functions are indispensable for oncogenesis, we define these as cancer functions and select, as their approximations, a group of detailed functions in GO (Gene Ontology) that are highly enriched with known cancer genes. To evaluate how efficiently cancer functions serve as features for identifying cancer genes, we define, among the screened genes, the known protein kinase cancer genes as gold standard positives and the other kinase genes as gold standard negatives. The results show that cancer-associated functions are more efficient in identifying cancer genes than the selection pressure feature. Furthermore, combining cancer functions with the number of non-silent mutations generates more reliable positive predictions. Finally, with a precision of 0.42, we suggest a list of 46 kinase genes as candidate cancer genes, which are annotated to cancer functions and carry at least 3 non-silent mutations.
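The selection rule and the precision measure described in the abstract can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the abstract does not name the enrichment test, so a one-sided hypergeometric test is used as a common stand-in, and the gene names (`K1` to `K5`), the annotation flags, the mutation counts, and the helper functions are hypothetical toy values, not data or code from the paper.

```python
from math import comb


def hypergeom_enrichment_p(total, annotated, known, overlap):
    """One-sided hypergeometric p-value P(X >= overlap).

    total:     number of screened genes
    annotated: genes annotated to the GO function
    known:     known cancer genes among the screened genes
    overlap:   known cancer genes annotated to the function
    (Assumed test; the abstract does not specify one.)
    """
    upper = min(annotated, known)
    favourable = sum(
        comb(annotated, k) * comb(total - annotated, known - k)
        for k in range(overlap, upper + 1)
    )
    return favourable / comb(total, known)


def precision(predicted, positives):
    """Fraction of predicted genes that are gold standard positives."""
    predicted = set(predicted)
    if not predicted:
        return 0.0
    return len(predicted & set(positives)) / len(predicted)


def candidates(genes, min_mutations=3):
    """Genes annotated to a cancer function with >= min_mutations non-silent mutations."""
    return sorted(
        name
        for name, features in genes.items()
        if features["cancer_function"] and features["non_silent"] >= min_mutations
    )


# Hypothetical toy kinase genes (not from the paper).
toy_genes = {
    "K1": {"cancer_function": True, "non_silent": 5},
    "K2": {"cancer_function": True, "non_silent": 3},
    "K3": {"cancer_function": True, "non_silent": 1},
    "K4": {"cancer_function": False, "non_silent": 4},
    "K5": {"cancer_function": False, "non_silent": 0},
}
toy_gold_positives = {"K1", "K3"}  # hypothetical known cancer kinases

selected = candidates(toy_genes)
print(selected, precision(selected, toy_gold_positives))
```

On this toy set the rule selects `K1` and `K2`, one of which is a gold standard positive, giving a precision of 0.5. The threshold of 3 non-silent mutations matches the one stated in the abstract; all other values are placeholders.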
Source: Science China (Life Sciences) [中国科学(生命科学英文版)], SCIE, CAS, 2008, No. 6, pp. 569-574 (6 pages)
Funding: the National Natural Science Foundation of China (Grant Nos. 30370388, 30670539 and 30770558)
Keywords: mutation profile, cancer gene, Gene Ontology, gene function, prediction