期刊文献+

通过文本挖掘获取疾病相关功能信息 被引量:3

Retrieving Gene Functional Information through Text Mining
下载PDF
导出
摘要 通过定位候选策略和全基因组关联研究等方法,很多人类遗传疾病的致病基因已经定位到某个或某些染色体区间,利用计算机将染色体区间中众多的基因减少到易于实验分析的数目是寻找疾病基因的一个很重要的方法。大部分已有的预测疾病基因的方法都是利用已知致病基因的各类注释信息来预测疾病基因的。但是,目前依然有很多疾病尚没有任何具体的注释信息,这样就无法利用已有的基于已知基因信息的预测方法来识别致病基因。针对这个问题,通过挖掘生物医学文献数据库,结合人类基因产物蛋白质的功能注释数据库,从中提取与疾病相关的功能信息。这样,就可以基于这些挖掘出来的功能信息来实现这类疾病基因的预测。 Many disease genes are located within one or more specific chromosomal regions through position candidate approaches and genome-wide association studies. Prioritizing candidate genes by computational algorithms is important strategy to speed the identification of disease genes. Most approaches to identify disease genes based on function annotations have been presented in recent years. Most of them,starting from the function annotations of known genes associated with diseases,however,can not be used to identify genes for diseases without any known pathogenic genes or related function annotations. For such diseases, a new method is proposed to retrieve ralated gene functional information by mining biomedical literature and protein function annotation database. Thus, the genes for diseases lacking known causative genes also could be identified based on the gene function annotations mined.
出处 《微计算机信息》 2009年第36期1-3,共3页 Control & Automation
基金 基金申请人:周艳红 项目名称:人类遗传疾病相关基因的生物信息学分析与预测 基金颁发部门:国家自然科学基金(90608020) 基金申请人:周艳红 项目名称:基因发现与分析的生物信息学平台研制与应用研究 基金颁发部门:教育部(NCET-06-0651)
关键词 疾病基因 预测 基因本体 文本挖掘 disease gene prediction GO text mining
  • 相关文献

参考文献12

  • 1Lander E S, Linton L M, Bitten B, et al. Initial sequencing and analysis of the human genome[J]. Nature, 2001, 409(6822): 860-921.
  • 2Yan S. Positional candidate cloning of disease genes [J]. Life Sciences, 1999, 11(5): 205-508.
  • 3McCarthy M I, Smedley D, and Hide W. New methods for find- ing disease-susceptibility genes: impact and potential [J]. Genome Biol, 2003, 4(10): 119.
  • 4Franke L, Bakel H, Fokkens L, et al. Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes[J]. Am J Hum Genet, 2006, 78(6): 1011- 1025.
  • 5Perez-Iratxeta C, Wjst M, Bork P, et al. G2D: a tool for mining genes associated with disease[J]. BMC Genet, 2005, 6: 45.
  • 6Turner F S, Clutterbuck D R, and Semple C A. POCUS: mining genomic sequence annotation to predict disease genes [J]. Genome Biol, 2003, 4(11): R75.
  • 7Freudenberg j and Propping P. A similarity-based method for genome-wide prediction of disease-relevant human genes [J]. Bioinformatics, 2002, 18 Suppl 2:S110-115.
  • 8MEDLINE/PubMed, http://www.ncbi.nlm.nih.gov/PubMed.
  • 9EBI GOA project, http://www.ebi.ac.uk/GOA/index.html.
  • 10杨丽华,戴齐,杨占华.文本分类技术研究[J].微计算机信息,2006(05X):209-211. 被引量:13

二级参考文献7

  • 1张先飞,李弼程,刘安斐.基于改进KNFL算法的海量文本分类研究[J].微计算机信息,2005,21(11S):159-160. 被引量:4
  • 2AH-HWEE TAN.Text Mining:The state of the art and the challenges [C].PAKDD'99 Workshop on Knowledge discovery from Advanced Databases (KDAD'99),Beijing,1999.
  • 3Fabrizio Sebastiani.Machine Learning in Automated Text Categorization[J].ACM Computing Sruveys,2002,34(1):1-47.
  • 4Yang Yiming,Pederson J O.A Comparative Study on Feature Selection in Text Categorization[C].Proceedings of the 14th International Conference on Machine learning.Nashville:Morgan Kanfmann,1997: 412-420.
  • 5Mlademnic,D.,Grobelnik,M.Feature Selection for unbalanced class distribution and Native Bayees [C].Proceedings of the Sisteenth International Conference on Machine Learning.Bled:Morgan Kanfmann, 1999:258-267.
  • 6Belur V D.Nearest Neighbor(NN)Norms:NN pattern Classification Techniques [J].IEEE Computer Society Press,New York:IEEE press, 1991.59.
  • 7Joachims T.Text Categorization with Support Vector Machines:Learning with Many Relevant Features [J].Machine Learning,1998,11398:137-142.

共引文献12

同被引文献18

  • 1尹招琴,朱维斌,李文军.提高大型仪器使用效率 培养学生创新能力[J].实验室研究与探索,2009,28(1):160-162. 被引量:45
  • 2张新德.企业设备维护管理要点浅析[J].硅谷,2009,2(2). 被引量:6
  • 3Aerts S, Gene Prioritization Through Genomic Data Fusion[J]. Nature Biotechnol, 2006, 24(5): 537-544.
  • 4Franke L, Bakel H, Fokkens L, et al. Reconstruction of a Functional Human Gene Network, with an Application for Pfioritizing Positional Candidate Genes[J]. The American Journal of Human Genetics, 2006, 78(6): 1011-1025.
  • 5Iratxeta P C, Wjst M, Bork P, et al. G2D: A Tool for Mining Genes Associated with Disease[J]. BMC Genetics, 2005, 6(3): 45-53.
  • 6Turner F S, Clutterbuck D R, Semple C A. POCUS: Mining Genomic Sequence Annotation to Predict Disease Genes[J]. Genome Biology, 2003, 4(11): 75-83.
  • 7Freudenberg J, Propping E A Similarity-based Method for Genome-wide Prediction of Disease-relevant Human Genes[J]. Bioinformatics, 2002, 18(2): 110-115.
  • 8Sanchez J ~ Barton C, David V. Human Disease Genes[J]. Nature, 2001,409(15): 853-855.
  • 9Mann , W.C. and Thompson, S.A.Rhetorical Structure theory: A theory of text organization Information Sciences Institute ,Universi- ty of Southern California,1987.
  • 10Hearst M.A. Text Tiling:A Quantitative Approach to Discourse Segmentation Technical Report Sequoia 93/24 Berkeley:University of California, 1993.

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部