Data expressing methods in DNA microarray based on text mining
Abstract: DNA microarray technology is a functional genomics research technique developed in recent years. It can be used to identify related functional genes and their expression regulatory networks at the genome level, which helps clarify the biological functions and mechanisms of these genes. Drawing on related research in text mining, this paper discusses the principles and characteristics of DNA microarray technology, the analysis and interpretation of microarray data, gene clustering methods, and literature-based DNA microarray analysis. Literature-based microarray analysis identifies implicit, semantically related biological concepts and applies inference to them to discover hidden new knowledge. Four kinds of analysis methods are described: those based on statistics, natural language processing, association rule mining, and pattern recognition. Text mining-based DNA microarray analysis helps discover interactions between genes or proteins, automatically recognize biological terms, and improve the efficiency of data analysis.
Authors: ZHANG Wei (张薇), CUI Lei (崔雷)
Source: Chinese Journal of Medical Library and Information Science (《中华医学图书情报杂志》), CAS, 2010, No. 5, pp. 11-15 (5 pages)
Keywords: DNA microarray; text mining; cluster analysis; literature profile; association rule; natural language processing; pattern recognition