摘要
DNA微阵列技术是近年来发展起来的一种功能基因组学研究技术。利用DNA微阵列技术,可以从基因组水平上鉴定出相关的功能基因及其表达调控网络,有助于阐明这些基因的生物学功能及其机理。结合文本挖掘的相关研究结果,探讨了DNA微阵列技术的原理和特点,数据的分析和解释,基因的聚类方法和基于文献的DNA微阵列分析。通过基于文献的微阵列分析方法,找出隐含的、具有语义关联的生物概念,并进行推理,发现隐性的新知识。具体阐述了基于统计、基于自然语言处理、基于关联规则挖掘,基于模式识别的4种分析方法。基于文本挖掘的DNA微阵列技术,有利于发现基因或蛋白质之间的相互作用关系,自动识别生物学名词,提高数据分析效率等。
DNA microarray,developed in recent years, is a technique used in study of functional genomics, and can be used to identify related functional genes and their expression control networks at genomics level, thus contributing to the explanation of the biological function and mechanism of such genes. Tne principles and characteristics of DNA microarray,data analysis and explanation as well as duster methods of genes using DNA micmarray,and litera- ture-based DNA microarray analysis were studied based on the related studies on text mining. The concealed and semantic-related biological concept in literature was detected using DNA microarray to discover the invisible novel knowledge by inference. The 4 analyzing methods based on statistics, natural language processing, association rule mining, and pattern recognition were discussed. Text ming-based DNA microarray technique can discover the interaction between genes or proteins, identify biological terms, and improve data analysis efficacy.
出处
《中华医学图书情报杂志》
CAS
2010年第5期11-15,共5页
Chinese Journal of Medical Library and Information Science
关键词
DNA微阵列
文本挖掘
聚类分析
文献轮廓
关联规则
自然语言处理
模式识别
DNA microarray
text mining
cluster analysis
literature profile
association rule
natural lan- guage processing
pattern recognition