从基因芯片数据快速有效地挖掘共调控基因

Mining co-regulated genes from microarray data quickly and effectively

下载PDF

导出

摘要针对基因芯片数据高噪音、列(基因)数比行(实验条件)数多几个数量级的特殊性,为了进一步提高从基因芯片数据挖掘共调控基因的时间效率和挖掘结果的有效性,首先根据所有两两基因对之间的Pearson相关系数对原始完整数据集进行分组,然后使用列(基因)枚举方法对各组数据分别进行闭合频繁模式挖掘,并对活化和抑制共调控关系的挖掘分别进行处理。实验结果证明:算法快速有效地挖掘出了两种共调控基因。 Microarray data sets typically contain strong noise and an order of magnitude more genes than experiments.To further reduce the running time and improve the validity of co-regulated genes mined from microarray data,a new method is proposed which firstly groups all genes according to the Pearson correlation coefficient between every two genes,then uses column（gene）enumeration to mine closed frequent patterns as positive or negative co-regulated genes for each group.The experimental results show that the proposed approach can quickly and effectively mine two kinds of co-regulated genes from microarray data.

作者赵倩尚学群

机构地区西北工业大学计算机学院

出处《计算机工程与应用》 CSCD 北大核心 2010年第9期33-37,共5页 Computer Engineering and Applications

基金陕西省自然科学基金(No.2007F27)~~

关键词基因芯片数据共调控基因 Pearson相关系数闭合频繁模式 microarray data co-regulated genes Pearson correlation coefficient closed frequent pattern

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献12

1刘万霖,李栋,朱云平,贺福初.基于微阵列数据构建基因调控网络[J].遗传,2007,29(12):1434-1442. 被引量：4
2李传星,李霞,郭政,宫滨生,屠康.调控通路内基因表达的相关性分析[J].遗传,2004,26(6):929-933. 被引量：5
3Ka Y Y,Mario M,Roger E B.From co-expression to co-regulation: How many microarray experiments do we need?[J].Genome Biology,2004,5.
4Mclntosh T,Chawla S.High-confidence rule mining for microarray analysis[J].IEEE/ACM TCBB,2007,4(4) : 611-623.
5Feng Pan, Gao Cong, Yang Jiong, et al.CARPENTER: Finding closed patterns in long biological datasets[C]//Proc ACM SIGKDD Int'l Conf on Knowledge Discovery and Data Mining(KDD),2003:637-642.
6Gao Cong,Tan Kian-Lee,Tung A K H,et al.Mining frequent closed patterns in microarray data[C]//Proceedings of the 4th IEEE International Conference on Data Mining(ICDM' 04), 2004: 363-366.
7Feng Pan,Gao Cong,Xu Xin,et al.COBBLER:Combining column and row enumeration for closed pattern discovery[C]//Proe of the 16th Int Conf on Scientific and Statistical Database Management, 2004: 21-30.
8Spellman P T,Sherlock G,Zhang M Q,et al.Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization[J].Mol Biol Cell,1998,9: 3273-3297.
9Oba S,Sato M,Takemasa I,et al.A Bayesian missing value estimation method[J].Bioinformatics,2003,19:2088-2096.
10Dominic J A,Isaac S K,Atul J B.Quantifying the relationship between co-expression,co-regulation and gene function[J].BMC Bioinformatics, 2004,5.

二级参考文献52

1李传星,李霞,郭政,宫滨生,屠康.调控通路内基因表达的相关性分析[J].遗传,2004,26(6):929-933. 被引量：5
2[1]Sherlock G,Hernandez-Boussard T,Kasarskis A,Binkley G,Matese JC,Dwight SS,Kaloper M,Weng S,Jin H,Ball CA,Eisen MB,Spellman PT,Brown PO,Botstein D,Cherry JM. The Stanford Microarray Database. Nucleic Acids Res,2001,29(1): 152～155.
3[2]Yoav Arava,Yulei Wang,John D Storey,Chih Long Liu,Patrick O Brown,Daniel Herschlag. Genome-wide analysis of mRNA translation profiles in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A,2003,100(7): 3889～3894.
4[3]Audrey P Gasch,Mingxia Huang,Sandra Metzner,David Botstein,Stephen J Elledge,Patrick O Brown. Genomic expression responses to DNA-damaging agents and the regulatory role of the yeast ATR homolog Mec1p. Mol Biol Cell,2001,12(10): 2987～3003.
5[4]Priya Sudarsanam,Vishwanath R Iyer,Patrick O Brown,Fred Winston. Whole-genome expression analysis of snf/swi mutants of Saccharomyces cerevisiae. PNAS,2000,97(7): 3364～3369.
6[5]Audrey P Gasch,Paul T Spellman,Camilla M Kao,Orna Carmel-Harel,Michael B Eisen,Gisela Storz,David Botstein,Patrick O Brown. Genomic expression programs in the response of yeast cells to environmental changes.Mol Biol Cell,2000,11(12): 4241～4257.
7[6]Paul T Spellman,Gavin Sherlock,Michael Q Zhang,Vishwanath R Iyer,Kirk Anders,Michael B Eisen,Patrick O Brown,David Botstein,Bruce Futcher. Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.Mol Biol Cell,1998,9(12): 3273～3297.
8[7]Minoru Kanehisa and Susumu Goto. KEGG: Kyoto Encyclopedia of Genes and Genomes.Nucleic Acids Research,2000,28: 127～130.
9[8]Semyon Kruglyak,Haixu Tang. Regulation of adjacent yeast genes.TIG,2000,16(3): 109～111.
10Wyrick J J, Young RA. Deciphering gene expression regulatory networks. Curr Opin Genet Dev, 2002, 12(2): 130-136.

共引文献7

1刘万霖,李栋,朱云平,贺福初.基于微阵列数据构建基因调控网络[J].遗传,2007,29(12):1434-1442. 被引量：4
2周才秀,王忠,荆志伟,谢力,胡木林.基于微阵列数据构建的基因通路分析软件的研究进展[J].现代生物医学进展,2009,9(7):1352-1355.
3强波,王正志,王广云.肿瘤特征基因筛选与调控网络构建[J].系统仿真学报,2009,21(13):4163-4166.
4于连江,吴春国,郭立强,梁艳春,杨锌朔.易物模型及其求解算法[J].吉林大学学报（理学版）,2010,48(4):653-657. 被引量：1
5缑葵香,宫秀军,汤莉.基于时序互信息构建基因调控网络[J].天津大学学报,2010,43(7):655-660. 被引量：5
6王文杰,侯艳,李康.基因组学数据的网络构建与分析方法[J].中国卫生统计,2017,34(1):177-180. 被引量：4
7杨署光,杨秀光,赵悦,史敏晶,李言,邓小敏,晁金泉,田维敏.橡胶树橡胶生物合成调控相关基因的表达相关性分析[J].广西植物,2020,40(12):1790-1799. 被引量：1

1张黎,逄涣利,王小虎,王佳.一种共调控基因C均值模糊聚类算法[J].计算机工程与应用,2010,46(7):32-33. 被引量：2
2杨传耀,张成洪,胡运发.一种基于投影和树的闭合频繁模式算法[J].模式识别与人工智能,2008,21(1):6-11.
3白天,周春光,刘桂霞,王晗,王喆,张宏婷.一种共调控基因聚类的新方法[J].吉林大学学报（理学版）,2009,47(2):292-298. 被引量：2
4赵宇海,乔百友,林天亮,王国仁.一种基于广义相似性的共调控基因聚类算法[J].东北大学学报（自然科学版）,2009,30(11):1558-1561. 被引量：1
5印莹,赵宇海,张斌,王国仁.时序微阵列数据中的同步和异步共调控基因聚类[J].计算机学报,2007,30(8):1302-1314. 被引量：5
6李小梅,郭红,吕暾.一种采用新的相似性度量方法的共调控基因动态模糊聚类算法[J].福州大学学报（自然科学版）,2011,39(2):198-205. 被引量：1
7张广治,何洁月.生物数据网格的应用研究[J].计算机工程,2004,30(2):3-4.
8TSAU Minhe,KAO Weiwen,CHANG Albert.Generalized Artificial Life Structure for Time-dependent Problems[J].Chinese Journal of Mechanical Engineering,2009,22(3):317-324. 被引量：1
9张宏怡,张军英.延迟基因调控网络重构问题研究[J].西安电子科技大学学报,2007,34(5):809-813. 被引量：1
10薛劼,郭红.一种动态时间弯曲距离的时延调控基因相似度量聚类方法[J].福州大学学报（自然科学版）,2013,41(2):158-163. 被引量：1

计算机工程与应用

2010年第9期

浏览历史

内容加载中请稍等...

从基因芯片数据快速有效地挖掘共调控基因

参考文献12

二级参考文献52

共引文献7

相关作者

相关机构

相关主题

浏览历史