期刊文献+

基于全局和局部回归的因果定向改进算法 被引量:2

AN IMPROVED CAUSAL-EFFECT ORIENTATION ALGORITHM BASED ON THE GLOBAL AND LOCAL REGRESSION
下载PDF
导出
摘要 从观测数据中探索和发现蕴含在变量间的因果关系是大数据时代的基本任务之一,它将在未来各种数据驱动应用中发挥关键作用。推断观测数据间因果关系的方向是此任务的一类基础问题。最近研究表明基于最小描述长度MDL(minimum description length)的全局和局部回归(GLR)算法具有较高的推断准确率及较广的适用性。然而,在GLR模型中由于冗余模型的存在而严重限制了该算法的效率。为避免模型冗余,根据模型的不同特征采取分别构建GLR模型的方法,并在此基础上提出一个改进的用于因果定向的ISLOPE算法。实验结果表明,在保持原算法准确率近似不变的前提下,该算法有效地节约了运行时间,进而提升了算法效率。 Exploring and uncovering the causality between variables from the observed data is one of the fundamental tasks in the era of big data, and will play a crucial role in various data driven applications in the future. Inferring the direction of causality between observation data is a basic problem of this task. Recent studies have shown that the global and local regression(GLR) algorithm based on the minimum description length (MDL) has higher inferential accuracy and wider applicability. However, the efficiency of the algorithm is significantly limited due to the redundant models in CLR model. In the paper, we adopted the method of building CLR model separately according to the different characteristics of the model to avoid the model redundancy. And on this basis, we proposed an improved SLOPE algorithm for causal orientation. Experimental results show that ISLOPE can effectively save the running time and improve the efficiency under the premise that the accuracy of the original algorithm is approximately unchanged.
作者 潘孟姣 蔡青松 Pan Mengjiao;Cai Qingsong(School of Computer and Information Engineering,Beijing Technology and Business University,Beijing 100048,China)
出处 《计算机应用与软件》 北大核心 2018年第10期238-244,共7页 Computer Applications and Software
基金 北京市自然科学基金项目(4172013)
关键词 全局/局部回归模型 最小描述长度 模型冗余 因果定向 加性噪声模型 Global/Local regression model Minimum description length Model redundancy Causal-effect orientation Additive noise model
  • 相关文献

参考文献2

二级参考文献22

  • 1KARLEBACH G,SHAMIR R.Modelling and analysis of gene regulatory networks[J].Nature Reviews Molecular Cell Biology,2008,9(10):770-80.
  • 2CHEN T,HE H,CHURCH M.Modeling gene expression with differential equations[C]//Pacific symposium on biocomputing.Hawaii,USA:UC San Francisco,1999:4-16.
  • 3FRIEDMAN N,LINIAL M,NACHMAN I,et al.Using Bayesian networks to analyze expression data[J].Journal of computational biology,2000,7(3/4):601-20.
  • 4MURPHY K,SAIRA M.Modelling Gene Expression Data Using Dynamic Bayesian Networks[R].Technical report,Berkeley:Computer Science Division University of California,1999.
  • 5EMMERT-STREIB F,GLAZKO G,DE MATOS SIMOES R,et al.Statistical inference and reverse engineering of gene regulatory networks from observational expression data[J].Frontiers in genetics,2012,3:8-23.
  • 6BUTTE A,KOHANE I.Mutual information relevance networks:functional genomic clustering using pairwise entropy measurements[C]//Pac Symp Biocomput.Stanford USA:Stanford University,2000:418-429.
  • 7QIU P,GENTLES A,PLEVRITIS S.Fast calculation of pairwise mutual information for gene regulatory network reconstruction[J].Computer methods and programs in biomedicine,2009,94(2):177-180.
  • 8MARGOLIN A,NEMENMAN I,BASSO K,et al.ARACNE:an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context[J].BMC Bioinformatics,2006,I(Suppl I):7-22.
  • 9FAITH J,HAYETE B,THADEN J,et al.Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles[J].PLoS biology,2007,5(1):8-21.
  • 10ALTAY G,EMMERT-STREIB F.Structural influence of gene networks on their inference:analysis of C3NET[J].Biol Direct,2011(6):31-47.

共引文献38

同被引文献19

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部