非高斯性数据间的因果网络已经在经济学、生物学和环境学等学科得到了广泛应用.DirectLingam(Direct Method for Learning a Linear Non-Gaussian Structural Equation Model)算法是其中一个经典解法,但其存在维度达到25维度以上时外生...非高斯性数据间的因果网络已经在经济学、生物学和环境学等学科得到了广泛应用.DirectLingam(Direct Method for Learning a Linear Non-Gaussian Structural Equation Model)算法是其中一个经典解法,但其存在维度达到25维度以上时外生变量(exogenous variable)识别率低的问题,进而产生级联效应,使得整个网络的估计误差随着层数增大越来越大.为此提出了一种基于负熵局部选择外生变量的DirectLingam算法(LS-DirectLingam),把变量的非高斯性作为外生变量选择的标准,用负熵来度量变量的非高斯,选择负熵最大的k个变量存入局部目标变量集合Lv中,在集合Lv中进一步去寻找外生变量,从而提高了外生变量的识别率.与基本的DirectLingam算法进行实验比较,结果表明LS-DirectLingam算法优于DirectLingam算法.展开更多
Blackleg disease caused by Phoma lingam is an important disease of oil seed rape (Brassica napus) causing losses up to 95%. The efficacy of microbial antagonists against P. lingam in greenhouse was tested. Serratia pl...Blackleg disease caused by Phoma lingam is an important disease of oil seed rape (Brassica napus) causing losses up to 95%. The efficacy of microbial antagonists against P. lingam in greenhouse was tested. Serratia plymuthica HRO-C48 and Gliocladium catenulatum J1446 were able to reduce the disease intensity of OSR cotelydones infested with P. lingam at rates 44% and 52% respectively. The reduction of the infestation of the root collar in BBCH14/15 was evaluated as well. Plants treated with a suspension of the antagonists (2 × 105 cfu/plant) and inoculated with either pycnidiospore suspension (2 × 107 cfu/ml) or agar disks grown with P. lingam mycelium, showed a reduced infestation rate of 53% - 93% in the presence of S. plymuthica and 46% - 77% in the presence of G. catenulatum. The efficacy of the antagonist depends highly on their concentration inside OSR seeds. Below 105 cfu/seed no significant difference was recorded between control untreated and infested plants.展开更多
Finding causality merely from observed data is a fundamental problem in science. The most basic form of this causal problem is to determine whether X leads to Y or Y leads to X in the case of joint observation of two ...Finding causality merely from observed data is a fundamental problem in science. The most basic form of this causal problem is to determine whether X leads to Y or Y leads to X in the case of joint observation of two variables X, Y. In statistics, path analysis is used to describe the direct dependence between a set of variables. But in fact, we usually do not know the causal order between variables. However, ignoring the direction of the causal path will prevent researchers from analyzing or using causal models. In this study, we propose a method for estimating causality based on observed data. First, observed variables are cleaned and valid variables are retained. Then, a direct linear non-Gaussian acyclic graph models(DirectLiNGAM) estimates the causal order K between variables. The third step is to estimate the adjacency matrix B of the causal relationship based on K. Next, since B is not convenient for model interpretation, we use adaptive lasso to prune the causal path and variables. Further, a causal path graph and a recursive model are established. Finally, we test and debug the recursive model, obtain a causal model with good fit, and estimate the direct, indirect and total effects between causal variables. This paper overcomes the randomness assigning causal order to variables. This study is different from the researcher’s understanding of his own model by generating some form of simulation data. The simplest and relatively unsmooth statistical learning method used in this study has obvious advantages in the field of interpretable machine learning.展开更多
文摘非高斯性数据间的因果网络已经在经济学、生物学和环境学等学科得到了广泛应用.DirectLingam(Direct Method for Learning a Linear Non-Gaussian Structural Equation Model)算法是其中一个经典解法,但其存在维度达到25维度以上时外生变量(exogenous variable)识别率低的问题,进而产生级联效应,使得整个网络的估计误差随着层数增大越来越大.为此提出了一种基于负熵局部选择外生变量的DirectLingam算法(LS-DirectLingam),把变量的非高斯性作为外生变量选择的标准,用负熵来度量变量的非高斯,选择负熵最大的k个变量存入局部目标变量集合Lv中,在集合Lv中进一步去寻找外生变量,从而提高了外生变量的识别率.与基本的DirectLingam算法进行实验比较,结果表明LS-DirectLingam算法优于DirectLingam算法.
文摘Blackleg disease caused by Phoma lingam is an important disease of oil seed rape (Brassica napus) causing losses up to 95%. The efficacy of microbial antagonists against P. lingam in greenhouse was tested. Serratia plymuthica HRO-C48 and Gliocladium catenulatum J1446 were able to reduce the disease intensity of OSR cotelydones infested with P. lingam at rates 44% and 52% respectively. The reduction of the infestation of the root collar in BBCH14/15 was evaluated as well. Plants treated with a suspension of the antagonists (2 × 105 cfu/plant) and inoculated with either pycnidiospore suspension (2 × 107 cfu/ml) or agar disks grown with P. lingam mycelium, showed a reduced infestation rate of 53% - 93% in the presence of S. plymuthica and 46% - 77% in the presence of G. catenulatum. The efficacy of the antagonist depends highly on their concentration inside OSR seeds. Below 105 cfu/seed no significant difference was recorded between control untreated and infested plants.
基金Supported by the National Natural Science Foundation of China(61573266)
文摘Finding causality merely from observed data is a fundamental problem in science. The most basic form of this causal problem is to determine whether X leads to Y or Y leads to X in the case of joint observation of two variables X, Y. In statistics, path analysis is used to describe the direct dependence between a set of variables. But in fact, we usually do not know the causal order between variables. However, ignoring the direction of the causal path will prevent researchers from analyzing or using causal models. In this study, we propose a method for estimating causality based on observed data. First, observed variables are cleaned and valid variables are retained. Then, a direct linear non-Gaussian acyclic graph models(DirectLiNGAM) estimates the causal order K between variables. The third step is to estimate the adjacency matrix B of the causal relationship based on K. Next, since B is not convenient for model interpretation, we use adaptive lasso to prune the causal path and variables. Further, a causal path graph and a recursive model are established. Finally, we test and debug the recursive model, obtain a causal model with good fit, and estimate the direct, indirect and total effects between causal variables. This paper overcomes the randomness assigning causal order to variables. This study is different from the researcher’s understanding of his own model by generating some form of simulation data. The simplest and relatively unsmooth statistical learning method used in this study has obvious advantages in the field of interpretable machine learning.