Estimating the intensity of outbursts of coal and gas is important as the intensity and frequency of outbursts of coal and gas tend to increase in deep mining. Fully understanding the major factors contributing to coa...Estimating the intensity of outbursts of coal and gas is important as the intensity and frequency of outbursts of coal and gas tend to increase in deep mining. Fully understanding the major factors contributing to coal and gas outbursts is significant in the evaluation of the intensity of the outburst. In this paper, we discuss the correlation between these major factors and the intensity of the outburst using Analysis of Variance(ANOVA) and Contingency Table Analysis(CTA). Regression analysis is used to evaluate the impact of these major factors on the intensity of outbursts based on physical experiments. Based on the evaluation, two simple models in terms of multiple linear and nonlinear regression were constructed for the prediction of the intensity of the outburst. The results show that the gas pressure and initial moisture in the coal mass could be the most significant factors compared to the weakest factor-porosity. The P values from Fisher's exact test in CTA are: moisture(0.019), geostress(0.290), porosity(0.650), and gas pressure(0.031). P values from ANOVA are moisture(0.094), geostress(0.077), porosity(0.420), and gas pressure(0.051). Furthermore, the multiple nonlinear regression model(RMSE: 3.870) is more accurate than the linear regression model(RMSE: 4.091).展开更多
The thermal decomposition temperature is one of the most important parameters to evaluate fire hazard of organic peroxide. A quantitative structure-property relationship model was proposed for estimating the thermal d...The thermal decomposition temperature is one of the most important parameters to evaluate fire hazard of organic peroxide. A quantitative structure-property relationship model was proposed for estimating the thermal decomposition temperatures of organic peroxides. The entire set of 38 organic peroxides was at random divided into a training set for model development and a prediction set for external model validation. The novel local molecular descriptors of AT1, AT2, AT3, AT4, AT5, AT6 and global molecular descriptor of ATC have been proposed in order to character organic peroxides’ molecular structures. An accurate quantitative structure-property relationship (QSPR) equation is developed for the thermal decomposition temperatures of organic peroxides. The statistical results showed that the QSPR model was obtained using the multiple linear regression (MLR) method with correlation coefficient (R), standard deviation (S), leave-one-out validation correlation coefficient (RCV) values of 0.9795, 6.5676 ℃ and 0.9328, respectively. The average absolute relative deviation (AARD) is only 3.86% for the experimental values. Model test by internal leave-one-out cross validation and external validation and molecular descriptor interpretation were discussed. Comparison with literature results demonstrated that novel local and global descriptors were useful molecular descriptors for predicting the thermal decomposition temperatures of organic peroxides.展开更多
A study on the validity of volume equations currently used for three timber species, Entandrophragma cylindricum, Erythrophleum ivorensis and Pericopsis elata (Sapelli, Tali and Assamela respectively) in south east ...A study on the validity of volume equations currently used for three timber species, Entandrophragma cylindricum, Erythrophleum ivorensis and Pericopsis elata (Sapelli, Tali and Assamela respectively) in south east Cameroon, was conducted between the months of July and September, 2007 to evaluate their suitability for the site. Twenty-two percent sampling intensity was conducted within annual allowable cuts and diameter readings taken on standing trees with the aid of a wide band Relascope. A non linear regression equation model was employed to compute volume equations and the student's t-test for the analysis of the existing models. Based on individual tree volumes within stands, new equations for the three species were constructed. A comparison was made between the new equations and those that were being used at the site. Results indicated a total standing volume of 0.007 m3/ha obtained for the three species (Sapelli 0.003 m3/ha, Tali 0.002 m3/ha and Assamela 0.002 m3/ha). Two new volume equation models [B] and [C] were retained for their goodness-of-fit with [B] for Assamela and [C] for Sapelli and Tali. Results also showed that a total volume of 0.005 m3/ha was underestimated for the three species (Sapelli 0.002 m3/ha, Tali 0.001 m3/ha and Assamela 0.002 m3/ha) when existing volume equations were applied. It is imperative to construct new volume equations that are compatible with the ecological characteristics of the site using representative samples. Setting-up appropriate methods for their validation shall also serve as checks to future management errors.展开更多
Identifying the causal impact of' some intervention challenging when one is faced with correlated binary end-points in observational studies is a challenging task, and it is even more The statistical literature on an...Identifying the causal impact of' some intervention challenging when one is faced with correlated binary end-points in observational studies is a challenging task, and it is even more The statistical literature on analyzing such data is well documented. Dependence between observations from the same study subject in correlated data renders invalid the usual chi-square tests of independence and inflates the variance ofparameter estimates. Disaggregated approaches such as hierarchical linear models which are able to adjust for individual level covariate:s are favoured in the analysis of such data, thereby gaining power over aggregated and individual-level analyses. In this article the authors, therefore, address the issue of analyzing correlated data with dichotomous end-points by using hierarchical logistic regression, a generalization of the standard logistic regression model for independent outcomes.展开更多
For the two seemingly unrelated regression system, this paper proposed a new type of estimator called pre-test principal components estimator (PTPCE) and discussed some properties of PTPCE.
When the population, from which the samples are extracted, is not normally distributed, or if the sample size is particularly reduced, become preferable the use of not parametric statistic test. An alternative to the ...When the population, from which the samples are extracted, is not normally distributed, or if the sample size is particularly reduced, become preferable the use of not parametric statistic test. An alternative to the normal model is the permutation or randomization model. The permutation model is nonparametric because no formal assumptions are made about the population parameters of the reference distribution, i.e., the distribution to which an obtained result is compared to determine its probability when the null hypothesis is true. Typically the reference distribution is a sampling distribution for parametric tests and a permutation distribution for many nonparametric tests. Within the regression models, it is possible to use the permutation tests, considering their ownerships of optimality, especially in the multivariate context and the normal distribution of the response variables is not guaranteed. In the literature there are numerous permutation tests applicable to the estimation of the regression models. The purpose of this study is to examine different kinds of permutation tests applied to linear models, focused our attention on the specific test statistic on which they are based. In this paper we focused our attention on permutation test of the independent variables, proposed by Oja, and other methods to effect the inference in non parametric way, in a regression model. Moreover, we show the recent advances in this context and try to compare them.展开更多
This paper systematically studies the statistical diagnosis and hypothesis testing for the semiparametric linear regression model according to the theories and methods of the statistical diagnosis and hypothesis testi...This paper systematically studies the statistical diagnosis and hypothesis testing for the semiparametric linear regression model according to the theories and methods of the statistical diagnosis and hypothesis testing for parametric regression model.Several diagnostic measures and the methods for gross error testing are derived.Especially,the global and local influence analysis of the gross error on the parameter X and the nonparameter s are discussed in detail;at the same time,the paper proves that the data point deletion model is equivalent to the mean shift model for the semiparametric regression model.Finally,with one simulative computing example,some helpful conclusions are drawn.展开更多
One important model in handling the multivariate data is the varying-coefficient partially linear regression model. In this paper, the generalized likelihood ratio test is developed to test whether its coefficient fun...One important model in handling the multivariate data is the varying-coefficient partially linear regression model. In this paper, the generalized likelihood ratio test is developed to test whether its coefficient functions are varying or not. It is showed that the normalized proposed test follows asymptotically x2-distribution and the Wilks phenomenon under the null hypothesis, and its asymptotic power achieves the optimal rate of the convergence for the nonparametric hypotheses testing. Some simulation studies illustrate that the test works well.展开更多
基金provided by the Natural Science Foundation Project(Key)of Chongqing(No.cstc2013jjB0012)the National Natural Science Foundation of China(No.51434003)the National Natural Science Foundation of China(No.51474040)
文摘Estimating the intensity of outbursts of coal and gas is important as the intensity and frequency of outbursts of coal and gas tend to increase in deep mining. Fully understanding the major factors contributing to coal and gas outbursts is significant in the evaluation of the intensity of the outburst. In this paper, we discuss the correlation between these major factors and the intensity of the outburst using Analysis of Variance(ANOVA) and Contingency Table Analysis(CTA). Regression analysis is used to evaluate the impact of these major factors on the intensity of outbursts based on physical experiments. Based on the evaluation, two simple models in terms of multiple linear and nonlinear regression were constructed for the prediction of the intensity of the outburst. The results show that the gas pressure and initial moisture in the coal mass could be the most significant factors compared to the weakest factor-porosity. The P values from Fisher's exact test in CTA are: moisture(0.019), geostress(0.290), porosity(0.650), and gas pressure(0.031). P values from ANOVA are moisture(0.094), geostress(0.077), porosity(0.420), and gas pressure(0.051). Furthermore, the multiple nonlinear regression model(RMSE: 3.870) is more accurate than the linear regression model(RMSE: 4.091).
基金Project(2015SK20823) supported by Science and Technology Project of Hunan Province,ChinaProject(15A001) supported by Scientific Research Fund of Hunan Provincial Education Department,China+2 种基金Project(2017CL06) supported by Hunan Provincial Key Laboratory of Materials Protection for Electric Power and Transportation,ChinaProject(k1403029-11) supported by Science and Technology Project of Changsha City,ChinaProject(CX2015B372) supported by the Hunan Provincial Innovation Foundation for Postgraduate,China
文摘The thermal decomposition temperature is one of the most important parameters to evaluate fire hazard of organic peroxide. A quantitative structure-property relationship model was proposed for estimating the thermal decomposition temperatures of organic peroxides. The entire set of 38 organic peroxides was at random divided into a training set for model development and a prediction set for external model validation. The novel local molecular descriptors of AT1, AT2, AT3, AT4, AT5, AT6 and global molecular descriptor of ATC have been proposed in order to character organic peroxides’ molecular structures. An accurate quantitative structure-property relationship (QSPR) equation is developed for the thermal decomposition temperatures of organic peroxides. The statistical results showed that the QSPR model was obtained using the multiple linear regression (MLR) method with correlation coefficient (R), standard deviation (S), leave-one-out validation correlation coefficient (RCV) values of 0.9795, 6.5676 ℃ and 0.9328, respectively. The average absolute relative deviation (AARD) is only 3.86% for the experimental values. Model test by internal leave-one-out cross validation and external validation and molecular descriptor interpretation were discussed. Comparison with literature results demonstrated that novel local and global descriptors were useful molecular descriptors for predicting the thermal decomposition temperatures of organic peroxides.
文摘A study on the validity of volume equations currently used for three timber species, Entandrophragma cylindricum, Erythrophleum ivorensis and Pericopsis elata (Sapelli, Tali and Assamela respectively) in south east Cameroon, was conducted between the months of July and September, 2007 to evaluate their suitability for the site. Twenty-two percent sampling intensity was conducted within annual allowable cuts and diameter readings taken on standing trees with the aid of a wide band Relascope. A non linear regression equation model was employed to compute volume equations and the student's t-test for the analysis of the existing models. Based on individual tree volumes within stands, new equations for the three species were constructed. A comparison was made between the new equations and those that were being used at the site. Results indicated a total standing volume of 0.007 m3/ha obtained for the three species (Sapelli 0.003 m3/ha, Tali 0.002 m3/ha and Assamela 0.002 m3/ha). Two new volume equation models [B] and [C] were retained for their goodness-of-fit with [B] for Assamela and [C] for Sapelli and Tali. Results also showed that a total volume of 0.005 m3/ha was underestimated for the three species (Sapelli 0.002 m3/ha, Tali 0.001 m3/ha and Assamela 0.002 m3/ha) when existing volume equations were applied. It is imperative to construct new volume equations that are compatible with the ecological characteristics of the site using representative samples. Setting-up appropriate methods for their validation shall also serve as checks to future management errors.
文摘Identifying the causal impact of' some intervention challenging when one is faced with correlated binary end-points in observational studies is a challenging task, and it is even more The statistical literature on analyzing such data is well documented. Dependence between observations from the same study subject in correlated data renders invalid the usual chi-square tests of independence and inflates the variance ofparameter estimates. Disaggregated approaches such as hierarchical linear models which are able to adjust for individual level covariate:s are favoured in the analysis of such data, thereby gaining power over aggregated and individual-level analyses. In this article the authors, therefore, address the issue of analyzing correlated data with dichotomous end-points by using hierarchical logistic regression, a generalization of the standard logistic regression model for independent outcomes.
文摘For the two seemingly unrelated regression system, this paper proposed a new type of estimator called pre-test principal components estimator (PTPCE) and discussed some properties of PTPCE.
文摘When the population, from which the samples are extracted, is not normally distributed, or if the sample size is particularly reduced, become preferable the use of not parametric statistic test. An alternative to the normal model is the permutation or randomization model. The permutation model is nonparametric because no formal assumptions are made about the population parameters of the reference distribution, i.e., the distribution to which an obtained result is compared to determine its probability when the null hypothesis is true. Typically the reference distribution is a sampling distribution for parametric tests and a permutation distribution for many nonparametric tests. Within the regression models, it is possible to use the permutation tests, considering their ownerships of optimality, especially in the multivariate context and the normal distribution of the response variables is not guaranteed. In the literature there are numerous permutation tests applicable to the estimation of the regression models. The purpose of this study is to examine different kinds of permutation tests applied to linear models, focused our attention on the specific test statistic on which they are based. In this paper we focused our attention on permutation test of the independent variables, proposed by Oja, and other methods to effect the inference in non parametric way, in a regression model. Moreover, we show the recent advances in this context and try to compare them.
基金Supported by the National Natural Science Foundation of China (No. 40604001),the National High Technology Research and Development Program of China (No. 2007AA12Z312).Acknowledgement The authors thank Prof. Tao Benzao and Prof. Wang Xingzhou for several helpful suggestions during the preparation of this manuscript.
文摘This paper systematically studies the statistical diagnosis and hypothesis testing for the semiparametric linear regression model according to the theories and methods of the statistical diagnosis and hypothesis testing for parametric regression model.Several diagnostic measures and the methods for gross error testing are derived.Especially,the global and local influence analysis of the gross error on the parameter X and the nonparameter s are discussed in detail;at the same time,the paper proves that the data point deletion model is equivalent to the mean shift model for the semiparametric regression model.Finally,with one simulative computing example,some helpful conclusions are drawn.
基金supported by National Natural Science Foundation of China under Grant No.1117112the Fund of Shanxi Datong University under Grant No.2010K4+1 种基金the Doctoral Fund of Ministry of Education of China under Grant No.20090076110001National Statistical Science Research Major Program of China under Grant No.2011LZ051
文摘One important model in handling the multivariate data is the varying-coefficient partially linear regression model. In this paper, the generalized likelihood ratio test is developed to test whether its coefficient functions are varying or not. It is showed that the normalized proposed test follows asymptotically x2-distribution and the Wilks phenomenon under the null hypothesis, and its asymptotic power achieves the optimal rate of the convergence for the nonparametric hypotheses testing. Some simulation studies illustrate that the test works well.