The use of [1] Box-Cox power transformation in regression analysis is now common;in the last two decades there has been emphasis on diagnostics methods for Box-Cox power transformation, much of which has involved dele...The use of [1] Box-Cox power transformation in regression analysis is now common;in the last two decades there has been emphasis on diagnostics methods for Box-Cox power transformation, much of which has involved deletion of influential data cases. The pioneer work of [2] studied local influence on constant variance perturbation in the Box-Cox unbiased regression linear mode. Tsai and Wu [3] analyzed local influence method of [2] to assess the effect of the case-weights perturbation on the transformation-power estimator in the Box-Cox unbiased regression linear model. Many authors noted that the influential observations on the biased estimators are different from the unbiased estimators. In this paper I describe a diagnostic method for assessing the local influence on the constant variance perturbation on the transformation in the Box-Cox biased ridge regression linear model. Two real macroeconomic data sets are used to illustrate the methodologies.展开更多
In the teaching and researching of linear regression analysis, it is interesting and enlightening to explore how the dependent variable vector can be inner-transformed into regression coefficient estimator vector from...In the teaching and researching of linear regression analysis, it is interesting and enlightening to explore how the dependent variable vector can be inner-transformed into regression coefficient estimator vector from a visible geometrical view. As an example, the roadmap of such inner transformation is presented based on a simple multiple linear regression model in this work. By applying the matrix algorithms like singular value decomposition (SVD) and Moore-Penrose generalized matrix inverse, the dependent variable vector lands into the right space of the independent variable matrix and is metamorphosed into regression coefficient estimator vector through the three-step of inner transformation. This work explores the geometrical relationship between the dependent variable vector and regression coefficient estimator vector as well as presents a new approach for vector rotating.展开更多
In order to accurately predict bus travel time, a hybrid model based on combining wavelet transform technique with support vector regression(WT-SVR) model is employed. In this model, wavelet decomposition is used to e...In order to accurately predict bus travel time, a hybrid model based on combining wavelet transform technique with support vector regression(WT-SVR) model is employed. In this model, wavelet decomposition is used to extract important information of data at different levels and enhances the forecasting ability of the model. After wavelet transform different components are forecasted by their corresponding SVR predictors. The final prediction result is obtained by the summation of the predicted results for each component. The proposed hybrid model is examined by the data of bus route No.550 in Nanjing, China. The performance of WT-SVR model is evaluated by mean absolute error(MAE), mean absolute percent error(MAPE) and relative mean square error(RMSE), and also compared to regular SVR and ANN models. The results show that the prediction method based on wavelet transform and SVR has better tracking ability and dynamic behavior than regular SVR and ANN models. The forecasting performance is remarkably improved to obtain within 6% MAPE for testing section Ⅰ and 8% MAPE for testing section Ⅱ, which proves that the suggested approach is feasible and applicable in bus travel time prediction.展开更多
The similarity transformation model between different coordinate systems is not accurate enough to describe the discrepancy of them.Therefore,the coordinate transformation from the coordinate frame with poor accuracy ...The similarity transformation model between different coordinate systems is not accurate enough to describe the discrepancy of them.Therefore,the coordinate transformation from the coordinate frame with poor accuracy to that with high accuracy cannot guarantee a high precision of transformation.In this paper,a combined method of similarity transformation and regressive approximating is presented.The local error accumulation and distortion are taken into consideration and the precision of coordinate system is improved by using the recommended展开更多
The spatial interpolation for soil texture does not necessarily satisfy the constant sum and nonnegativity constraints. Meanwhile, although numeric and categorical variables have been used as auxiliary variables to im...The spatial interpolation for soil texture does not necessarily satisfy the constant sum and nonnegativity constraints. Meanwhile, although numeric and categorical variables have been used as auxiliary variables to improve prediction accuracy of soil attributes such as soil organic matter, they (especially the categorical variables) are rarely used in spatial prediction of soil texture. The objective of our study was to comparing the performance of the methods for spatial prediction of soil texture with consideration of the characteristics of compositional data and auxiliary variables. These methods include the ordinary kriging with the symmetry logratio transform, regression kriging with the symmetry logratio transform, and compositional kriging (CK) approaches. The root mean squared error (RMSE), the relative improvement value of RMSE and Aitchison's distance (DA) were all utilized to assess the accuracy of prediction and the mean squared deviation ratio was used to evaluate the goodness of fit of the theoretical estimate of error. The results showed that the prediction methods utilized in this paper could enable interpolation results of soil texture to satisfy the constant sum and nonnegativity constraints. Prediction accuracy and model fitting effect of the CK approach were better, suggesting that the CK method was more appropriate for predicting soil texture. The CK method is directly interpolated on soil texture, which ensures that it is optimal unbiased estimator. If the environment variables are appropriately selected as auxiliary variables, spatial variability of soil texture can be predicted reasonably and accordingly the predicted results will be satisfied.展开更多
Industrial transformation and green production(ITGP) is a new 10-year international research initiative proposed by the Chinese National Committee for Future Earth. It is also an important theme for adapting and respo...Industrial transformation and green production(ITGP) is a new 10-year international research initiative proposed by the Chinese National Committee for Future Earth. It is also an important theme for adapting and responding to global environmental change. Aiming at a thorough examination of the implementation of ITGP in China, this paper presents its objectives, its three major areas, and their progress so far. It also identifies the key elements of its management and proposes new perspectives on managing green transformation. For instance, we introduce a case study on cement industry that shows the positive policy effects of reducing backward production capacity on PCDD/Fs emissions. Finally,to develop different transformation scenarios for a green future, we propose four strategies: 1) policy integration for promoting green industry, 2)system innovation and a multidisciplinary approach, 3) collaborative governance with all potential stakeholders, and 4) managing uncertainty,risks, and long-time horizons.展开更多
A novel method for developing a reliable data driven soft sensor to improve the prediction accuracy of sulfur content in hydrodesulfurization(HDS) process was proposed. Therefore, an integrated approach using support ...A novel method for developing a reliable data driven soft sensor to improve the prediction accuracy of sulfur content in hydrodesulfurization(HDS) process was proposed. Therefore, an integrated approach using support vector regression(SVR) based on wavelet transform(WT) and principal component analysis(PCA) was used. Experimental data from the HDS setup were employed to validate the proposed model. The results reveal that the integrated WT-PCA with SVR model was able to increase the prediction accuracy of SVR model. Implementation of the proposed model delivers the best satisfactory predicting performance(EAARE=0.058 and R2=0.97) in comparison with SVR. The obtained results indicate that the proposed model is more reliable and more precise than the multiple linear regression(MLR), SVR and PCA-SVR.展开更多
This paper is an extension and generalization of the study carried out by [1] on the estimation of the population ratio (R) of the population means of two variables (y and x) under Simple Random Sampling (SRS) scheme,...This paper is an extension and generalization of the study carried out by [1] on the estimation of the population ratio (R) of the population means of two variables (y and x) under Simple Random Sampling (SRS) scheme, using a variable transformation of the auxiliary variable, x. All the six estimators proposed by [1] are easily identified as special cases of the proposed class of estimators. Asymptotic properties of the proposed class of estimators are derived theoretically and subsequently verified using empirical illustrations. Some of the proposed estimators are found to have relatively large gains in efficiency over the customary ratio estimator, ?for the given data set.展开更多
The purpose of this study was to examine the burnout levels of research assistants in Ondokuz Mayis University and to examine the results of multiple linear regression model based on the results obtained from Maslach ...The purpose of this study was to examine the burnout levels of research assistants in Ondokuz Mayis University and to examine the results of multiple linear regression model based on the results obtained from Maslach Burnout Scale with Jackknife Method in terms of validity and generalizability. To do this, a questionnaire was given to 11 research assistants working at Ondokuz Mayis University and the burnout scores of this questionnaire were taken as the dependent variable of the multiple linear regression model. The variable of burnout was explained with the variables of age, weekly hours of classes taught, monthly average credit card debt, numbers of published articles and reports, gender, marital status, number of children and the departments of the research assistants. Dummy variables were assigned to the variables of gender, marital status, number of children and the departments of the research assistants and thus, they were made quantitative. The significance of the model as a result of multiple linear regressions was examined through backward elimination method. After this, for the five explanatory variables which influenced the variable of burnout, standardized model coefficients and coefficients of determination, and 95% confidence intervals of these values were estimated through Jackknife Method and the generalizability of the parameter estimation results of these variables on population was researched.展开更多
In this paper, downscaling models are developed using various linear regression approaches namely direct, forward, backward and stepwise regression for downscaling of GCM output to predict mean monthly precipitation u...In this paper, downscaling models are developed using various linear regression approaches namely direct, forward, backward and stepwise regression for downscaling of GCM output to predict mean monthly precipitation under IPCC SRES scenarios to watershed-basin scale in an arid region in India. The effectiveness of these regression approaches is evaluated through application to downscale the predictand for the Pichola lake region in Rajasthan state in India, which is considered to be a climatically sensitive region. The predictor variables are extracted from (1) the National Centers for Environmental Prediction (NCEP) reanalysis dataset for the period 1948–2000, and (2) the simulations from the third-generation Canadian Coupled Global Climate Model (CGCM3) for emission scenarios A1B, A2, B1 and COMMIT for the period 2001–2100. The selection of important predictor variables becomes a crucial issue for developing downscaling models since reanalysis data are based on wide range of meteorological measurements and observations. Direct regression was found to yield better performance among all other regression techniques explored in the present study. The results of downscaling models using both approaches show that precipitation is likely to increase in future for A1B, A2 and B1 scenarios, whereas no trend is discerned with the COMMIT.展开更多
Objective:To study the risk factors of hemorrhagic transformation in patients with acute cerebral infarction and to analyze the risk factors.Methods: A total of 96 patients with acute cerebral infarction after the thr...Objective:To study the risk factors of hemorrhagic transformation in patients with acute cerebral infarction and to analyze the risk factors.Methods: A total of 96 patients with acute cerebral infarction after the thrombolysis in our hospital from June 2016 to December 2017 were selected as the research object. And they were divided into bleeding group 48 cases and hemorrhage transformation group 48 cases according to whether with hemorrhage occurs transformation. Then the lipid metabolism, atrial fibrillation, history of smoking and drinking, history of hypertension and diabetes, blood pressure, treatment time after onset and infarction area of two groups were compared, and the relationship between those factors and the disease were analyzed by the multi-factor Logistic regression analysis.Results: The atrial fibrillation, history of smoking and drinking of two groups had significant differences;The hospital fasting plasma glucose and LDL-C level of two groups had significant differences;the treatment time after onset and infarction area of two groups had significant differences;The multi-factor Logistic regression analysis showed that atrial fibrillation, blood glucose on admission, LDL-C and large area of infarction are the factors affecting the risk of bleeding in patients with acute cerebral infarction transformation.Conclusion:Atrial fibrillation, blood glucose on admission, LDL-C, treatment time after onset and large area of infarction belongs to the patients with acute cerebral infarction after the thrombolysis transformation of bleeding risk factors.展开更多
Extending the work carried out by [1], this paper proposes six combined-type estimators of population ratio of two variables in post-stratified sampling scheme, using variable transformation. Properties of the propose...Extending the work carried out by [1], this paper proposes six combined-type estimators of population ratio of two variables in post-stratified sampling scheme, using variable transformation. Properties of the proposed estimators were obtained up to first order approximations,(on–1), both for achieved sample configurations (conditional argument) and over repeated samples of fixed size n (unconditional argument). Efficiency conditions were obtained. Under these conditions the proposed combined-type estimators would perform better than the associated customary combined-type estimator. Furthermore, optimum estimators among the proposed combined-type estimators were obtained both under the conditional and unconditional arguments. An empirical work confirmed the theoretical results.展开更多
The study proposes, along the line of [1], six separate-type estimators for estimating the population ratio of two variables in post-stratified sampling, using variable transformation. Properties of the proposed estim...The study proposes, along the line of [1], six separate-type estimators for estimating the population ratio of two variables in post-stratified sampling, using variable transformation. Properties of the proposed estimators were obtained up to first order approximations, both for achieved sample configurations (conditional argument) and over repeated samples of fixed size n (unconditional argument). Efficiency conditions, under which the proposed separate-type estimators would perform better than the associated customary separate-type estimators in terms of having smaller mean squared errors, were obtained. Furthermore, conditions under which some of the proposed separate-type estimators would perform better than other proposed separate-type estimators were also obtained. The optimum estimators among the proposed separate-type estimators were obtained and an empirical illustration confirmed the theoretical results.展开更多
针对尾矿坝在线监测重建设、轻利用的现状,基于尾矿坝位移在线监测时间序列,通过多步逆向云变换算法(Multi-step Backward Cloud Transformation Algorithm Based on Sampling with Replacement,MBCT-SR)改进云模型,根据“3E_(n)原则”...针对尾矿坝在线监测重建设、轻利用的现状,基于尾矿坝位移在线监测时间序列,通过多步逆向云变换算法(Multi-step Backward Cloud Transformation Algorithm Based on Sampling with Replacement,MBCT-SR)改进云模型,根据“3E_(n)原则”和内外包络曲线确定在线监测位移的正常运行值,从而建立尾矿坝位移分级预警阈值模型,并利用某尾矿坝全球导航卫星(Global Navigation Satellite System,GNSS)技术表面位移在线监测数据进行实例验证。结果表明:该尾矿坝水平方向位移的黄、橙、红预警阈值分别为8.41 mm/d、12.94 mm/d、19.41 mm/d,呈现出坝体中间预警阈值最大、并由中间向两侧减小的空间变化规律;尾矿坝垂直方向位移的黄、橙、红预警阈值分别为16.56 mm/d、25.48 mm/d、38.22 mm/d,且随着子坝的堆积,预警阈值逐渐增大。展开更多
文摘The use of [1] Box-Cox power transformation in regression analysis is now common;in the last two decades there has been emphasis on diagnostics methods for Box-Cox power transformation, much of which has involved deletion of influential data cases. The pioneer work of [2] studied local influence on constant variance perturbation in the Box-Cox unbiased regression linear mode. Tsai and Wu [3] analyzed local influence method of [2] to assess the effect of the case-weights perturbation on the transformation-power estimator in the Box-Cox unbiased regression linear model. Many authors noted that the influential observations on the biased estimators are different from the unbiased estimators. In this paper I describe a diagnostic method for assessing the local influence on the constant variance perturbation on the transformation in the Box-Cox biased ridge regression linear model. Two real macroeconomic data sets are used to illustrate the methodologies.
文摘In the teaching and researching of linear regression analysis, it is interesting and enlightening to explore how the dependent variable vector can be inner-transformed into regression coefficient estimator vector from a visible geometrical view. As an example, the roadmap of such inner transformation is presented based on a simple multiple linear regression model in this work. By applying the matrix algorithms like singular value decomposition (SVD) and Moore-Penrose generalized matrix inverse, the dependent variable vector lands into the right space of the independent variable matrix and is metamorphosed into regression coefficient estimator vector through the three-step of inner transformation. This work explores the geometrical relationship between the dependent variable vector and regression coefficient estimator vector as well as presents a new approach for vector rotating.
基金Sponsored by the Projects of International Cooperation and Exchange of the National Natural Science Foundation of China(Grant No.51561135003)the Scientific Research Foundation of Graduated School of Southeast University(Grant No.YBJJ1842)
文摘In order to accurately predict bus travel time, a hybrid model based on combining wavelet transform technique with support vector regression(WT-SVR) model is employed. In this model, wavelet decomposition is used to extract important information of data at different levels and enhances the forecasting ability of the model. After wavelet transform different components are forecasted by their corresponding SVR predictors. The final prediction result is obtained by the summation of the predicted results for each component. The proposed hybrid model is examined by the data of bus route No.550 in Nanjing, China. The performance of WT-SVR model is evaluated by mean absolute error(MAE), mean absolute percent error(MAPE) and relative mean square error(RMSE), and also compared to regular SVR and ANN models. The results show that the prediction method based on wavelet transform and SVR has better tracking ability and dynamic behavior than regular SVR and ANN models. The forecasting performance is remarkably improved to obtain within 6% MAPE for testing section Ⅰ and 8% MAPE for testing section Ⅱ, which proves that the suggested approach is feasible and applicable in bus travel time prediction.
文摘The similarity transformation model between different coordinate systems is not accurate enough to describe the discrepancy of them.Therefore,the coordinate transformation from the coordinate frame with poor accuracy to that with high accuracy cannot guarantee a high precision of transformation.In this paper,a combined method of similarity transformation and regressive approximating is presented.The local error accumulation and distortion are taken into consideration and the precision of coordinate system is improved by using the recommended
基金supported by the National Natural Science Foundation of China (41071152)the Special Fund for Land and Resources Scientific Research in the Public Interest,China (201011006-3)the Special Fund for Agro-Scientific Research in the Public Interest,China (201103005-01-01)
文摘The spatial interpolation for soil texture does not necessarily satisfy the constant sum and nonnegativity constraints. Meanwhile, although numeric and categorical variables have been used as auxiliary variables to improve prediction accuracy of soil attributes such as soil organic matter, they (especially the categorical variables) are rarely used in spatial prediction of soil texture. The objective of our study was to comparing the performance of the methods for spatial prediction of soil texture with consideration of the characteristics of compositional data and auxiliary variables. These methods include the ordinary kriging with the symmetry logratio transform, regression kriging with the symmetry logratio transform, and compositional kriging (CK) approaches. The root mean squared error (RMSE), the relative improvement value of RMSE and Aitchison's distance (DA) were all utilized to assess the accuracy of prediction and the mean squared deviation ratio was used to evaluate the goodness of fit of the theoretical estimate of error. The results showed that the prediction methods utilized in this paper could enable interpolation results of soil texture to satisfy the constant sum and nonnegativity constraints. Prediction accuracy and model fitting effect of the CK approach were better, suggesting that the CK method was more appropriate for predicting soil texture. The CK method is directly interpolated on soil texture, which ensures that it is optimal unbiased estimator. If the environment variables are appropriately selected as auxiliary variables, spatial variability of soil texture can be predicted reasonably and accordingly the predicted results will be satisfied.
基金funded by the Chinese Academy of Sciences (KZZD-EW-TZ-12)National Natural Science Foundation of China (414201040045 and 41371488)Natural Science Foundation of Hainan Province (413129)
文摘Industrial transformation and green production(ITGP) is a new 10-year international research initiative proposed by the Chinese National Committee for Future Earth. It is also an important theme for adapting and responding to global environmental change. Aiming at a thorough examination of the implementation of ITGP in China, this paper presents its objectives, its three major areas, and their progress so far. It also identifies the key elements of its management and proposes new perspectives on managing green transformation. For instance, we introduce a case study on cement industry that shows the positive policy effects of reducing backward production capacity on PCDD/Fs emissions. Finally,to develop different transformation scenarios for a green future, we propose four strategies: 1) policy integration for promoting green industry, 2)system innovation and a multidisciplinary approach, 3) collaborative governance with all potential stakeholders, and 4) managing uncertainty,risks, and long-time horizons.
文摘A novel method for developing a reliable data driven soft sensor to improve the prediction accuracy of sulfur content in hydrodesulfurization(HDS) process was proposed. Therefore, an integrated approach using support vector regression(SVR) based on wavelet transform(WT) and principal component analysis(PCA) was used. Experimental data from the HDS setup were employed to validate the proposed model. The results reveal that the integrated WT-PCA with SVR model was able to increase the prediction accuracy of SVR model. Implementation of the proposed model delivers the best satisfactory predicting performance(EAARE=0.058 and R2=0.97) in comparison with SVR. The obtained results indicate that the proposed model is more reliable and more precise than the multiple linear regression(MLR), SVR and PCA-SVR.
文摘This paper is an extension and generalization of the study carried out by [1] on the estimation of the population ratio (R) of the population means of two variables (y and x) under Simple Random Sampling (SRS) scheme, using a variable transformation of the auxiliary variable, x. All the six estimators proposed by [1] are easily identified as special cases of the proposed class of estimators. Asymptotic properties of the proposed class of estimators are derived theoretically and subsequently verified using empirical illustrations. Some of the proposed estimators are found to have relatively large gains in efficiency over the customary ratio estimator, ?for the given data set.
文摘The purpose of this study was to examine the burnout levels of research assistants in Ondokuz Mayis University and to examine the results of multiple linear regression model based on the results obtained from Maslach Burnout Scale with Jackknife Method in terms of validity and generalizability. To do this, a questionnaire was given to 11 research assistants working at Ondokuz Mayis University and the burnout scores of this questionnaire were taken as the dependent variable of the multiple linear regression model. The variable of burnout was explained with the variables of age, weekly hours of classes taught, monthly average credit card debt, numbers of published articles and reports, gender, marital status, number of children and the departments of the research assistants. Dummy variables were assigned to the variables of gender, marital status, number of children and the departments of the research assistants and thus, they were made quantitative. The significance of the model as a result of multiple linear regressions was examined through backward elimination method. After this, for the five explanatory variables which influenced the variable of burnout, standardized model coefficients and coefficients of determination, and 95% confidence intervals of these values were estimated through Jackknife Method and the generalizability of the parameter estimation results of these variables on population was researched.
文摘In this paper, downscaling models are developed using various linear regression approaches namely direct, forward, backward and stepwise regression for downscaling of GCM output to predict mean monthly precipitation under IPCC SRES scenarios to watershed-basin scale in an arid region in India. The effectiveness of these regression approaches is evaluated through application to downscale the predictand for the Pichola lake region in Rajasthan state in India, which is considered to be a climatically sensitive region. The predictor variables are extracted from (1) the National Centers for Environmental Prediction (NCEP) reanalysis dataset for the period 1948–2000, and (2) the simulations from the third-generation Canadian Coupled Global Climate Model (CGCM3) for emission scenarios A1B, A2, B1 and COMMIT for the period 2001–2100. The selection of important predictor variables becomes a crucial issue for developing downscaling models since reanalysis data are based on wide range of meteorological measurements and observations. Direct regression was found to yield better performance among all other regression techniques explored in the present study. The results of downscaling models using both approaches show that precipitation is likely to increase in future for A1B, A2 and B1 scenarios, whereas no trend is discerned with the COMMIT.
文摘Objective:To study the risk factors of hemorrhagic transformation in patients with acute cerebral infarction and to analyze the risk factors.Methods: A total of 96 patients with acute cerebral infarction after the thrombolysis in our hospital from June 2016 to December 2017 were selected as the research object. And they were divided into bleeding group 48 cases and hemorrhage transformation group 48 cases according to whether with hemorrhage occurs transformation. Then the lipid metabolism, atrial fibrillation, history of smoking and drinking, history of hypertension and diabetes, blood pressure, treatment time after onset and infarction area of two groups were compared, and the relationship between those factors and the disease were analyzed by the multi-factor Logistic regression analysis.Results: The atrial fibrillation, history of smoking and drinking of two groups had significant differences;The hospital fasting plasma glucose and LDL-C level of two groups had significant differences;the treatment time after onset and infarction area of two groups had significant differences;The multi-factor Logistic regression analysis showed that atrial fibrillation, blood glucose on admission, LDL-C and large area of infarction are the factors affecting the risk of bleeding in patients with acute cerebral infarction transformation.Conclusion:Atrial fibrillation, blood glucose on admission, LDL-C, treatment time after onset and large area of infarction belongs to the patients with acute cerebral infarction after the thrombolysis transformation of bleeding risk factors.
文摘Extending the work carried out by [1], this paper proposes six combined-type estimators of population ratio of two variables in post-stratified sampling scheme, using variable transformation. Properties of the proposed estimators were obtained up to first order approximations,(on–1), both for achieved sample configurations (conditional argument) and over repeated samples of fixed size n (unconditional argument). Efficiency conditions were obtained. Under these conditions the proposed combined-type estimators would perform better than the associated customary combined-type estimator. Furthermore, optimum estimators among the proposed combined-type estimators were obtained both under the conditional and unconditional arguments. An empirical work confirmed the theoretical results.
文摘The study proposes, along the line of [1], six separate-type estimators for estimating the population ratio of two variables in post-stratified sampling, using variable transformation. Properties of the proposed estimators were obtained up to first order approximations, both for achieved sample configurations (conditional argument) and over repeated samples of fixed size n (unconditional argument). Efficiency conditions, under which the proposed separate-type estimators would perform better than the associated customary separate-type estimators in terms of having smaller mean squared errors, were obtained. Furthermore, conditions under which some of the proposed separate-type estimators would perform better than other proposed separate-type estimators were also obtained. The optimum estimators among the proposed separate-type estimators were obtained and an empirical illustration confirmed the theoretical results.