Considering both the discrete and ordered nature of the household car ownership an ordered logistic regression model to predict household car ownership is established by using the data of Nanjing Household Travel Surv...Considering both the discrete and ordered nature of the household car ownership an ordered logistic regression model to predict household car ownership is established by using the data of Nanjing Household Travel Survey in the year 2012. The model results show that some household characteristics such as the number of driver licenses household income and home location are significant.Yet the intersection density indicating the street patterns of home location and the dummy near the subway and the bus stop density indicating the transit accessibility of home location are insignificant.The model estimation obtains a good γ2 the goodness of fit of the model and the model validation also shows a good performance in prediction.The marginal effects of all the significant explanatory variables are calculated to quantify the odds change in the household car ownership following a one-unit change in the explanatory variables.展开更多
The excessive staminate catkin thinning (emasculation) of proterandrous walnut is an important management measure for improving yield. To improve the excessive staminate catkin thinning efficiency, the model of quad...The excessive staminate catkin thinning (emasculation) of proterandrous walnut is an important management measure for improving yield. To improve the excessive staminate catkin thinning efficiency, the model of quadratic polynomial regression equation and BP artificial neural network was developed. The effects of ethephon, gibberel in and mepiquat on shedding rate of staminate catkin of pro-terandrous walnut were investigated by modeling field test. Based on the modeling test results, the excessive staminate catkin thinning model of quadratic polynomial regression equation and BP artificial neural network was established, and it was validated by field test next year. The test data were divided into training set, vali-dation set and test set. The total 20 sets of data obtained from the modeling field test were randomly divided into training set (17) and validation set (3) by central composite design (quadric rotational regression test design), and the data obtained from the next-year field test were divided into the test set. The topological struc-ture of BP artificial neural network was 3-5-1. The results showed that the pre-diction errors of BP neural network for samples from the validation set were 1.355 0%, 0.429 1% and 0.353 8%, respectively; the difference between the predicted value by the BP neural network and validated value by field test was 2.04%, and the difference between the predicted value by the regression equation and validated value by field test was 3.12%; the prediction accuracy of BP neural network was over 1.0% higher than that of regression equation. The effective combination of quadratic polynomial stepwise regression and BP artificial neural network wil not only help to determine the effect of independent parameter but also improve the prediction accuracy.展开更多
Time series is a kind of data widely used in various fields such as electricity forecasting,exchange rate forecasting,and solar power generation forecasting,and therefore time series prediction is of great significanc...Time series is a kind of data widely used in various fields such as electricity forecasting,exchange rate forecasting,and solar power generation forecasting,and therefore time series prediction is of great significance.Recently,the encoder-decoder model combined with long short-term memory(LSTM)is widely used for multivariate time series prediction.However,the encoder can only encode information into fixed-length vectors,hence the performance of the model decreases rapidly as the length of the input sequence or output sequence increases.To solve this problem,we propose a combination model named AR_CLSTM based on the encoder_decoder structure and linear autoregression.The model uses a time step-based attention mechanism to enable the decoder to adaptively select past hidden states and extract useful information,and then uses convolution structure to learn the internal relationship between different dimensions of multivariate time series.In addition,AR_CLSTM combines the traditional linear autoregressive method to learn the linear relationship of the time series,so as to further reduce the error of time series prediction in the encoder_decoder structure and improve the multivariate time series Predictive effect.Experiments show that the AR_CLSTM model performs well in different time series predictions,and its root mean square error,mean square error,and average absolute error all decrease significantly.展开更多
In dealing with nonparametric regression the GAM procedure is the most versatile of several new procedures. The terminology behind this procedure is more flexible than traditional parametric modeling tools. It relaxes...In dealing with nonparametric regression the GAM procedure is the most versatile of several new procedures. The terminology behind this procedure is more flexible than traditional parametric modeling tools. It relaxes the usual assumptions of parametric model and enables us to uncover structure to establish the relationship between independent variables and dependent variable in exponential family that may not be obvious otherwise. In this paper, we discussed two methods of fitting generalized additive logistic regression model, one based on Newton Raphson method and another based on iterative weighted least square method for first and second order Taylor series expansion. The use of the GAM procedure with the specified set of weights, using local scoring algorithm, was applied to real life data sets. The cubic spline smoother is applied to the independent variables. Based on nonparametric regression and smoothing techniques, this procedure provides powerful tools for data analysis.展开更多
Credit scoring models are quantitative models used commonly by financial institutions in the measurement and forecasting of credit risk, having a consolidated use in the process of credit concession of these instituti...Credit scoring models are quantitative models used commonly by financial institutions in the measurement and forecasting of credit risk, having a consolidated use in the process of credit concession of these institutions. The purpose of this paper was to evaluate the possibility of application of credit scoring models in a microcredit institution named Fundo Rotativo de A~~o da Cidadania--Cred Cidadania (Revolving Fund of Citizenship Action--Cred Cidadania), located in Recife (Brazil). In order to do this, data related to a sample of clients of the Cred Cidadania was collected, and this data was used to develop two types of credit scoring models: one is credit approval, another is called behavioral scoring. The statistical technique applied in the construction of the models was logistic regression. The results of the study demonstrated that the credit scoring models obtain satisfactory performance when used in the analysis of credit risk in the Cred Cidadania microcredit institution, reaching a correct client classification percentile of about 80%. The results also indicate that the use of credit scoring models supplies subsidies to the institution, aiding it in the prevention and reduction of insolvency and in the decrease of its operational costs, two problems that affect their financial sustainability.展开更多
A bundle adjustment method of remote sensing images based on dual quaternion is presented,which conducted the uniform disposal corresponding location and attitude of sequence images by the dual quaternion.The constrai...A bundle adjustment method of remote sensing images based on dual quaternion is presented,which conducted the uniform disposal corresponding location and attitude of sequence images by the dual quaternion.The constraint relationship of image itself and sequence images is constructed to compensate the systematic errors.The feasibility of this method used in bundle adjustment is theoretically tested by the analysis of the structural characteristics of error equation and normal equation based on dual quaternion.Different distributions of control points and stepwise regression analysis are introduced into the experiment for RC30 image.The results show that the adjustment accuracy can achieve 0.2min plane and 1min elevation.As a result,this method provides a new technique for geometric location problem of remote sensing images.展开更多
文摘Considering both the discrete and ordered nature of the household car ownership an ordered logistic regression model to predict household car ownership is established by using the data of Nanjing Household Travel Survey in the year 2012. The model results show that some household characteristics such as the number of driver licenses household income and home location are significant.Yet the intersection density indicating the street patterns of home location and the dummy near the subway and the bus stop density indicating the transit accessibility of home location are insignificant.The model estimation obtains a good γ2 the goodness of fit of the model and the model validation also shows a good performance in prediction.The marginal effects of all the significant explanatory variables are calculated to quantify the odds change in the household car ownership following a one-unit change in the explanatory variables.
基金Supported by Key Science and Technology Program of Shanxi Province,China(002023)~~
文摘The excessive staminate catkin thinning (emasculation) of proterandrous walnut is an important management measure for improving yield. To improve the excessive staminate catkin thinning efficiency, the model of quadratic polynomial regression equation and BP artificial neural network was developed. The effects of ethephon, gibberel in and mepiquat on shedding rate of staminate catkin of pro-terandrous walnut were investigated by modeling field test. Based on the modeling test results, the excessive staminate catkin thinning model of quadratic polynomial regression equation and BP artificial neural network was established, and it was validated by field test next year. The test data were divided into training set, vali-dation set and test set. The total 20 sets of data obtained from the modeling field test were randomly divided into training set (17) and validation set (3) by central composite design (quadric rotational regression test design), and the data obtained from the next-year field test were divided into the test set. The topological struc-ture of BP artificial neural network was 3-5-1. The results showed that the pre-diction errors of BP neural network for samples from the validation set were 1.355 0%, 0.429 1% and 0.353 8%, respectively; the difference between the predicted value by the BP neural network and validated value by field test was 2.04%, and the difference between the predicted value by the regression equation and validated value by field test was 3.12%; the prediction accuracy of BP neural network was over 1.0% higher than that of regression equation. The effective combination of quadratic polynomial stepwise regression and BP artificial neural network wil not only help to determine the effect of independent parameter but also improve the prediction accuracy.
基金Shanxi Provincial Key Research and Development Program Project Fund(No.201703D111011)。
文摘Time series is a kind of data widely used in various fields such as electricity forecasting,exchange rate forecasting,and solar power generation forecasting,and therefore time series prediction is of great significance.Recently,the encoder-decoder model combined with long short-term memory(LSTM)is widely used for multivariate time series prediction.However,the encoder can only encode information into fixed-length vectors,hence the performance of the model decreases rapidly as the length of the input sequence or output sequence increases.To solve this problem,we propose a combination model named AR_CLSTM based on the encoder_decoder structure and linear autoregression.The model uses a time step-based attention mechanism to enable the decoder to adaptively select past hidden states and extract useful information,and then uses convolution structure to learn the internal relationship between different dimensions of multivariate time series.In addition,AR_CLSTM combines the traditional linear autoregressive method to learn the linear relationship of the time series,so as to further reduce the error of time series prediction in the encoder_decoder structure and improve the multivariate time series Predictive effect.Experiments show that the AR_CLSTM model performs well in different time series predictions,and its root mean square error,mean square error,and average absolute error all decrease significantly.
文摘In dealing with nonparametric regression the GAM procedure is the most versatile of several new procedures. The terminology behind this procedure is more flexible than traditional parametric modeling tools. It relaxes the usual assumptions of parametric model and enables us to uncover structure to establish the relationship between independent variables and dependent variable in exponential family that may not be obvious otherwise. In this paper, we discussed two methods of fitting generalized additive logistic regression model, one based on Newton Raphson method and another based on iterative weighted least square method for first and second order Taylor series expansion. The use of the GAM procedure with the specified set of weights, using local scoring algorithm, was applied to real life data sets. The cubic spline smoother is applied to the independent variables. Based on nonparametric regression and smoothing techniques, this procedure provides powerful tools for data analysis.
文摘Credit scoring models are quantitative models used commonly by financial institutions in the measurement and forecasting of credit risk, having a consolidated use in the process of credit concession of these institutions. The purpose of this paper was to evaluate the possibility of application of credit scoring models in a microcredit institution named Fundo Rotativo de A~~o da Cidadania--Cred Cidadania (Revolving Fund of Citizenship Action--Cred Cidadania), located in Recife (Brazil). In order to do this, data related to a sample of clients of the Cred Cidadania was collected, and this data was used to develop two types of credit scoring models: one is credit approval, another is called behavioral scoring. The statistical technique applied in the construction of the models was logistic regression. The results of the study demonstrated that the credit scoring models obtain satisfactory performance when used in the analysis of credit risk in the Cred Cidadania microcredit institution, reaching a correct client classification percentile of about 80%. The results also indicate that the use of credit scoring models supplies subsidies to the institution, aiding it in the prevention and reduction of insolvency and in the decrease of its operational costs, two problems that affect their financial sustainability.
基金supported by the National Natural Science Foundations of China (Nos.41101441,60974107, 41471381)the Foundation of Graduate Innovation Center in NUAA(No.kfjj130133)
文摘A bundle adjustment method of remote sensing images based on dual quaternion is presented,which conducted the uniform disposal corresponding location and attitude of sequence images by the dual quaternion.The constraint relationship of image itself and sequence images is constructed to compensate the systematic errors.The feasibility of this method used in bundle adjustment is theoretically tested by the analysis of the structural characteristics of error equation and normal equation based on dual quaternion.Different distributions of control points and stepwise regression analysis are introduced into the experiment for RC30 image.The results show that the adjustment accuracy can achieve 0.2min plane and 1min elevation.As a result,this method provides a new technique for geometric location problem of remote sensing images.