This paper selects seven indicators of financial revenue and housing sales price in recent 19 years in China,and uses SPSS and Excel to carry out descriptive statistics,independent sample t-test,correlation analysis a...This paper selects seven indicators of financial revenue and housing sales price in recent 19 years in China,and uses SPSS and Excel to carry out descriptive statistics,independent sample t-test,correlation analysis and regression analysis to comprehensively study the correlation between financial revenue and housing sales price in China,and establishes the relationship between financial revenue and housing sales price When the average selling price of commercial housing increases by one unit,the fiscal revenue will increase by 27.855 points.展开更多
In this paper, we investigate the empirical likelihood diagnosis of modal linear regression models. The empirical likelihood ratio function based on modal regression estimation method for the regression coefficient is...In this paper, we investigate the empirical likelihood diagnosis of modal linear regression models. The empirical likelihood ratio function based on modal regression estimation method for the regression coefficient is introduced. First, the estimation equation based on empirical likelihood method is established. Then, some diagnostic statistics are proposed. At last, we also examine the performance of proposed method for finite sample sizes through simulation study.展开更多
In this paper, we study some robustness aspects of linear regression models of the presence of outliers or discordant observations considering the use of stable distributions for the response in place of the usual nor...In this paper, we study some robustness aspects of linear regression models of the presence of outliers or discordant observations considering the use of stable distributions for the response in place of the usual normality assumption. It is well known that, in general, there is no closed form for the probability density function of stable distributions. However, under a Bayesian approach, the use of a latent or auxiliary random variable gives some simplification to obtain any posterior distribution when related to stable distributions. To show the usefulness of the computational aspects, the methodology is applied to two examples: one is related to a standard linear regression model with an explanatory variable and the other is related to a simulated data set assuming a 23 factorial experiment. Posterior summaries of interest are obtained using MCMC (Markov Chain Monte Carlo) methods and the OpenBugs software.展开更多
Observed rainfall is a very essential parameter for the analysis of rainfall,day to day weather forecast and its validation.The observed rainfall data is only available from five observatories of IMD;while no rainfall...Observed rainfall is a very essential parameter for the analysis of rainfall,day to day weather forecast and its validation.The observed rainfall data is only available from five observatories of IMD;while no rainfall data is available at various important locations in and around Delhi-NCR.However,the 24-hour rainfall data observed by Doppler Weather Radar(DWR)for entire Delhi and surrounding region(up to 150 km)is readily available in a pictorial form.In this paper,efforts have been made to derive/estimate the rainfall at desired locations using DWR hydrological products.Firstly,the rainfall at desired locations has been estimated from the precipitation accumulation product(PAC)of the DWR using image processing in Python language.After this,a linear regression model using the least square method has been developed in R language.Estimated and observed rainfall data of year 2018(July,August and September)was used to train the model.After this,the model was tested on rainfall data of year 2019(July,August and September)and validated.With the use of linear regression model,the error in mean rainfall estimation reduced by 46.58% and the error in max rainfall estimation reduced by 84.53% for the year 2019.The error in mean rainfall estimation reduced by 81.36% and the error in max rainfall estimation reduced by 33.81%for the year 2018.Thus,the rainfall can be estimated with a fair degree of accuracy at desired locations within the range of the Doppler Weather Radar using the radar rainfall products and the developed linear regression model.展开更多
In this paper we aim to analyse temporal variation of CD4 cell counts for HIV-infected individuals under antiretroviral therapy by using statistical methods. This is achieved by resorting to recursive binary regressio...In this paper we aim to analyse temporal variation of CD4 cell counts for HIV-infected individuals under antiretroviral therapy by using statistical methods. This is achieved by resorting to recursive binary regression tree approach [1]?[2]. This approach has made it possible to highlight the existence of several segments of the population of interest described by the interactions between the predictive covariates of the response to the treatment regimen.展开更多
Count data that exhibit over dispersion (variance of counts is larger than its mean) are commonly analyzed using discrete distributions such as negative binomial, Poisson inverse Gaussian and other models. The Poisson...Count data that exhibit over dispersion (variance of counts is larger than its mean) are commonly analyzed using discrete distributions such as negative binomial, Poisson inverse Gaussian and other models. The Poisson is characterized by the equality of mean and variance whereas the Negative Binomial and the Poisson inverse Gaussian have variance larger than the mean and therefore are more appropriate to model over-dispersed count data. As an alternative to these two models, we shall use the generalized Poisson distribution for group comparisons in the presence of multiple covariates. This problem is known as the ANCOVA and is solved for continuous data. Our objectives were to develop ANCOVA using the generalized Poisson distribution, and compare its goodness of fit to that of the nonparametric Generalized Additive Models. We used real life data to show that the model performs quite satisfactorily when compared to the nonparametric Generalized Additive Models.展开更多
Factors affecting bath water temperature model include the shape and size of a bath, people’s gesture, volume, individual temperature adaptation as well as body movement in the bath. In addition, the bathroom space, ...Factors affecting bath water temperature model include the shape and size of a bath, people’s gesture, volume, individual temperature adaptation as well as body movement in the bath. In addition, the bathroom space, ambient temperature and bath materials will also affect changes of the water temperature to a certain extent. In this paper, the cooling function and linear regression method are used and the MATLAB software is also used to simulate the model of water temperature, alkaline bath foams that obtained can accelerate changes in water temperature.展开更多
In oil and gas exploration,elucidating the complex interdependencies among geological variables is paramount.Our study introduces the application of sophisticated regression analysis method at the forefront,aiming not...In oil and gas exploration,elucidating the complex interdependencies among geological variables is paramount.Our study introduces the application of sophisticated regression analysis method at the forefront,aiming not just at predicting geophysical logging curve values but also innovatively mitigate hydrocarbon depletion observed in geochemical logging.Through a rigorous assessment,we explore the efficacy of eight regression models,bifurcated into linear and nonlinear groups,to accommodate the multifaceted nature of geological datasets.Our linear model suite encompasses the Standard Equation,Ridge Regression,Least Absolute Shrinkage and Selection Operator,and Elastic Net,each presenting distinct advantages.The Standard Equation serves as a foundational benchmark,whereas Ridge Regression implements penalty terms to counteract overfitting,thus bolstering model robustness in the presence of multicollinearity.The Least Absolute Shrinkage and Selection Operator for variable selection functions to streamline models,enhancing their interpretability,while Elastic Net amalgamates the merits of Ridge Regression and Least Absolute Shrinkage and Selection Operator,offering a harmonized solution to model complexity and comprehensibility.On the nonlinear front,Gradient Descent,Kernel Ridge Regression,Support Vector Regression,and Piecewise Function-Fitting methods introduce innovative approaches.Gradient Descent assures computational efficiency in optimizing solutions,Kernel Ridge Regression leverages the kernel trick to navigate nonlinear patterns,and Support Vector Regression is proficient in forecasting extremities,pivotal for exploration risk assessment.The Piecewise Function-Fitting approach,tailored for geological data,facilitates adaptable modeling of variable interrelations,accommodating abrupt data trend shifts.Our analysis identifies Ridge Regression,particularly when augmented by Piecewise Function-Fitting,as superior in recouping hydrocarbon losses,and underscoring its utility in resource quantification refinement.Meanwhile,Kernel Ridge Regression emerges as a noteworthy strategy in ameliorating porosity-logging curve prediction for well A,evidencing its aptness for intricate geological structures.This research attests to the scientific ascendancy and broad-spectrum relevance of these regression techniques over conventional methods while heralding new horizons for their deployment in the oil and gas sector.The insights garnered from these advanced modeling strategies are set to transform geological and engineering practices in hydrocarbon prediction,evaluation,and recovery.展开更多
Spatio-temporal assessment of the above ground biomass (AGB) is a cumbersome task due to the difficulties associated with the measurement of different tree parameters such as girth at breast height and height of tre...Spatio-temporal assessment of the above ground biomass (AGB) is a cumbersome task due to the difficulties associated with the measurement of different tree parameters such as girth at breast height and height of trees. The present research was conducted in the campus of Birla Institute of Technology, Mesra, Ranchi, India, which is predomi- nantly covered by Sal (Shorea robusta C. F. Gaertn). Two methods of regression analysis was employed to determine the potential of remote sensing parameters with the AGB measured in the field such as linear regression analysis between the AGB and the individual bands, principal components (PCs) of the bands, vegetation indices (VI), and the PCs of the VIs respectively and multiple linear regression (MLR) analysis be- tween the AGB and all the variables in each category of data. From the linear regression analysis, it was found that only the NDVI exhibited regression coefficient value above 0.80 with the remaining parameters showing very low values. On the other hand, the MLR based analysis revealed significantly improved results as evidenced by the occurrence of very high correlation coefficient values of greater than 0.90 determined between the computed AGB from the MLR equations and field-estimated AGB thereby ascertaining their superiority in providing reliable estimates of AGB. The highest correlation coefficient of 0.99 is found with the MLR involving PCs of VIs.展开更多
We consider a functional partially linear additive model that predicts a functional response by a scalar predictor and functional predictors. The B-spline and eigenbasis least squares estimator for both the parametric...We consider a functional partially linear additive model that predicts a functional response by a scalar predictor and functional predictors. The B-spline and eigenbasis least squares estimator for both the parametric and the nonparametric components proposed. In the final of this paper, as a result, we got the variance decomposition of the model and establish the asymptotic convergence rate for estimator.展开更多
The objective of this paper is to present a review of different calibration and classification methods for functional data in the context of chemometric applications. In chemometric, it is usual to measure certain par...The objective of this paper is to present a review of different calibration and classification methods for functional data in the context of chemometric applications. In chemometric, it is usual to measure certain parameters in terms of a set of spectrometric curves that are observed in a finite set of points (functional data). Although the predictor variable is clearly functional, this problem is usually solved by using multivariate calibration techniques that consider it as a finite set of variables associated with the observed points (wavelengths or times). But these explicative variables are highly correlated and it is therefore more informative to reconstruct first the true functional form of the predictor curves. Although it has been published in several articles related to the implementation of functional data analysis techniques in chemometric, their power to solve real problems is not yet well known. Because of this the extension of multivariate calibration techniques (linear regression, principal component regression and partial least squares) and classification methods (linear discriminant analysis and logistic regression) to the functional domain and some relevant chemometric applications are reviewed in this paper.展开更多
基金Thank you for your valuable comments and suggestions.This research was supported by Yunnan applied basic research project(NO.2017FD150)Chuxiong Normal University General Research Project(NO.XJYB2001).
文摘This paper selects seven indicators of financial revenue and housing sales price in recent 19 years in China,and uses SPSS and Excel to carry out descriptive statistics,independent sample t-test,correlation analysis and regression analysis to comprehensively study the correlation between financial revenue and housing sales price in China,and establishes the relationship between financial revenue and housing sales price When the average selling price of commercial housing increases by one unit,the fiscal revenue will increase by 27.855 points.
文摘In this paper, we investigate the empirical likelihood diagnosis of modal linear regression models. The empirical likelihood ratio function based on modal regression estimation method for the regression coefficient is introduced. First, the estimation equation based on empirical likelihood method is established. Then, some diagnostic statistics are proposed. At last, we also examine the performance of proposed method for finite sample sizes through simulation study.
基金financial support from the Brazilian Institution Conselho Nacional de Desenvolvimento Cientifico e Tecnologico(CNPq).
文摘In this paper, we study some robustness aspects of linear regression models of the presence of outliers or discordant observations considering the use of stable distributions for the response in place of the usual normality assumption. It is well known that, in general, there is no closed form for the probability density function of stable distributions. However, under a Bayesian approach, the use of a latent or auxiliary random variable gives some simplification to obtain any posterior distribution when related to stable distributions. To show the usefulness of the computational aspects, the methodology is applied to two examples: one is related to a standard linear regression model with an explanatory variable and the other is related to a simulated data set assuming a 23 factorial experiment. Posterior summaries of interest are obtained using MCMC (Markov Chain Monte Carlo) methods and the OpenBugs software.
文摘Observed rainfall is a very essential parameter for the analysis of rainfall,day to day weather forecast and its validation.The observed rainfall data is only available from five observatories of IMD;while no rainfall data is available at various important locations in and around Delhi-NCR.However,the 24-hour rainfall data observed by Doppler Weather Radar(DWR)for entire Delhi and surrounding region(up to 150 km)is readily available in a pictorial form.In this paper,efforts have been made to derive/estimate the rainfall at desired locations using DWR hydrological products.Firstly,the rainfall at desired locations has been estimated from the precipitation accumulation product(PAC)of the DWR using image processing in Python language.After this,a linear regression model using the least square method has been developed in R language.Estimated and observed rainfall data of year 2018(July,August and September)was used to train the model.After this,the model was tested on rainfall data of year 2019(July,August and September)and validated.With the use of linear regression model,the error in mean rainfall estimation reduced by 46.58% and the error in max rainfall estimation reduced by 84.53% for the year 2019.The error in mean rainfall estimation reduced by 81.36% and the error in max rainfall estimation reduced by 33.81%for the year 2018.Thus,the rainfall can be estimated with a fair degree of accuracy at desired locations within the range of the Doppler Weather Radar using the radar rainfall products and the developed linear regression model.
文摘In this paper we aim to analyse temporal variation of CD4 cell counts for HIV-infected individuals under antiretroviral therapy by using statistical methods. This is achieved by resorting to recursive binary regression tree approach [1]?[2]. This approach has made it possible to highlight the existence of several segments of the population of interest described by the interactions between the predictive covariates of the response to the treatment regimen.
文摘Count data that exhibit over dispersion (variance of counts is larger than its mean) are commonly analyzed using discrete distributions such as negative binomial, Poisson inverse Gaussian and other models. The Poisson is characterized by the equality of mean and variance whereas the Negative Binomial and the Poisson inverse Gaussian have variance larger than the mean and therefore are more appropriate to model over-dispersed count data. As an alternative to these two models, we shall use the generalized Poisson distribution for group comparisons in the presence of multiple covariates. This problem is known as the ANCOVA and is solved for continuous data. Our objectives were to develop ANCOVA using the generalized Poisson distribution, and compare its goodness of fit to that of the nonparametric Generalized Additive Models. We used real life data to show that the model performs quite satisfactorily when compared to the nonparametric Generalized Additive Models.
文摘Factors affecting bath water temperature model include the shape and size of a bath, people’s gesture, volume, individual temperature adaptation as well as body movement in the bath. In addition, the bathroom space, ambient temperature and bath materials will also affect changes of the water temperature to a certain extent. In this paper, the cooling function and linear regression method are used and the MATLAB software is also used to simulate the model of water temperature, alkaline bath foams that obtained can accelerate changes in water temperature.
文摘In oil and gas exploration,elucidating the complex interdependencies among geological variables is paramount.Our study introduces the application of sophisticated regression analysis method at the forefront,aiming not just at predicting geophysical logging curve values but also innovatively mitigate hydrocarbon depletion observed in geochemical logging.Through a rigorous assessment,we explore the efficacy of eight regression models,bifurcated into linear and nonlinear groups,to accommodate the multifaceted nature of geological datasets.Our linear model suite encompasses the Standard Equation,Ridge Regression,Least Absolute Shrinkage and Selection Operator,and Elastic Net,each presenting distinct advantages.The Standard Equation serves as a foundational benchmark,whereas Ridge Regression implements penalty terms to counteract overfitting,thus bolstering model robustness in the presence of multicollinearity.The Least Absolute Shrinkage and Selection Operator for variable selection functions to streamline models,enhancing their interpretability,while Elastic Net amalgamates the merits of Ridge Regression and Least Absolute Shrinkage and Selection Operator,offering a harmonized solution to model complexity and comprehensibility.On the nonlinear front,Gradient Descent,Kernel Ridge Regression,Support Vector Regression,and Piecewise Function-Fitting methods introduce innovative approaches.Gradient Descent assures computational efficiency in optimizing solutions,Kernel Ridge Regression leverages the kernel trick to navigate nonlinear patterns,and Support Vector Regression is proficient in forecasting extremities,pivotal for exploration risk assessment.The Piecewise Function-Fitting approach,tailored for geological data,facilitates adaptable modeling of variable interrelations,accommodating abrupt data trend shifts.Our analysis identifies Ridge Regression,particularly when augmented by Piecewise Function-Fitting,as superior in recouping hydrocarbon losses,and underscoring its utility in resource quantification refinement.Meanwhile,Kernel Ridge Regression emerges as a noteworthy strategy in ameliorating porosity-logging curve prediction for well A,evidencing its aptness for intricate geological structures.This research attests to the scientific ascendancy and broad-spectrum relevance of these regression techniques over conventional methods while heralding new horizons for their deployment in the oil and gas sector.The insights garnered from these advanced modeling strategies are set to transform geological and engineering practices in hydrocarbon prediction,evaluation,and recovery.
文摘Spatio-temporal assessment of the above ground biomass (AGB) is a cumbersome task due to the difficulties associated with the measurement of different tree parameters such as girth at breast height and height of trees. The present research was conducted in the campus of Birla Institute of Technology, Mesra, Ranchi, India, which is predomi- nantly covered by Sal (Shorea robusta C. F. Gaertn). Two methods of regression analysis was employed to determine the potential of remote sensing parameters with the AGB measured in the field such as linear regression analysis between the AGB and the individual bands, principal components (PCs) of the bands, vegetation indices (VI), and the PCs of the VIs respectively and multiple linear regression (MLR) analysis be- tween the AGB and all the variables in each category of data. From the linear regression analysis, it was found that only the NDVI exhibited regression coefficient value above 0.80 with the remaining parameters showing very low values. On the other hand, the MLR based analysis revealed significantly improved results as evidenced by the occurrence of very high correlation coefficient values of greater than 0.90 determined between the computed AGB from the MLR equations and field-estimated AGB thereby ascertaining their superiority in providing reliable estimates of AGB. The highest correlation coefficient of 0.99 is found with the MLR involving PCs of VIs.
文摘We consider a functional partially linear additive model that predicts a functional response by a scalar predictor and functional predictors. The B-spline and eigenbasis least squares estimator for both the parametric and the nonparametric components proposed. In the final of this paper, as a result, we got the variance decomposition of the model and establish the asymptotic convergence rate for estimator.
文摘The objective of this paper is to present a review of different calibration and classification methods for functional data in the context of chemometric applications. In chemometric, it is usual to measure certain parameters in terms of a set of spectrometric curves that are observed in a finite set of points (functional data). Although the predictor variable is clearly functional, this problem is usually solved by using multivariate calibration techniques that consider it as a finite set of variables associated with the observed points (wavelengths or times). But these explicative variables are highly correlated and it is therefore more informative to reconstruct first the true functional form of the predictor curves. Although it has been published in several articles related to the implementation of functional data analysis techniques in chemometric, their power to solve real problems is not yet well known. Because of this the extension of multivariate calibration techniques (linear regression, principal component regression and partial least squares) and classification methods (linear discriminant analysis and logistic regression) to the functional domain and some relevant chemometric applications are reviewed in this paper.