This paper focus on solving the problem of seafloor control point absolute positioning with low vertical accuracy based on the survey ship sailing circle. The method of dealing with the systematic error based on a sem...This paper focus on solving the problem of seafloor control point absolute positioning with low vertical accuracy based on the survey ship sailing circle. The method of dealing with the systematic error based on a semi-parametric adjustment model was proposed. Firstly, the influence of sound velocity change on ranging error is analyzed. Secondly, a semi-parametric adjustment model for determining three-dimensional coordinates of seafloor control points was established. And respectively proposed solutions under two different conditions, the observation duration is an integral multiple or non-integer multiple of the long-period term of the ranging error. The simulation experiment shows that this method can obviously improve the accuracy of vertical solution of seafloor control point compared with the difference technique and the least-squares method when internal waves exist and observation duration is less than an integer multiple of the long-period term of the ranging error.展开更多
In this paper, using the kernel weight function, we obtain the parameter estimation of p-norm distribution in semi-parametric regression model, which is effective to decide the distribution of random errors. Under the...In this paper, using the kernel weight function, we obtain the parameter estimation of p-norm distribution in semi-parametric regression model, which is effective to decide the distribution of random errors. Under the assumption that the distribution of observations is unimodal and symmetry, this method can give the estimates of the parametric. Finally, two simulated adjustment problem are constructed to explain this method. The new method presented in this paper shows an effective way of solving the problem; the estimated values are nearer to their theoretical ones than those by least squares adjustment approach.展开更多
In this paper, a semi-parametric regression model with an adaptive LASSO penalty imposed on both the linear and the nonlinear components of the mode is considered. The model is rewritten so that a signed-rank techniqu...In this paper, a semi-parametric regression model with an adaptive LASSO penalty imposed on both the linear and the nonlinear components of the mode is considered. The model is rewritten so that a signed-rank technique can be used for estimation. The nonlinear part consists of a covariate that enters the model nonlinearly via an unknown function that is estimated using Bsplines. The author shows that the resulting estimator is consistent under heavy-tailed distributions and asymptotic normality results are given. Monte Carlo simulations as well as practical applications are studied to assess the validity of the proposed estimation method.展开更多
Consider a linear regression model Y=β'X+e.where Y may be right censored and the cdf F+o of e is unknown.We show that a modified semi-parametric MLE.denoted by .is strongly consistent under certain regularity con...Consider a linear regression model Y=β'X+e.where Y may be right censored and the cdf F+o of e is unknown.We show that a modified semi-parametric MLE.denoted by .is strongly consistent under certain regularity conditions.Moreover.if F_o is discontinuous,then P(≠βi.o.)=0, which means that P(=βif the sample size is large)=1.The latter property has not been reported for the existing estimators.By contrast,most estimators,such as the Buckley-James estimator and M-estimators .satisfy that P(≠βi.o.)=1.展开更多
Using data for China for the years 1991 to 2005 by province and employing the semi- parametric panel data model estimation method developed by Horowitz (2004) and Henderson et al. (2006) and Hubler's non-parametr...Using data for China for the years 1991 to 2005 by province and employing the semi- parametric panel data model estimation method developed by Horowitz (2004) and Henderson et al. (2006) and Hubler's non-parametric generalized method of moments (GMM) estimation (2005), this article constructs a dynamic semi-parametric panel data model and describes the dynamic changing trajectory of the effect on consumption of income disparity among urban residents. Our findings show that there is a significant "ratchet effect" in the consumption of urban residents; that income disparity among urban residents has a clear negative influence on consumption; and that the trajectory of this influence shows a roughly bimodal curve.展开更多
This paper introduces a semi-parametric model with right-censored data and a monotone constraint on the nonparametrie part. The authors study the local linear estimators of the parametric coefficients and apply B-spli...This paper introduces a semi-parametric model with right-censored data and a monotone constraint on the nonparametrie part. The authors study the local linear estimators of the parametric coefficients and apply B-spline method to approximate the nonparametric part based on grouped data. The authors obtain the rates of convergence for parametric and nonparametric estimators. Moreover, the authors also prove that the nonparametric estimator is consistent at the boundary. At last, the authors investigate the finite sample performance of the estimation.展开更多
Qin and Lawless (1994) established the statistical inference theory for the empirical likelihood of the general estimating equations. However, in many practical problems, some unknown functional parts h(t) appear in t...Qin and Lawless (1994) established the statistical inference theory for the empirical likelihood of the general estimating equations. However, in many practical problems, some unknown functional parts h(t) appear in the corresponding estimating equations EFG(X, h(T), β) = 0. In this paper, the empirical likelihood inference of combining information about unknown parameters and distribution function through the semiparametric estimating equations are developed, and the corresponding Wilk's theorem is established. The simulations of several useful models are conducted to compare the finite-sample performance of the proposed method and that of the normal approximation based method. An illustrated real example is also presented.展开更多
This paper considers the convergence rates for nonparametric estimators of the error distribution in semi-parametric regression models. By establishing some general laws of the iterated logarithm, it shows that the ra...This paper considers the convergence rates for nonparametric estimators of the error distribution in semi-parametric regression models. By establishing some general laws of the iterated logarithm, it shows that the rates of convergence of either the empirical distribution or a smoothed version of the empirical distribution function matches exactly the rates obtained for an independent sample from the error distribution.展开更多
Econometric simultaneous equation models play an important role in making economic policies, analyzing economic structure and economic forecasting. This paper presents local linear estimators by TSLS with variable ban...Econometric simultaneous equation models play an important role in making economic policies, analyzing economic structure and economic forecasting. This paper presents local linear estimators by TSLS with variable bandwidth for every structural equation in semi-parametric simultaneous equation models in econometrics. The properties under large sample size were studied by using the asymptotic theory when all variables were random. The results show that the estimators of the parameters have consistency and asymptotic normality, and their convergence rates are equal to n^-1/2. And the estimator of the nonparametric function has the consistency and asymptotic normality in interior points and its rate of convergence is equal to the optimal convergence rate of the nonparametric function estimation.展开更多
The research is based on the double difference observations and semi-parametric model. Systematic errors are considered as the parameters to be estimated, and brought into the GPS observation equations. High precision...The research is based on the double difference observations and semi-parametric model. Systematic errors are considered as the parameters to be estimated, and brought into the GPS observation equations. High precision baselines are obtained after separating systematic errors. The crucial steps are choosing regularizer and regularization parameters in processing GPS systematic errors by using the semi-parametric model. We propose a new regularizer and apply it to dealing with systematic errors. Also, we compare it with one proposed by other researchers. This comparison is done when all the regularization parameters equal to one. The computation result of the example shows that two regularizers correspond well and they can separate systematic errors successfully. Thus, we can get high precision baselines. Compared with R=QK-1Q′, our regularizer R=GTG is simple, so, the process of resolving the high precision baselines is relatively simple.展开更多
In this article, we focus on the semi-parametric error-in-variables model with missing responses: , where yi are the response variables missing at random, are design points, ζi are the potential variables observed wi...In this article, we focus on the semi-parametric error-in-variables model with missing responses: , where yi are the response variables missing at random, are design points, ζi are the potential variables observed with measurement errors μi, the unknown slope parameter ß?and nonparametric component g(·) need to be estimated. Here we choose two different approaches to estimate ß?and g(·). Under appropriate conditions, we study the strong consistency for the proposed estimators.展开更多
Recurrent event gap times data frequently arise in biomedical studies and often more than one type of event is of interest. To evaluate the effects of covariates on the marginal recurrent event hazards functions, ther...Recurrent event gap times data frequently arise in biomedical studies and often more than one type of event is of interest. To evaluate the effects of covariates on the marginal recurrent event hazards functions, there exist two types of hazards models: the multiplicative hazards model and the additive hazards model. In the paper, we propose a more flexible additive-multiplicative hazards model for multiple type of recurrent gap times data, wherein some covariates are assumed to be additive while others are multiplicative. An estimating equation approach is presented to estimate the regression parameters. We establish asymptotic properties of the proposed estimators.展开更多
Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two est...Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two estimates of the marginal distribution FY of Y. One is an estimate of the modified expression of FY under H0, based on a consistent estimate of the parameter under H0, and based on the baseline distribution of the data. The other is the Kaplan-Meier-estimator of FY, together with its confidence band. The new plot, called the marginal distribution plot, can be viewed as a test for testing H0. The main advantage of the test over the existing residual tests is in the case that the data do not satisfy any Cox model or the Cox model is mis-specified. Then the new test is still valid, but not the residual tests and the residual tests often make type II error with a very large probability.展开更多
The analysis of survival data is a major focus of statistics. Interval censored data reflect uncertainty as to the exact times the units failed within an interval. This type of data frequently comes from tests or situ...The analysis of survival data is a major focus of statistics. Interval censored data reflect uncertainty as to the exact times the units failed within an interval. This type of data frequently comes from tests or situations where the objects of interest are not constantly monitored. Thus events are known only to have occurred between the two observation periods. Interval censoring has become increasingly common in the areas that produce failure time data. This paper explores the statistical analysis of interval-censored failure time data with applications. Three different data sets, namely Breast Cancer, Hemophilia, and AIDS data were used to illustrate the methods during this study. Both parametric and nonparametric methods of analysis are carried out in this study. Theory and methodology of fitted models for the interval-censored data are described. Fitting of parametric and non-parametric models to three real data sets are considered. Results derived from different methods are presented and also compared.展开更多
Background: Daily paediatric asthma readmissions within 28 days are a good example of a low count time series and not easily amenable to common time series methods used in studies of asthma seasonality and time trends...Background: Daily paediatric asthma readmissions within 28 days are a good example of a low count time series and not easily amenable to common time series methods used in studies of asthma seasonality and time trends. We sought to model and predict daily trends of childhood asthma readmissions over time inVictoria,Australia. Methods: We used a database of 75,000 childhood asthma admissions from the Department ofHealth,Victoria,Australiain 1997-2009. Daily admissions over time were modeled using a semi parametric Generalized Additive Model (GAM) and by sex and age group. Predictions were also estimated by using these models. Results: N = 2401 asthma readmissions within 28 days occurred during study period. Of these, n = 1358 (57%) were boys. Overall, seasonal peaks occurred in winter (30.5%) followed by autumn (28.6%) and then spring (24.6%) (p展开更多
Fire weather indices have been widely applied to predict fire risk in many regions of the world. The objectives of this study were to establish fire risk probability models based on fire indices over different climati...Fire weather indices have been widely applied to predict fire risk in many regions of the world. The objectives of this study were to establish fire risk probability models based on fire indices over different climatic regions in China. We linked the indices adopted in Canadian, US, and Australia with location, time, altitude, vegetation and fire characteristics during 1998-2007 in four regions using semi- parametric logistic (SPL) regression models. Different combinations of fire risk indices were selected as explanatory variables for specific regional probability model. SPL regression models of probability of fire ignition and large fire events were established to describe the non-linear relationship between fire risk indices and fire risk probabilities in the four regions. Graphs of observed versus estimated probabilities, fire risk maps, graphs of numbers of large fire events were produced from the probability models to assess the skill of these models. Fire ignition in all regions showed a significant link with altitude and NDVI. Indices of fuel moisture are important factors influencing fire occurrence in northern China. The fuel indices of organic material are significant indicators of fire risk in southern China. Besides the well skill of predicting fire risk, the probability models are a useful method to assess the utility of the fire risk indices in estimating fire events. The analysis presents some of the dynamics of climate-fire interactions and their value for management systems.展开更多
In this study,we present a collection of local models,termed geographically weighted(GW)models,which can be found within the GWmodel R package.A GW model suits situations when spatial data are poorly described by the ...In this study,we present a collection of local models,termed geographically weighted(GW)models,which can be found within the GWmodel R package.A GW model suits situations when spatial data are poorly described by the global form,and for some regions the localized fit provides a better description.The approach uses a moving window weighting technique,where a collection of local models are estimated at target locations.Commonly,model parameters or outputs are mapped so that the nature of spatial heterogeneity can be explored and assessed.In particular,we present case studies using:(i)GW summary statistics and a GW principal components analysis;(ii)advanced GW regression fits and diagnostics;(iii)associated Monte Carlo significance tests for non-stationarity;(iv)a GW discriminant analysis;and(v)enhanced kernel bandwidth selection procedures.General Election data-sets from the Republic of Ireland and US are used for demonstration.This study is designed to complement a companion GWmodel study,which focuses on basic and robust GW models.展开更多
Wastewater treatment is one of critical issues faced by water utilities, and receives more and more attentions recently. The energy consumption modeling in biochemical wastewater treatment was investigated in the stud...Wastewater treatment is one of critical issues faced by water utilities, and receives more and more attentions recently. The energy consumption modeling in biochemical wastewater treatment was investigated in the study via a general and robust approach based on Bayesian semi-parametric quantile regression. The dataset was derived from a municipal wastewater treatment plant, where the energy consumption of unit chemical oxygen demand(COD) reduction was the response variable of interest. Via the proposed approach,the comprehensive regression pictures of the energy consumption and truly influencing factors, i.e., the regression relationships at lower, median and higher energy consumption levels were characterized respectively. Meanwhile, the proposals for energy saving in different cases were also facilitated specifically. First, the lower level of energy consumption was closely associated with the temperature of influent wastewater, and the chroma-rich wastewater also showed helpful in the execution of energy saving. Second, at median energy consumption level, the COD-rich wastewater played a determinative role in the reduction of energy consumption, while the higher quality of treated water led to slightly energy intensive. Third, the higher level of energy consumption was most likely to be attributed to the relatively high temperature of wastewater and total nitrogen(TN)-rich wastewater,and both of the factors were preferably to be avoided to alleviate the burden of energy consumption. The study provided an efficient approach to controlling the energy consumption of wastewater treatment in the perspective of statistical regression modeling, and offered valuable suggestions for the future energy saving.展开更多
Taxicab is an important mode in urban transportation system, while the role of taxicabs, especially the relationship with metro system has not been fully studied. This study aims at exploring the factors influencing t...Taxicab is an important mode in urban transportation system, while the role of taxicabs, especially the relationship with metro system has not been fully studied. This study aims at exploring the factors influencing the role played by taxicabs in Shanghai, China. Firstly, taxi trips are categorized into three types, namely metroreplaceable(MR), metro-extending(ME) and metro-supplement(MS) ones. Then, the tendency of travelers towards taxi or metro at a specific metro station is proposed and calculated on the basis of MR taxi trips and metro trips. Factors influencing the tendency are investigated through semi-parametric regression models, with the results indicating that the most significant factors and the influencing radii during the peak and off-peak hours are different. Some built environment factors, such as the number of hospitals and government agencies, have significant positive relationship with the tendency in the time periods. Furthermore, land use related factors, such as the increase of forestry and commercial land, generally promote taxi-hiring in the off-peak hours, while they have a negative impact during the peak hours. Findings of this study can assist governments and policy makers to understand the impact of built environment and land use on trip patterns, and thus may contribute to more reasonable policies and optimized urban planning, which may promote modal switch from taxi to subway.展开更多
This paper studies the estimation of the partially linear panel data models,allowing for cross-sectional dependence through a common factors structure.This semiparametric additive partial linear framework,including bo...This paper studies the estimation of the partially linear panel data models,allowing for cross-sectional dependence through a common factors structure.This semiparametric additive partial linear framework,including both linear and nonlinear additive components,is more flexible compared to linear models,and is preferred to a fully nonparametric regression because of the‘curse of dimensionality’.The consistency and asymptotic normality of the proposed estimators are established for the case where both cross-sectional dimension and temporal dimension go to infinity.The theoretical findings are further supported for small samples via a Monte Carlo study.The results suggest that the proposed method is robust to a wide variety of data generation processes.展开更多
基金The National Key Research and Development Program of China(No.2016YFB0501701)The National High-tech Research and Development Program of China(No.2013AA122501)+1 种基金National Natural Science Foundation of China(Nos.4187610341874016)。
文摘This paper focus on solving the problem of seafloor control point absolute positioning with low vertical accuracy based on the survey ship sailing circle. The method of dealing with the systematic error based on a semi-parametric adjustment model was proposed. Firstly, the influence of sound velocity change on ranging error is analyzed. Secondly, a semi-parametric adjustment model for determining three-dimensional coordinates of seafloor control points was established. And respectively proposed solutions under two different conditions, the observation duration is an integral multiple or non-integer multiple of the long-period term of the ranging error. The simulation experiment shows that this method can obviously improve the accuracy of vertical solution of seafloor control point compared with the difference technique and the least-squares method when internal waves exist and observation duration is less than an integer multiple of the long-period term of the ranging error.
文摘In this paper, using the kernel weight function, we obtain the parameter estimation of p-norm distribution in semi-parametric regression model, which is effective to decide the distribution of random errors. Under the assumption that the distribution of observations is unimodal and symmetry, this method can give the estimates of the parametric. Finally, two simulated adjustment problem are constructed to explain this method. The new method presented in this paper shows an effective way of solving the problem; the estimated values are nearer to their theoretical ones than those by least squares adjustment approach.
文摘In this paper, a semi-parametric regression model with an adaptive LASSO penalty imposed on both the linear and the nonlinear components of the mode is considered. The model is rewritten so that a signed-rank technique can be used for estimation. The nonlinear part consists of a covariate that enters the model nonlinearly via an unknown function that is estimated using Bsplines. The author shows that the resulting estimator is consistent under heavy-tailed distributions and asymptotic normality results are given. Monte Carlo simulations as well as practical applications are studied to assess the validity of the proposed estimation method.
基金Supported by the U.S.Army Grants DAMD17-99-1-9390 and DAMD17-01-0448
文摘Consider a linear regression model Y=β'X+e.where Y may be right censored and the cdf F+o of e is unknown.We show that a modified semi-parametric MLE.denoted by .is strongly consistent under certain regularity conditions.Moreover.if F_o is discontinuous,then P(≠βi.o.)=0, which means that P(=βif the sample size is large)=1.The latter property has not been reported for the existing estimators.By contrast,most estimators,such as the Buckley-James estimator and M-estimators .satisfy that P(≠βi.o.)=1.
文摘Using data for China for the years 1991 to 2005 by province and employing the semi- parametric panel data model estimation method developed by Horowitz (2004) and Henderson et al. (2006) and Hubler's non-parametric generalized method of moments (GMM) estimation (2005), this article constructs a dynamic semi-parametric panel data model and describes the dynamic changing trajectory of the effect on consumption of income disparity among urban residents. Our findings show that there is a significant "ratchet effect" in the consumption of urban residents; that income disparity among urban residents has a clear negative influence on consumption; and that the trajectory of this influence shows a roughly bimodal curve.
基金supported by Fundamental Research Funds for the Central Universitiesthe Research Funds of Renmin University of China under Grant No.11XNK027
文摘This paper introduces a semi-parametric model with right-censored data and a monotone constraint on the nonparametrie part. The authors study the local linear estimators of the parametric coefficients and apply B-spline method to approximate the nonparametric part based on grouped data. The authors obtain the rates of convergence for parametric and nonparametric estimators. Moreover, the authors also prove that the nonparametric estimator is consistent at the boundary. At last, the authors investigate the finite sample performance of the estimation.
基金supported partly by National Natural Science Foundation of China (Grant Nos. 11071022, 11028103 and 11201317)Key Project of Ministry of Education of China (Grant No. 309007)National High-tech R&D Program of China (Grant No. 2008AA12Z107)
文摘Qin and Lawless (1994) established the statistical inference theory for the empirical likelihood of the general estimating equations. However, in many practical problems, some unknown functional parts h(t) appear in the corresponding estimating equations EFG(X, h(T), β) = 0. In this paper, the empirical likelihood inference of combining information about unknown parameters and distribution function through the semiparametric estimating equations are developed, and the corresponding Wilk's theorem is established. The simulations of several useful models are conducted to compare the finite-sample performance of the proposed method and that of the normal approximation based method. An illustrated real example is also presented.
基金supported by the National Science Foundation of China under Grant Nos.11201422,11301481,and 11371321Zhejiang Provincial Natural Science Foundation of China under Grant Nos.Y6110639,Y6110110,LQ12A01018,and LQ12A01017+2 种基金the National Statistical Science Research Project of China under Grant No.2012LY174Foundation for Young Talents of ZJGSU under Grant No.1020XJ1314019Zhejiang Provincial Key Research Base for Humanities and Social Science Research(Statistics)
文摘This paper considers the convergence rates for nonparametric estimators of the error distribution in semi-parametric regression models. By establishing some general laws of the iterated logarithm, it shows that the rates of convergence of either the empirical distribution or a smoothed version of the empirical distribution function matches exactly the rates obtained for an independent sample from the error distribution.
基金This project is supported by National Natural Science Foundation of China (70371025)
文摘Econometric simultaneous equation models play an important role in making economic policies, analyzing economic structure and economic forecasting. This paper presents local linear estimators by TSLS with variable bandwidth for every structural equation in semi-parametric simultaneous equation models in econometrics. The properties under large sample size were studied by using the asymptotic theory when all variables were random. The results show that the estimators of the parameters have consistency and asymptotic normality, and their convergence rates are equal to n^-1/2. And the estimator of the nonparametric function has the consistency and asymptotic normality in interior points and its rate of convergence is equal to the optimal convergence rate of the nonparametric function estimation.
文摘The research is based on the double difference observations and semi-parametric model. Systematic errors are considered as the parameters to be estimated, and brought into the GPS observation equations. High precision baselines are obtained after separating systematic errors. The crucial steps are choosing regularizer and regularization parameters in processing GPS systematic errors by using the semi-parametric model. We propose a new regularizer and apply it to dealing with systematic errors. Also, we compare it with one proposed by other researchers. This comparison is done when all the regularization parameters equal to one. The computation result of the example shows that two regularizers correspond well and they can separate systematic errors successfully. Thus, we can get high precision baselines. Compared with R=QK-1Q′, our regularizer R=GTG is simple, so, the process of resolving the high precision baselines is relatively simple.
文摘In this article, we focus on the semi-parametric error-in-variables model with missing responses: , where yi are the response variables missing at random, are design points, ζi are the potential variables observed with measurement errors μi, the unknown slope parameter ß?and nonparametric component g(·) need to be estimated. Here we choose two different approaches to estimate ß?and g(·). Under appropriate conditions, we study the strong consistency for the proposed estimators.
基金The Science Foundation(JA12301)of Fujian Educational Committeethe Teaching Quality Project(ZL0902/TZ(SJ))of Higher Education in Fujian Provincial Education Department
文摘Recurrent event gap times data frequently arise in biomedical studies and often more than one type of event is of interest. To evaluate the effects of covariates on the marginal recurrent event hazards functions, there exist two types of hazards models: the multiplicative hazards model and the additive hazards model. In the paper, we propose a more flexible additive-multiplicative hazards model for multiple type of recurrent gap times data, wherein some covariates are assumed to be additive while others are multiplicative. An estimating equation approach is presented to estimate the regression parameters. We establish asymptotic properties of the proposed estimators.
文摘Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two estimates of the marginal distribution FY of Y. One is an estimate of the modified expression of FY under H0, based on a consistent estimate of the parameter under H0, and based on the baseline distribution of the data. The other is the Kaplan-Meier-estimator of FY, together with its confidence band. The new plot, called the marginal distribution plot, can be viewed as a test for testing H0. The main advantage of the test over the existing residual tests is in the case that the data do not satisfy any Cox model or the Cox model is mis-specified. Then the new test is still valid, but not the residual tests and the residual tests often make type II error with a very large probability.
文摘The analysis of survival data is a major focus of statistics. Interval censored data reflect uncertainty as to the exact times the units failed within an interval. This type of data frequently comes from tests or situations where the objects of interest are not constantly monitored. Thus events are known only to have occurred between the two observation periods. Interval censoring has become increasingly common in the areas that produce failure time data. This paper explores the statistical analysis of interval-censored failure time data with applications. Three different data sets, namely Breast Cancer, Hemophilia, and AIDS data were used to illustrate the methods during this study. Both parametric and nonparametric methods of analysis are carried out in this study. Theory and methodology of fitted models for the interval-censored data are described. Fitting of parametric and non-parametric models to three real data sets are considered. Results derived from different methods are presented and also compared.
文摘Background: Daily paediatric asthma readmissions within 28 days are a good example of a low count time series and not easily amenable to common time series methods used in studies of asthma seasonality and time trends. We sought to model and predict daily trends of childhood asthma readmissions over time inVictoria,Australia. Methods: We used a database of 75,000 childhood asthma admissions from the Department ofHealth,Victoria,Australiain 1997-2009. Daily admissions over time were modeled using a semi parametric Generalized Additive Model (GAM) and by sex and age group. Predictions were also estimated by using these models. Results: N = 2401 asthma readmissions within 28 days occurred during study period. Of these, n = 1358 (57%) were boys. Overall, seasonal peaks occurred in winter (30.5%) followed by autumn (28.6%) and then spring (24.6%) (p
基金supported by the National Basic Research Program, from Ministry of Science and Technology of China (No 2010CB955304)
文摘Fire weather indices have been widely applied to predict fire risk in many regions of the world. The objectives of this study were to establish fire risk probability models based on fire indices over different climatic regions in China. We linked the indices adopted in Canadian, US, and Australia with location, time, altitude, vegetation and fire characteristics during 1998-2007 in four regions using semi- parametric logistic (SPL) regression models. Different combinations of fire risk indices were selected as explanatory variables for specific regional probability model. SPL regression models of probability of fire ignition and large fire events were established to describe the non-linear relationship between fire risk indices and fire risk probabilities in the four regions. Graphs of observed versus estimated probabilities, fire risk maps, graphs of numbers of large fire events were produced from the probability models to assess the skill of these models. Fire ignition in all regions showed a significant link with altitude and NDVI. Indices of fuel moisture are important factors influencing fire occurrence in northern China. The fuel indices of organic material are significant indicators of fire risk in southern China. Besides the well skill of predicting fire risk, the probability models are a useful method to assess the utility of the fire risk indices in estimating fire events. The analysis presents some of the dynamics of climate-fire interactions and their value for management systems.
基金presented in this paper was funded by a Strategic Research Cluster grant(07/SRC/I1168)by Science Foundation Ireland under the National Development Plan.
文摘In this study,we present a collection of local models,termed geographically weighted(GW)models,which can be found within the GWmodel R package.A GW model suits situations when spatial data are poorly described by the global form,and for some regions the localized fit provides a better description.The approach uses a moving window weighting technique,where a collection of local models are estimated at target locations.Commonly,model parameters or outputs are mapped so that the nature of spatial heterogeneity can be explored and assessed.In particular,we present case studies using:(i)GW summary statistics and a GW principal components analysis;(ii)advanced GW regression fits and diagnostics;(iii)associated Monte Carlo significance tests for non-stationarity;(iv)a GW discriminant analysis;and(v)enhanced kernel bandwidth selection procedures.General Election data-sets from the Republic of Ireland and US are used for demonstration.This study is designed to complement a companion GWmodel study,which focuses on basic and robust GW models.
基金supported by the National Natural Science Foundation of China (Nos.51478025,11701023,71420107025)
文摘Wastewater treatment is one of critical issues faced by water utilities, and receives more and more attentions recently. The energy consumption modeling in biochemical wastewater treatment was investigated in the study via a general and robust approach based on Bayesian semi-parametric quantile regression. The dataset was derived from a municipal wastewater treatment plant, where the energy consumption of unit chemical oxygen demand(COD) reduction was the response variable of interest. Via the proposed approach,the comprehensive regression pictures of the energy consumption and truly influencing factors, i.e., the regression relationships at lower, median and higher energy consumption levels were characterized respectively. Meanwhile, the proposals for energy saving in different cases were also facilitated specifically. First, the lower level of energy consumption was closely associated with the temperature of influent wastewater, and the chroma-rich wastewater also showed helpful in the execution of energy saving. Second, at median energy consumption level, the COD-rich wastewater played a determinative role in the reduction of energy consumption, while the higher quality of treated water led to slightly energy intensive. Third, the higher level of energy consumption was most likely to be attributed to the relatively high temperature of wastewater and total nitrogen(TN)-rich wastewater,and both of the factors were preferably to be avoided to alleviate the burden of energy consumption. The study provided an efficient approach to controlling the energy consumption of wastewater treatment in the perspective of statistical regression modeling, and offered valuable suggestions for the future energy saving.
基金the Shanghai Municipal Natural Science Foundation(No.17ZR1445500)the Humanities and Social Sciences Foundation of Ministry of Education in China(Nos.18YJCZH011 and 19YJAZH077)
文摘Taxicab is an important mode in urban transportation system, while the role of taxicabs, especially the relationship with metro system has not been fully studied. This study aims at exploring the factors influencing the role played by taxicabs in Shanghai, China. Firstly, taxi trips are categorized into three types, namely metroreplaceable(MR), metro-extending(ME) and metro-supplement(MS) ones. Then, the tendency of travelers towards taxi or metro at a specific metro station is proposed and calculated on the basis of MR taxi trips and metro trips. Factors influencing the tendency are investigated through semi-parametric regression models, with the results indicating that the most significant factors and the influencing radii during the peak and off-peak hours are different. Some built environment factors, such as the number of hospitals and government agencies, have significant positive relationship with the tendency in the time periods. Furthermore, land use related factors, such as the increase of forestry and commercial land, generally promote taxi-hiring in the off-peak hours, while they have a negative impact during the peak hours. Findings of this study can assist governments and policy makers to understand the impact of built environment and land use on trip patterns, and thus may contribute to more reasonable policies and optimized urban planning, which may promote modal switch from taxi to subway.
基金supported by the National Natural Science Foundation of China under Grant Nos.71703156,71988101,and 72073126。
文摘This paper studies the estimation of the partially linear panel data models,allowing for cross-sectional dependence through a common factors structure.This semiparametric additive partial linear framework,including both linear and nonlinear additive components,is more flexible compared to linear models,and is preferred to a fully nonparametric regression because of the‘curse of dimensionality’.The consistency and asymptotic normality of the proposed estimators are established for the case where both cross-sectional dimension and temporal dimension go to infinity.The theoretical findings are further supported for small samples via a Monte Carlo study.The results suggest that the proposed method is robust to a wide variety of data generation processes.