This paper focus on solving the problem of seafloor control point absolute positioning with low vertical accuracy based on the survey ship sailing circle. The method of dealing with the systematic error based on a sem...This paper focus on solving the problem of seafloor control point absolute positioning with low vertical accuracy based on the survey ship sailing circle. The method of dealing with the systematic error based on a semi-parametric adjustment model was proposed. Firstly, the influence of sound velocity change on ranging error is analyzed. Secondly, a semi-parametric adjustment model for determining three-dimensional coordinates of seafloor control points was established. And respectively proposed solutions under two different conditions, the observation duration is an integral multiple or non-integer multiple of the long-period term of the ranging error. The simulation experiment shows that this method can obviously improve the accuracy of vertical solution of seafloor control point compared with the difference technique and the least-squares method when internal waves exist and observation duration is less than an integer multiple of the long-period term of the ranging error.展开更多
Recurrent event gap times data frequently arise in biomedical studies and often more than one type of event is of interest. To evaluate the effects of covariates on the marginal recurrent event hazards functions, ther...Recurrent event gap times data frequently arise in biomedical studies and often more than one type of event is of interest. To evaluate the effects of covariates on the marginal recurrent event hazards functions, there exist two types of hazards models: the multiplicative hazards model and the additive hazards model. In the paper, we propose a more flexible additive-multiplicative hazards model for multiple type of recurrent gap times data, wherein some covariates are assumed to be additive while others are multiplicative. An estimating equation approach is presented to estimate the regression parameters. We establish asymptotic properties of the proposed estimators.展开更多
Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two est...Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two estimates of the marginal distribution FY of Y. One is an estimate of the modified expression of FY under H0, based on a consistent estimate of the parameter under H0, and based on the baseline distribution of the data. The other is the Kaplan-Meier-estimator of FY, together with its confidence band. The new plot, called the marginal distribution plot, can be viewed as a test for testing H0. The main advantage of the test over the existing residual tests is in the case that the data do not satisfy any Cox model or the Cox model is mis-specified. Then the new test is still valid, but not the residual tests and the residual tests often make type II error with a very large probability.展开更多
Background: Daily paediatric asthma readmissions within 28 days are a good example of a low count time series and not easily amenable to common time series methods used in studies of asthma seasonality and time trends...Background: Daily paediatric asthma readmissions within 28 days are a good example of a low count time series and not easily amenable to common time series methods used in studies of asthma seasonality and time trends. We sought to model and predict daily trends of childhood asthma readmissions over time inVictoria,Australia. Methods: We used a database of 75,000 childhood asthma admissions from the Department ofHealth,Victoria,Australiain 1997-2009. Daily admissions over time were modeled using a semi parametric Generalized Additive Model (GAM) and by sex and age group. Predictions were also estimated by using these models. Results: N = 2401 asthma readmissions within 28 days occurred during study period. Of these, n = 1358 (57%) were boys. Overall, seasonal peaks occurred in winter (30.5%) followed by autumn (28.6%) and then spring (24.6%) (p展开更多
Using data for China for the years 1991 to 2005 by province and employing the semi- parametric panel data model estimation method developed by Horowitz (2004) and Henderson et al. (2006) and Hubler's non-parametr...Using data for China for the years 1991 to 2005 by province and employing the semi- parametric panel data model estimation method developed by Horowitz (2004) and Henderson et al. (2006) and Hubler's non-parametric generalized method of moments (GMM) estimation (2005), this article constructs a dynamic semi-parametric panel data model and describes the dynamic changing trajectory of the effect on consumption of income disparity among urban residents. Our findings show that there is a significant "ratchet effect" in the consumption of urban residents; that income disparity among urban residents has a clear negative influence on consumption; and that the trajectory of this influence shows a roughly bimodal curve.展开更多
This paper introduces a semi-parametric model with right-censored data and a monotone constraint on the nonparametrie part. The authors study the local linear estimators of the parametric coefficients and apply B-spli...This paper introduces a semi-parametric model with right-censored data and a monotone constraint on the nonparametrie part. The authors study the local linear estimators of the parametric coefficients and apply B-spline method to approximate the nonparametric part based on grouped data. The authors obtain the rates of convergence for parametric and nonparametric estimators. Moreover, the authors also prove that the nonparametric estimator is consistent at the boundary. At last, the authors investigate the finite sample performance of the estimation.展开更多
This paper considers the convergence rates for nonparametric estimators of the error distribution in semi-parametric regression models. By establishing some general laws of the iterated logarithm, it shows that the ra...This paper considers the convergence rates for nonparametric estimators of the error distribution in semi-parametric regression models. By establishing some general laws of the iterated logarithm, it shows that the rates of convergence of either the empirical distribution or a smoothed version of the empirical distribution function matches exactly the rates obtained for an independent sample from the error distribution.展开更多
Econometric simultaneous equation models play an important role in making economic policies, analyzing economic structure and economic forecasting. This paper presents local linear estimators by TSLS with variable ban...Econometric simultaneous equation models play an important role in making economic policies, analyzing economic structure and economic forecasting. This paper presents local linear estimators by TSLS with variable bandwidth for every structural equation in semi-parametric simultaneous equation models in econometrics. The properties under large sample size were studied by using the asymptotic theory when all variables were random. The results show that the estimators of the parameters have consistency and asymptotic normality, and their convergence rates are equal to n^-1/2. And the estimator of the nonparametric function has the consistency and asymptotic normality in interior points and its rate of convergence is equal to the optimal convergence rate of the nonparametric function estimation.展开更多
The research is based on the double difference observations and semi-parametric model. Systematic errors are considered as the parameters to be estimated, and brought into the GPS observation equations. High precision...The research is based on the double difference observations and semi-parametric model. Systematic errors are considered as the parameters to be estimated, and brought into the GPS observation equations. High precision baselines are obtained after separating systematic errors. The crucial steps are choosing regularizer and regularization parameters in processing GPS systematic errors by using the semi-parametric model. We propose a new regularizer and apply it to dealing with systematic errors. Also, we compare it with one proposed by other researchers. This comparison is done when all the regularization parameters equal to one. The computation result of the example shows that two regularizers correspond well and they can separate systematic errors successfully. Thus, we can get high precision baselines. Compared with R=QK-1Q′, our regularizer R=GTG is simple, so, the process of resolving the high precision baselines is relatively simple.展开更多
In this article, we focus on the semi-parametric error-in-variables model with missing responses: , where yi are the response variables missing at random, are design points, ζi are the potential variables observed wi...In this article, we focus on the semi-parametric error-in-variables model with missing responses: , where yi are the response variables missing at random, are design points, ζi are the potential variables observed with measurement errors μi, the unknown slope parameter ß?and nonparametric component g(·) need to be estimated. Here we choose two different approaches to estimate ß?and g(·). Under appropriate conditions, we study the strong consistency for the proposed estimators.展开更多
Fire weather indices have been widely applied to predict fire risk in many regions of the world. The objectives of this study were to establish fire risk probability models based on fire indices over different climati...Fire weather indices have been widely applied to predict fire risk in many regions of the world. The objectives of this study were to establish fire risk probability models based on fire indices over different climatic regions in China. We linked the indices adopted in Canadian, US, and Australia with location, time, altitude, vegetation and fire characteristics during 1998-2007 in four regions using semi- parametric logistic (SPL) regression models. Different combinations of fire risk indices were selected as explanatory variables for specific regional probability model. SPL regression models of probability of fire ignition and large fire events were established to describe the non-linear relationship between fire risk indices and fire risk probabilities in the four regions. Graphs of observed versus estimated probabilities, fire risk maps, graphs of numbers of large fire events were produced from the probability models to assess the skill of these models. Fire ignition in all regions showed a significant link with altitude and NDVI. Indices of fuel moisture are important factors influencing fire occurrence in northern China. The fuel indices of organic material are significant indicators of fire risk in southern China. Besides the well skill of predicting fire risk, the probability models are a useful method to assess the utility of the fire risk indices in estimating fire events. The analysis presents some of the dynamics of climate-fire interactions and their value for management systems.展开更多
Wastewater treatment is one of critical issues faced by water utilities, and receives more and more attentions recently. The energy consumption modeling in biochemical wastewater treatment was investigated in the stud...Wastewater treatment is one of critical issues faced by water utilities, and receives more and more attentions recently. The energy consumption modeling in biochemical wastewater treatment was investigated in the study via a general and robust approach based on Bayesian semi-parametric quantile regression. The dataset was derived from a municipal wastewater treatment plant, where the energy consumption of unit chemical oxygen demand(COD) reduction was the response variable of interest. Via the proposed approach,the comprehensive regression pictures of the energy consumption and truly influencing factors, i.e., the regression relationships at lower, median and higher energy consumption levels were characterized respectively. Meanwhile, the proposals for energy saving in different cases were also facilitated specifically. First, the lower level of energy consumption was closely associated with the temperature of influent wastewater, and the chroma-rich wastewater also showed helpful in the execution of energy saving. Second, at median energy consumption level, the COD-rich wastewater played a determinative role in the reduction of energy consumption, while the higher quality of treated water led to slightly energy intensive. Third, the higher level of energy consumption was most likely to be attributed to the relatively high temperature of wastewater and total nitrogen(TN)-rich wastewater,and both of the factors were preferably to be avoided to alleviate the burden of energy consumption. The study provided an efficient approach to controlling the energy consumption of wastewater treatment in the perspective of statistical regression modeling, and offered valuable suggestions for the future energy saving.展开更多
Taxicab is an important mode in urban transportation system, while the role of taxicabs, especially the relationship with metro system has not been fully studied. This study aims at exploring the factors influencing t...Taxicab is an important mode in urban transportation system, while the role of taxicabs, especially the relationship with metro system has not been fully studied. This study aims at exploring the factors influencing the role played by taxicabs in Shanghai, China. Firstly, taxi trips are categorized into three types, namely metroreplaceable(MR), metro-extending(ME) and metro-supplement(MS) ones. Then, the tendency of travelers towards taxi or metro at a specific metro station is proposed and calculated on the basis of MR taxi trips and metro trips. Factors influencing the tendency are investigated through semi-parametric regression models, with the results indicating that the most significant factors and the influencing radii during the peak and off-peak hours are different. Some built environment factors, such as the number of hospitals and government agencies, have significant positive relationship with the tendency in the time periods. Furthermore, land use related factors, such as the increase of forestry and commercial land, generally promote taxi-hiring in the off-peak hours, while they have a negative impact during the peak hours. Findings of this study can assist governments and policy makers to understand the impact of built environment and land use on trip patterns, and thus may contribute to more reasonable policies and optimized urban planning, which may promote modal switch from taxi to subway.展开更多
This article proposes a new lack-of-test based on the weighted ratio of residuals and variances for partially linear regression models. The large and small sampling properties of the proposed test are established. The...This article proposes a new lack-of-test based on the weighted ratio of residuals and variances for partially linear regression models. The large and small sampling properties of the proposed test are established. The testing procedure is illustrated via several examples. Simulation studies show that the testing procedures are powerful even in small samples. An application of the test to a real data set is presented.展开更多
基金The National Key Research and Development Program of China(No.2016YFB0501701)The National High-tech Research and Development Program of China(No.2013AA122501)+1 种基金National Natural Science Foundation of China(Nos.4187610341874016)。
文摘This paper focus on solving the problem of seafloor control point absolute positioning with low vertical accuracy based on the survey ship sailing circle. The method of dealing with the systematic error based on a semi-parametric adjustment model was proposed. Firstly, the influence of sound velocity change on ranging error is analyzed. Secondly, a semi-parametric adjustment model for determining three-dimensional coordinates of seafloor control points was established. And respectively proposed solutions under two different conditions, the observation duration is an integral multiple or non-integer multiple of the long-period term of the ranging error. The simulation experiment shows that this method can obviously improve the accuracy of vertical solution of seafloor control point compared with the difference technique and the least-squares method when internal waves exist and observation duration is less than an integer multiple of the long-period term of the ranging error.
基金The Science Foundation(JA12301)of Fujian Educational Committeethe Teaching Quality Project(ZL0902/TZ(SJ))of Higher Education in Fujian Provincial Education Department
文摘Recurrent event gap times data frequently arise in biomedical studies and often more than one type of event is of interest. To evaluate the effects of covariates on the marginal recurrent event hazards functions, there exist two types of hazards models: the multiplicative hazards model and the additive hazards model. In the paper, we propose a more flexible additive-multiplicative hazards model for multiple type of recurrent gap times data, wherein some covariates are assumed to be additive while others are multiplicative. An estimating equation approach is presented to estimate the regression parameters. We establish asymptotic properties of the proposed estimators.
文摘Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two estimates of the marginal distribution FY of Y. One is an estimate of the modified expression of FY under H0, based on a consistent estimate of the parameter under H0, and based on the baseline distribution of the data. The other is the Kaplan-Meier-estimator of FY, together with its confidence band. The new plot, called the marginal distribution plot, can be viewed as a test for testing H0. The main advantage of the test over the existing residual tests is in the case that the data do not satisfy any Cox model or the Cox model is mis-specified. Then the new test is still valid, but not the residual tests and the residual tests often make type II error with a very large probability.
文摘Background: Daily paediatric asthma readmissions within 28 days are a good example of a low count time series and not easily amenable to common time series methods used in studies of asthma seasonality and time trends. We sought to model and predict daily trends of childhood asthma readmissions over time inVictoria,Australia. Methods: We used a database of 75,000 childhood asthma admissions from the Department ofHealth,Victoria,Australiain 1997-2009. Daily admissions over time were modeled using a semi parametric Generalized Additive Model (GAM) and by sex and age group. Predictions were also estimated by using these models. Results: N = 2401 asthma readmissions within 28 days occurred during study period. Of these, n = 1358 (57%) were boys. Overall, seasonal peaks occurred in winter (30.5%) followed by autumn (28.6%) and then spring (24.6%) (p
文摘Using data for China for the years 1991 to 2005 by province and employing the semi- parametric panel data model estimation method developed by Horowitz (2004) and Henderson et al. (2006) and Hubler's non-parametric generalized method of moments (GMM) estimation (2005), this article constructs a dynamic semi-parametric panel data model and describes the dynamic changing trajectory of the effect on consumption of income disparity among urban residents. Our findings show that there is a significant "ratchet effect" in the consumption of urban residents; that income disparity among urban residents has a clear negative influence on consumption; and that the trajectory of this influence shows a roughly bimodal curve.
基金supported by Fundamental Research Funds for the Central Universitiesthe Research Funds of Renmin University of China under Grant No.11XNK027
文摘This paper introduces a semi-parametric model with right-censored data and a monotone constraint on the nonparametrie part. The authors study the local linear estimators of the parametric coefficients and apply B-spline method to approximate the nonparametric part based on grouped data. The authors obtain the rates of convergence for parametric and nonparametric estimators. Moreover, the authors also prove that the nonparametric estimator is consistent at the boundary. At last, the authors investigate the finite sample performance of the estimation.
基金supported by the National Science Foundation of China under Grant Nos.11201422,11301481,and 11371321Zhejiang Provincial Natural Science Foundation of China under Grant Nos.Y6110639,Y6110110,LQ12A01018,and LQ12A01017+2 种基金the National Statistical Science Research Project of China under Grant No.2012LY174Foundation for Young Talents of ZJGSU under Grant No.1020XJ1314019Zhejiang Provincial Key Research Base for Humanities and Social Science Research(Statistics)
文摘This paper considers the convergence rates for nonparametric estimators of the error distribution in semi-parametric regression models. By establishing some general laws of the iterated logarithm, it shows that the rates of convergence of either the empirical distribution or a smoothed version of the empirical distribution function matches exactly the rates obtained for an independent sample from the error distribution.
基金This project is supported by National Natural Science Foundation of China (70371025)
文摘Econometric simultaneous equation models play an important role in making economic policies, analyzing economic structure and economic forecasting. This paper presents local linear estimators by TSLS with variable bandwidth for every structural equation in semi-parametric simultaneous equation models in econometrics. The properties under large sample size were studied by using the asymptotic theory when all variables were random. The results show that the estimators of the parameters have consistency and asymptotic normality, and their convergence rates are equal to n^-1/2. And the estimator of the nonparametric function has the consistency and asymptotic normality in interior points and its rate of convergence is equal to the optimal convergence rate of the nonparametric function estimation.
文摘The research is based on the double difference observations and semi-parametric model. Systematic errors are considered as the parameters to be estimated, and brought into the GPS observation equations. High precision baselines are obtained after separating systematic errors. The crucial steps are choosing regularizer and regularization parameters in processing GPS systematic errors by using the semi-parametric model. We propose a new regularizer and apply it to dealing with systematic errors. Also, we compare it with one proposed by other researchers. This comparison is done when all the regularization parameters equal to one. The computation result of the example shows that two regularizers correspond well and they can separate systematic errors successfully. Thus, we can get high precision baselines. Compared with R=QK-1Q′, our regularizer R=GTG is simple, so, the process of resolving the high precision baselines is relatively simple.
文摘In this article, we focus on the semi-parametric error-in-variables model with missing responses: , where yi are the response variables missing at random, are design points, ζi are the potential variables observed with measurement errors μi, the unknown slope parameter ß?and nonparametric component g(·) need to be estimated. Here we choose two different approaches to estimate ß?and g(·). Under appropriate conditions, we study the strong consistency for the proposed estimators.
基金supported by the National Basic Research Program, from Ministry of Science and Technology of China (No 2010CB955304)
文摘Fire weather indices have been widely applied to predict fire risk in many regions of the world. The objectives of this study were to establish fire risk probability models based on fire indices over different climatic regions in China. We linked the indices adopted in Canadian, US, and Australia with location, time, altitude, vegetation and fire characteristics during 1998-2007 in four regions using semi- parametric logistic (SPL) regression models. Different combinations of fire risk indices were selected as explanatory variables for specific regional probability model. SPL regression models of probability of fire ignition and large fire events were established to describe the non-linear relationship between fire risk indices and fire risk probabilities in the four regions. Graphs of observed versus estimated probabilities, fire risk maps, graphs of numbers of large fire events were produced from the probability models to assess the skill of these models. Fire ignition in all regions showed a significant link with altitude and NDVI. Indices of fuel moisture are important factors influencing fire occurrence in northern China. The fuel indices of organic material are significant indicators of fire risk in southern China. Besides the well skill of predicting fire risk, the probability models are a useful method to assess the utility of the fire risk indices in estimating fire events. The analysis presents some of the dynamics of climate-fire interactions and their value for management systems.
基金supported by the National Natural Science Foundation of China (Nos.51478025,11701023,71420107025)
文摘Wastewater treatment is one of critical issues faced by water utilities, and receives more and more attentions recently. The energy consumption modeling in biochemical wastewater treatment was investigated in the study via a general and robust approach based on Bayesian semi-parametric quantile regression. The dataset was derived from a municipal wastewater treatment plant, where the energy consumption of unit chemical oxygen demand(COD) reduction was the response variable of interest. Via the proposed approach,the comprehensive regression pictures of the energy consumption and truly influencing factors, i.e., the regression relationships at lower, median and higher energy consumption levels were characterized respectively. Meanwhile, the proposals for energy saving in different cases were also facilitated specifically. First, the lower level of energy consumption was closely associated with the temperature of influent wastewater, and the chroma-rich wastewater also showed helpful in the execution of energy saving. Second, at median energy consumption level, the COD-rich wastewater played a determinative role in the reduction of energy consumption, while the higher quality of treated water led to slightly energy intensive. Third, the higher level of energy consumption was most likely to be attributed to the relatively high temperature of wastewater and total nitrogen(TN)-rich wastewater,and both of the factors were preferably to be avoided to alleviate the burden of energy consumption. The study provided an efficient approach to controlling the energy consumption of wastewater treatment in the perspective of statistical regression modeling, and offered valuable suggestions for the future energy saving.
基金the Shanghai Municipal Natural Science Foundation(No.17ZR1445500)the Humanities and Social Sciences Foundation of Ministry of Education in China(Nos.18YJCZH011 and 19YJAZH077)
文摘Taxicab is an important mode in urban transportation system, while the role of taxicabs, especially the relationship with metro system has not been fully studied. This study aims at exploring the factors influencing the role played by taxicabs in Shanghai, China. Firstly, taxi trips are categorized into three types, namely metroreplaceable(MR), metro-extending(ME) and metro-supplement(MS) ones. Then, the tendency of travelers towards taxi or metro at a specific metro station is proposed and calculated on the basis of MR taxi trips and metro trips. Factors influencing the tendency are investigated through semi-parametric regression models, with the results indicating that the most significant factors and the influencing radii during the peak and off-peak hours are different. Some built environment factors, such as the number of hospitals and government agencies, have significant positive relationship with the tendency in the time periods. Furthermore, land use related factors, such as the increase of forestry and commercial land, generally promote taxi-hiring in the off-peak hours, while they have a negative impact during the peak hours. Findings of this study can assist governments and policy makers to understand the impact of built environment and land use on trip patterns, and thus may contribute to more reasonable policies and optimized urban planning, which may promote modal switch from taxi to subway.
基金partially supported by the Major Project of Humanities Social Science Foundation of Ministry of Education under Grant No.08JJD910247Key Project of Chinese Ministry of Education under Grant No. 108120+3 种基金National Natural Science Foundation of China under Grant No.11271368Beijing Natural Science Foundation under Grant No.1102021the Fundamental Research Funds for the Central Universitiesthe Research Funds of Renmin University of China under Grant Nos.10XNL018 and 11XNH107
文摘This article proposes a new lack-of-test based on the weighted ratio of residuals and variances for partially linear regression models. The large and small sampling properties of the proposed test are established. The testing procedure is illustrated via several examples. Simulation studies show that the testing procedures are powerful even in small samples. An application of the test to a real data set is presented.