As an emergency and auxiliary power source for aircraft,lithium(Li)-ion batteries are important components of aerospace power systems.The Remaining Useful Life(RUL)prediction of Li-ion batteries is a key technology to...As an emergency and auxiliary power source for aircraft,lithium(Li)-ion batteries are important components of aerospace power systems.The Remaining Useful Life(RUL)prediction of Li-ion batteries is a key technology to ensure the reliable operation of aviation power systems.Particle Filter(PF)is an effective method to predict the RUL of Li-ion batteries because of its uncertainty representation and management ability.However,there are problems that particle weights cannot be updated in the prediction stage and particles degradation.To settle these issues,an innovative technique of F-distribution PF and Kernel Smoothing(FPFKS)algorithm is proposed.In the prediction stage,the weights of the particles are dynamically updated by the F kernel instead of being fixed all the time.Meanwhile,a first-order independent Markov capacity degradation model is established.Moreover,the kernel smoothing algorithm is integrated into PF,so that the variance of the parameters of capacity degradation model keeps invariant.Experiments based on NASA battery data sets show that FPFKS can be excellently applied to RUL prediction of Liion batteries.展开更多
In the seismic profile interpretation process,as the seismic data are big and the small geological features are difficult to identify,improvement of the efficiency is needed. In this study,structure tensor method in c...In the seismic profile interpretation process,as the seismic data are big and the small geological features are difficult to identify,improvement of the efficiency is needed. In this study,structure tensor method in computer image edge detection processing is applied into the 2D seismic profile. Coherent attribute is used to extract formation edge. At the same time,extracting the eigenvalues and eigenvectors to calculate the seismic geometric properties which include dip and apparent dip,automatic identification is achieved. Testing the Gaussian kernel function with synthetic models and comparing the coherent attribute and dip attribute extraction results before and after,the conclusion that Gaussian filter can remove the random noise is obtained.展开更多
The issue of selection of bandwidth in kernel smoothing method is considered within the context of partially linear models, hi this paper, we study the asymptotic behavior of the bandwidth choice based on generalized ...The issue of selection of bandwidth in kernel smoothing method is considered within the context of partially linear models, hi this paper, we study the asymptotic behavior of the bandwidth choice based on generalized cross-validation (CCV) approach and prove that this bandwidth choice is asymptotically optimal. Numerical simulation are also conducted to investigate the empirical performance of generalized cross-valldation.展开更多
There are already a lot of models to fit a set of stationary time series, such as AR, MA, and ARMA models. For the non-stationary data, an ARIMA or seasonal ARIMA models can be used to fit the given data. Moreover, th...There are already a lot of models to fit a set of stationary time series, such as AR, MA, and ARMA models. For the non-stationary data, an ARIMA or seasonal ARIMA models can be used to fit the given data. Moreover, there are also many statistical softwares that can be used to build a stationary or non-stationary time series model for a given set of time series data, such as SAS, SPLUS, etc. However, some statistical softwares wouldn't work well for small samples with or without missing data, especially for small time series data with seasonal trend. A nonparametric smoothing technique to build a forecasting model for a given small seasonal time series data is carried out in this paper. And then, both the method provided in this paper and that in SAS package are applied to the modeling of international airline passengers data respectively, the comparisons between the two methods are done afterwards. The results of the comparison show us the method provided in this paper has superiority over SAS's method.展开更多
In this paper, we present influence analysis that can identify observations: which locally infulence the choice of smoothing parameter by cross-validation method when fitting a kelnel regnssion model. The diagnostic m...In this paper, we present influence analysis that can identify observations: which locally infulence the choice of smoothing parameter by cross-validation method when fitting a kelnel regnssion model. The diagnostic methods are illustrated with examples.展开更多
To reduce the influences of outliers on support vector machine(SVM) classification problem,a new tangent loss function was constructed.Since the tangent loss function was not smooth in some interval,a smoothing functi...To reduce the influences of outliers on support vector machine(SVM) classification problem,a new tangent loss function was constructed.Since the tangent loss function was not smooth in some interval,a smoothing function was used to approximate it in this interval.According to this loss function,the corresponding tangent SVM(TSVM) was got.The experimental results show that TSVM is less sensitive to outliers than SVM.So the proposed new loss function and TSVM are both effective.展开更多
We consider the problem of parameter estimation in both linear and nonlinear ordinary differential equation(ODE) models. Nonlinear ODE models are widely used in applications. But their analytic solutions are usually...We consider the problem of parameter estimation in both linear and nonlinear ordinary differential equation(ODE) models. Nonlinear ODE models are widely used in applications. But their analytic solutions are usually not available. Thus regular methods usually depend on repetitive use of numerical solutions which bring huge computational cost. We proposed a new two-stage approach which includes a smoothing method(kernel smoothing or local polynomial fitting) in the first stage, and a numerical discretization method(Eulers discretization method, the trapezoidal discretization method,or the Runge–Kutta discretization method) in the second stage. Through numerical simulations, we find the proposed method gains a proper balance between estimation accuracy and computational cost.Asymptotic properties are also presented, which show the consistency and asymptotic normality of estimators under some mild conditions. The proposed method is compared to existing methods in term of accuracy and computational cost. The simulation results show that the estimators with local linear smoothing in the first stage and trapezoidal discretization in the second stage have the lowest average relative errors. We apply the proposed method to HIV dynamics data to illustrate the practicability of the estimator.展开更多
In real data analysis,the underlying model is frequently unknown.Hence,the modeling strategy plays a key role in the success of data analysis.Inspired by the idea of model averaging,we propose a novel semiparametric m...In real data analysis,the underlying model is frequently unknown.Hence,the modeling strategy plays a key role in the success of data analysis.Inspired by the idea of model averaging,we propose a novel semiparametric modeling strategy for the conditional quantile prediction,without assuming that the underlying model is any specific parametric or semiparametric model.Due to the optimality of the weights selected by leaveone-out cross-validation,the proposed modeling strategy provides a more precise prediction than those based on some commonly used semiparametric models such as the varying coefficient and additive models.Asymptotic properties are established in the proposed modeling strategy along with its estimation procedure.We conducted extensive simulations to compare our method with alternatives across various scenarios.The results show that our method provides more accurate predictions.Finally,we applied our approach to the Boston housing data,yielding more precise quantile predictions of house prices compared with commonly used methods,and thus offering a clearer picture of the Boston housing market.展开更多
The Box-Cox transformation model has been widely used in applied econometrics, positive accounting, positive finance and statistics. There is a large literature on Box-Cox transformation model with linear structure. H...The Box-Cox transformation model has been widely used in applied econometrics, positive accounting, positive finance and statistics. There is a large literature on Box-Cox transformation model with linear structure. However, there is seldom seen on the discussion for such a model with partially linear structure. Considering the importance of the partially linear model, in this paper, a relatively simple semi-parametric estimation procedure is proposed for the Box-Cox transformation model without presuming the linear functional form and without specifying any parametric form of the disturbance, which largely reduces the risk of model misspecification. We show that the proposed estimator is consistent and asymptotically normally distributed. Its covariance matrix is also in a closed form, which can be easily estimated. Finally, a simulation study is conducted to see the finite sample performance of our estimator.展开更多
The estimated seismic hazard based on the delineated seismic source model is used as the basis to assign the seismic design loads in Canadian structural design codes.An alternative for the estimation is based on a spa...The estimated seismic hazard based on the delineated seismic source model is used as the basis to assign the seismic design loads in Canadian structural design codes.An alternative for the estimation is based on a spatially smoothed source model.However,a quantification of differences in the Canadian seismic hazard maps(CanSHMs)obtained based on the delineated seismic source model and spatially smoothed model is unavailable.The quantification is valuable to identify epistemic uncertainty in the estimated seismic hazard and the degree of uncertainty in the CanSHMs.In the present study,we developed seismic source models using spatial smoothing and historical earthquake catalogue.We quantified the differences in the estimated Canadian seismic hazard by considering the delineated source model and spatially smoothed source models.For the development of the spatially smoothed seismic source models,we considered spatial kernel smoothing techniques with or without adaptive bandwidth.The results indicate that the use of the delineated seismic source model could lead to under or over-estimation of the seismic hazard as compared to those estimated based on spatially smoothed seismic source models.This suggests that an epistemic uncertainty caused by the seismic source models should be considered to map the seismic hazard.展开更多
It is of great interest to estimate quantile residual lifetime in medical science and many other fields. In survival analysis, Kaplan-Meier(K-M) estimator has been widely used to estimate the survival distribution. ...It is of great interest to estimate quantile residual lifetime in medical science and many other fields. In survival analysis, Kaplan-Meier(K-M) estimator has been widely used to estimate the survival distribution. However, it is well-known that the K-M estimator is not continuous, thus it can not always be used to calculate quantile residual lifetime. In this paper, the authors propose a kernel smoothing method to give an estimator of quantile residual lifetime. By using modern empirical process techniques, the consistency and the asymptotic normality of the proposed estimator are provided neatly.The authors also present the empirical small sample performances of the estimator. Deficiency is introduced to compare the performance of the proposed estimator with the naive unsmoothed estimator of the quantile residaul lifetime. Further simulation studies indicate that the proposed estimator performs very well.展开更多
As biological studies become more expensive to conduct,it is a frequently encountered question that how to take advantage of the available auxiliary covariate information when the exposure variable is not measured.In ...As biological studies become more expensive to conduct,it is a frequently encountered question that how to take advantage of the available auxiliary covariate information when the exposure variable is not measured.In this paper,we propose an induced cure rate mean residual life time regression model to accommodate the survival data with cure fraction and auxiliary covariate,in which the exposure variable is only assessed in a validation set,but a corresponding continuous auxiliary covariate is ascertained for all subjects in the study cohort.Simulation studies elucidate the practical performance of the proposed method under finite samples.As an illustration,we apply the proposed method to a heart disease data from the Study of Left Ventricular Dysfunction.展开更多
Missing covariate data arise frequently in biomedical studies.In this article,we propose a class of weighted estimating equations for the additive hazards regression model when some of the covariates are missing at ra...Missing covariate data arise frequently in biomedical studies.In this article,we propose a class of weighted estimating equations for the additive hazards regression model when some of the covariates are missing at random.Time-specific and subject-specific weights are incorporated into the formulation of weighted estimating equations.Unified results are established for estimating selection probabilities that cover both parametric and non-parametric modelling schemes.The resulting estimators have closed forms and are shown to be consistent and asymptotically normal.Simulation studies indicate that the proposed estimators perform well for practical settings.An application to a mouse leukemia study is illustrated.展开更多
Dimensional variation analysis in multistation manufacturing processes(MMPs)is a challenging research topic with great practical significance.Researchers have been focused on constructing various mathematical models t...Dimensional variation analysis in multistation manufacturing processes(MMPs)is a challenging research topic with great practical significance.Researchers have been focused on constructing various mathematical models to identify the correlations among the huge amounts of collected production data.However,current models have achieved insufficient insights into the variation correlation laws due to the complexity of the data’s mutual relations.In this study,a data-driven modeling method is developed for deep data-mining and dimensional variation analysis.The proposed initial mathematical expression originates from practical engineering knowledge.Through a mathematical treatment,the mathematical expression is transformed into a first-order AR(1)model format,which contains multiple dimensional variations’interstation and temporal correlating information.To obtain this information,the estimation of the proposed model is discussed in detail.A simulation case involving two key product characteristics of a grinding process is used to demonstrate the effectiveness and accuracy of the proposed method for dimensional variation analysis in MMPs.展开更多
基金co-supported by Aeronautical Science Foundation of China (No. 20183352030)Fund Project of Equipment Pre-research Field of China (No. JZX7Y20190243016301)
文摘As an emergency and auxiliary power source for aircraft,lithium(Li)-ion batteries are important components of aerospace power systems.The Remaining Useful Life(RUL)prediction of Li-ion batteries is a key technology to ensure the reliable operation of aviation power systems.Particle Filter(PF)is an effective method to predict the RUL of Li-ion batteries because of its uncertainty representation and management ability.However,there are problems that particle weights cannot be updated in the prediction stage and particles degradation.To settle these issues,an innovative technique of F-distribution PF and Kernel Smoothing(FPFKS)algorithm is proposed.In the prediction stage,the weights of the particles are dynamically updated by the F kernel instead of being fixed all the time.Meanwhile,a first-order independent Markov capacity degradation model is established.Moreover,the kernel smoothing algorithm is integrated into PF,so that the variance of the parameters of capacity degradation model keeps invariant.Experiments based on NASA battery data sets show that FPFKS can be excellently applied to RUL prediction of Liion batteries.
基金Support by National Natural Science Foundation of China(No.41274120)
文摘In the seismic profile interpretation process,as the seismic data are big and the small geological features are difficult to identify,improvement of the efficiency is needed. In this study,structure tensor method in computer image edge detection processing is applied into the 2D seismic profile. Coherent attribute is used to extract formation edge. At the same time,extracting the eigenvalues and eigenvectors to calculate the seismic geometric properties which include dip and apparent dip,automatic identification is achieved. Testing the Gaussian kernel function with synthetic models and comparing the coherent attribute and dip attribute extraction results before and after,the conclusion that Gaussian filter can remove the random noise is obtained.
文摘The issue of selection of bandwidth in kernel smoothing method is considered within the context of partially linear models, hi this paper, we study the asymptotic behavior of the bandwidth choice based on generalized cross-validation (CCV) approach and prove that this bandwidth choice is asymptotically optimal. Numerical simulation are also conducted to investigate the empirical performance of generalized cross-valldation.
基金Supported by the National Natural Science Foundation of China(No.10371034)
文摘There are already a lot of models to fit a set of stationary time series, such as AR, MA, and ARMA models. For the non-stationary data, an ARIMA or seasonal ARIMA models can be used to fit the given data. Moreover, there are also many statistical softwares that can be used to build a stationary or non-stationary time series model for a given set of time series data, such as SAS, SPLUS, etc. However, some statistical softwares wouldn't work well for small samples with or without missing data, especially for small time series data with seasonal trend. A nonparametric smoothing technique to build a forecasting model for a given small seasonal time series data is carried out in this paper. And then, both the method provided in this paper and that in SAS package are applied to the modeling of international airline passengers data respectively, the comparisons between the two methods are done afterwards. The results of the comparison show us the method provided in this paper has superiority over SAS's method.
文摘In this paper, we present influence analysis that can identify observations: which locally infulence the choice of smoothing parameter by cross-validation method when fitting a kelnel regnssion model. The diagnostic methods are illustrated with examples.
基金National Natural Science Foundations of China(Nos.61272015,11201123)the Scientific Research Foundation for the Doctor of Henan University of Science&Technology,China(No.09001476)School Foundation of Henan University of Science&Technology,China(No.2012QN011)
文摘To reduce the influences of outliers on support vector machine(SVM) classification problem,a new tangent loss function was constructed.Since the tangent loss function was not smooth in some interval,a smoothing function was used to approximate it in this interval.According to this loss function,the corresponding tangent SVM(TSVM) was got.The experimental results show that TSVM is less sensitive to outliers than SVM.So the proposed new loss function and TSVM are both effective.
基金Supported by NSFC(Grant Nos.11201317,11028103,11231010,11471223)Doctoral Fund of Ministry of Education of China(Grant No.20111108120002)+1 种基金the Beijing Municipal Education Commission Foundation(Grant No.KM201210028005)the Key Project of Beijing Municipal Educational Commission
文摘We consider the problem of parameter estimation in both linear and nonlinear ordinary differential equation(ODE) models. Nonlinear ODE models are widely used in applications. But their analytic solutions are usually not available. Thus regular methods usually depend on repetitive use of numerical solutions which bring huge computational cost. We proposed a new two-stage approach which includes a smoothing method(kernel smoothing or local polynomial fitting) in the first stage, and a numerical discretization method(Eulers discretization method, the trapezoidal discretization method,or the Runge–Kutta discretization method) in the second stage. Through numerical simulations, we find the proposed method gains a proper balance between estimation accuracy and computational cost.Asymptotic properties are also presented, which show the consistency and asymptotic normality of estimators under some mild conditions. The proposed method is compared to existing methods in term of accuracy and computational cost. The simulation results show that the estimators with local linear smoothing in the first stage and trapezoidal discretization in the second stage have the lowest average relative errors. We apply the proposed method to HIV dynamics data to illustrate the practicability of the estimator.
基金supported by National Natural Science Foundation of China(Grant Nos.11931014 and 12201091)the Natural Science Foundation of Chongqing(Grant No.CSTB2022NSCQ-MSX0852)+1 种基金the National Statistical Science Research Program of China(Grant No.2022LY019)the Science and Technology Research Program of the Chongqing Municipal Education Commission(Grant No.KJQN202100526)。
文摘In real data analysis,the underlying model is frequently unknown.Hence,the modeling strategy plays a key role in the success of data analysis.Inspired by the idea of model averaging,we propose a novel semiparametric modeling strategy for the conditional quantile prediction,without assuming that the underlying model is any specific parametric or semiparametric model.Due to the optimality of the weights selected by leaveone-out cross-validation,the proposed modeling strategy provides a more precise prediction than those based on some commonly used semiparametric models such as the varying coefficient and additive models.Asymptotic properties are established in the proposed modeling strategy along with its estimation procedure.We conducted extensive simulations to compare our method with alternatives across various scenarios.The results show that our method provides more accurate predictions.Finally,we applied our approach to the Boston housing data,yielding more precise quantile predictions of house prices compared with commonly used methods,and thus offering a clearer picture of the Boston housing market.
基金funded in part by National Natural Science Foundation of China (Grant No. 71032005)the MOE Project of Key Research Institute of Humanities and Social Science in University (Grant No. 10JJD630005)+3 种基金supported in part by New Century Excellent Talent Supporting program (Grant No. NCET-09-0538)National Natural Science Foundation of China(Grant Nos. 70871073 and 71171127)Shanghai Leading Academic Discipline Project (Grant No. B801)the Key Laboratory of Mathematical Economics (SUFE), Ministry of Education of China
文摘The Box-Cox transformation model has been widely used in applied econometrics, positive accounting, positive finance and statistics. There is a large literature on Box-Cox transformation model with linear structure. However, there is seldom seen on the discussion for such a model with partially linear structure. Considering the importance of the partially linear model, in this paper, a relatively simple semi-parametric estimation procedure is proposed for the Box-Cox transformation model without presuming the linear functional form and without specifying any parametric form of the disturbance, which largely reduces the risk of model misspecification. We show that the proposed estimator is consistent and asymptotically normally distributed. Its covariance matrix is also in a closed form, which can be easily estimated. Finally, a simulation study is conducted to see the finite sample performance of our estimator.
基金The support of the Fundamental Research Funds from the Central Universities,CHD(Grant No.300102282103)Natural Science Basic Research Program of Shaanxi(Program No.2023-JC-QN-0512)Harbin Institute of Technology(Shenzhen)。
文摘The estimated seismic hazard based on the delineated seismic source model is used as the basis to assign the seismic design loads in Canadian structural design codes.An alternative for the estimation is based on a spatially smoothed source model.However,a quantification of differences in the Canadian seismic hazard maps(CanSHMs)obtained based on the delineated seismic source model and spatially smoothed model is unavailable.The quantification is valuable to identify epistemic uncertainty in the estimated seismic hazard and the degree of uncertainty in the CanSHMs.In the present study,we developed seismic source models using spatial smoothing and historical earthquake catalogue.We quantified the differences in the estimated Canadian seismic hazard by considering the delineated source model and spatially smoothed source models.For the development of the spatially smoothed seismic source models,we considered spatial kernel smoothing techniques with or without adaptive bandwidth.The results indicate that the use of the delineated seismic source model could lead to under or over-estimation of the seismic hazard as compared to those estimated based on spatially smoothed seismic source models.This suggests that an epistemic uncertainty caused by the seismic source models should be considered to map the seismic hazard.
基金supported by the National Natural Science Foundation of China under Grant No.71271128the State Key Program of National Natural Science Foundation of China under Grant No.71331006+4 种基金NCMISKey Laboratory of RCSDSCAS and IRTSHUFEPCSIRT(IRT13077)supported by Graduate Innovation Fund of Shanghai University of Finance and Economics under Grant No.CXJJ-2011-429
文摘It is of great interest to estimate quantile residual lifetime in medical science and many other fields. In survival analysis, Kaplan-Meier(K-M) estimator has been widely used to estimate the survival distribution. However, it is well-known that the K-M estimator is not continuous, thus it can not always be used to calculate quantile residual lifetime. In this paper, the authors propose a kernel smoothing method to give an estimator of quantile residual lifetime. By using modern empirical process techniques, the consistency and the asymptotic normality of the proposed estimator are provided neatly.The authors also present the empirical small sample performances of the estimator. Deficiency is introduced to compare the performance of the proposed estimator with the naive unsmoothed estimator of the quantile residaul lifetime. Further simulation studies indicate that the proposed estimator performs very well.
基金supported by the National Natural Science Foundation of China(No.11971362,12101256)。
文摘As biological studies become more expensive to conduct,it is a frequently encountered question that how to take advantage of the available auxiliary covariate information when the exposure variable is not measured.In this paper,we propose an induced cure rate mean residual life time regression model to accommodate the survival data with cure fraction and auxiliary covariate,in which the exposure variable is only assessed in a validation set,but a corresponding continuous auxiliary covariate is ascertained for all subjects in the study cohort.Simulation studies elucidate the practical performance of the proposed method under finite samples.As an illustration,we apply the proposed method to a heart disease data from the Study of Left Ventricular Dysfunction.
基金supported by National Natural Science Foundation of China(Grant Nos.11771431,11690015,11926341,11601080 and 11671275)Key Laboratory of Random Complex Structures and Data Science,Chinese Academy of Sciences(Grant No.2008DP173182)the Fundamental Research Funds for the Central Universities in University of International Business and Economics(Grant No.CXTD10-09)。
文摘Missing covariate data arise frequently in biomedical studies.In this article,we propose a class of weighted estimating equations for the additive hazards regression model when some of the covariates are missing at random.Time-specific and subject-specific weights are incorporated into the formulation of weighted estimating equations.Unified results are established for estimating selection probabilities that cover both parametric and non-parametric modelling schemes.The resulting estimators have closed forms and are shown to be consistent and asymptotically normal.Simulation studies indicate that the proposed estimators perform well for practical settings.An application to a mouse leukemia study is illustrated.
基金The research work was supported by the natural science fund for colleges and universities in Jiangsu province(Nos.15KJB460016 and 14KJB460029)the major industrial technology project in Xuzhou city(No.KC16GZ015)+1 种基金the major industrial technology project in Jiangsu Province(No.BE2016047)the natural science foundation of China(No.71561016).The author would also like to gratefully acknowledge Professor Fugee Tsung and the other colleagues at Hong Kong University of Science and Technology for their valuable comments.
文摘Dimensional variation analysis in multistation manufacturing processes(MMPs)is a challenging research topic with great practical significance.Researchers have been focused on constructing various mathematical models to identify the correlations among the huge amounts of collected production data.However,current models have achieved insufficient insights into the variation correlation laws due to the complexity of the data’s mutual relations.In this study,a data-driven modeling method is developed for deep data-mining and dimensional variation analysis.The proposed initial mathematical expression originates from practical engineering knowledge.Through a mathematical treatment,the mathematical expression is transformed into a first-order AR(1)model format,which contains multiple dimensional variations’interstation and temporal correlating information.To obtain this information,the estimation of the proposed model is discussed in detail.A simulation case involving two key product characteristics of a grinding process is used to demonstrate the effectiveness and accuracy of the proposed method for dimensional variation analysis in MMPs.