China implemented the public hospital reform in 2012. This study utilized bootstrapping data envelopment analysis(DEA) to evaluate the technical efficiency(TE) and productivity of county public hospitals in Easter...China implemented the public hospital reform in 2012. This study utilized bootstrapping data envelopment analysis(DEA) to evaluate the technical efficiency(TE) and productivity of county public hospitals in Eastern, Central, and Western China after the 2012 public hospital reform. Data from 127 county public hospitals(39, 45, and 43 in Eastern, Central, and Western China, respectively) were collected during 2012–2015. Changes of TE and productivity over time were estimated by bootstrapping DEA and bootstrapping Malmquist. The disparities in TE and productivity among public hospitals in the three regions of China were compared by Kruskal–Wallis H test and Mann–Whitney U test. The average bias-corrected TE values for the four-year period were 0.6442, 0.5785, 0.6099, and 0.6094 in Eastern, Central, and Western China, and the entire country respectively, with average non-technical efficiency, low pure technical efficiency(PTE), and high scale efficiency found. Productivity increased by 8.12%, 0.25%, 12.11%, and 11.58% in China and its three regions during 2012–2015, and such increase in productivity resulted from progressive technological changes by 16.42%, 6.32%, 21.08%, and 21.42%, respectively. The TE and PTE of the county hospitals significantly differed among the three regions of China. Eastern and Western China showed significantly higher TE and PTE than Central China. More than 60% of county public hospitals in China and its three areas operated at decreasing return scales. There was a considerable space for TE improvement in county hospitals in China and its three regions. During 2012–2015, the hospitals experienced progressive productivity; however, the PTE changed adversely. Moreover, Central China continuously achieved a significantly lower efficiency score than Eastern and Western China. Decision makers and administrators in China should identify the causes of the observed inefficiencies and take appropriate measures to increase the efficiency of county public hospitals in the three areas of China, especially in Central China.展开更多
A new approach of relation extraction is described in this paper. It adopts a bootstrap- ping model with a novel iteration strategy, which generates more precise examples of specific relation. Compared with previous m...A new approach of relation extraction is described in this paper. It adopts a bootstrap- ping model with a novel iteration strategy, which generates more precise examples of specific relation. Compared with previous methods, the proposed method has three main advantages: first, it needs less manual intervention; second, more abundant and reasonable information are introduced to represent a relation pattern; third, it reduces the risk of circular dependency occurrence in bootstrapping. Scalable evaluation methodology and metrics are developed for our task with comparable techniques over TianWang 100G corpus. The experimental results show that it can get 90% precision and have excellent expansibility.展开更多
Purpose: Our study proposes a bootstrapping-based method to automatically extract data- usage statements from academic texts. Design/methodology/approach: The method for data-usage statements extraction starts with ...Purpose: Our study proposes a bootstrapping-based method to automatically extract data- usage statements from academic texts. Design/methodology/approach: The method for data-usage statements extraction starts with seed entities and iteratively learns patterns and data-usage statements from unlabeled text. In each iteration, new patterns are constructed and added to the pattern list based on their calculated score. Three seed-selection strategies are also proposed in this paper. Findings: The performance of the method is verified by means of experiments on real data collected from computer science journals. The results show that the method can achieve satisfactory performance regarding precision of extraction and extensibility of obtained patterns. Research limitations: While the triple representation of sentences is effective and efficient for extracting data-usage statements, it is unable to handle complex sentences. Additional features that can address complex sentences should thus be explored in the future. Practical implications: Data-usage statements extraction is beneficial for data-repository construction and facilitates research on data-usage tracking, dataset-based scholar search, and dataset evaluation. Originality/value: To the best of our knowledge, this paper is among the first to address the important task of automatically extracting data-usage statements from real data.展开更多
This paper aims to detect the short-term as well as long-term change point in the surface air temperature time series for Asansol weather observation station, West Bengal, India. Temperature data for the period from 1...This paper aims to detect the short-term as well as long-term change point in the surface air temperature time series for Asansol weather observation station, West Bengal, India. Temperature data for the period from 1941 to 2010 of the said weather observatory have been collected from Indian Meteorological Department, Kolkata. Variations and trends of annual mean temperature, annual mean maximum temperature and annual minimum temperature time series were examined. The cumulative sum charts (CUSUM) and bootstrapping were used for the detection of abrupt changes in the time series data set. Statistically significant abrupt changes and trends have been detected. The major change point in the annual mean temperatures occurred around 1986 (0.57°C) at the period of 25 years in the long-term regional scale. On the other side, the annual mean maximum and annual mean minimum temperatures have distinct change points at level 1. There are abrupt changes in the year 1961 (Confidence interval 1961, 1963) for the annual mean maximum and 1994 (Confidence interval 1993, 1996) for the annual mean minimum temperatures at a confidence level of 100% and 98%, respectively. Before the change, the annual mean maximum and annual mean minimum temperatures were 30.90°C and 23.99°C, respectively, while after the change, the temperatures became 33.93°C and 24.84°C, respectively. Over the entire period of consideration (1941-2010), 11 forward and backward changes were found in total. Out of 11, there are 3 changes (1961, 1986 and 2001) in annual mean temperatures, 4 changes (1957, 1961, 1980 and 1994) in annual mean maximum temperatures, and rest 4 changes (1968, 1981, 1994 and 2001) are associated with annual mean minimum temperature data set.展开更多
In order to predict the extreme load of the mechanical components during the entire life,an automatic method based on the bootstrapping technology(BT)is proposed to determine the most suitable threshold.Based on all t...In order to predict the extreme load of the mechanical components during the entire life,an automatic method based on the bootstrapping technology(BT)is proposed to determine the most suitable threshold.Based on all the turning points of the load history and a series of thresholds estimated in advance,the generalized Pareto distribution is established to fit the exceedances.The corresponding distribution parameters are estimated with the maximum likelihood method.Then,BT is employed to calculate the mean squared error(MSE)of each estimated threshold based on the exceedances and the specific distribution parameters.Finally,the threshold with the smallest MSE will be the optimal one.Compared to the kurtosis method and the mean excess function method,the average deviation of the probability density function of exceedances determined by BT reduces by 38.52%and 29.25%,respectively.Moreover,the quantile-quantile plot of the exceedances determined by BT is closer to a straight line.The results suggest the improvement of the modeling flexibility and the determined threshold precision.If the exceedances are insufficient,BT will enlarge their amount by resampling to solve the instability problem of the original distribution parameters.展开更多
The selection and optimization of model filters affect the precision of motion pattern identification and state estimation in maneuvering target tracking directly.Aiming at improving performance of model filters,a nov...The selection and optimization of model filters affect the precision of motion pattern identification and state estimation in maneuvering target tracking directly.Aiming at improving performance of model filters,a novel maneuvering target tracking algorithm based on central difference Kalman filter in observation bootstrapping strategy is proposed.The framework of interactive multiple model(IMM) is used to realize identification of motion pattern,and a central difference Kalman filter(CDKF) is selected as the model filter of IMM.Considering the advantage of multi-sensor fusion method in improving the stability and reliability of observation information,the hardware cost of the observation system for multiple sensors is adopted,meanwhile,according to the data assimilation technique in Ensemble Kalman filter(En KF),a bootstrapping observation set is constructed by integrating the latest observation and the prior information of observation noise.On that basis,these bootstrapping observations are reasonably used to optimize the filtering performance of CDKF by means of weight fusion way.The object of new algorithm is to improve the tracking precision of observed target by the multi-sensor fusion method without increasing the number of physical sensors.The theoretical analysis and experimental results show the feasibility and efficiency of the proposed algorithm.展开更多
Discrete software reliability measurement has a proper characteristic for describing a software reliability growth process which depends on a unit of the software fault-detection period, such as the number of test run...Discrete software reliability measurement has a proper characteristic for describing a software reliability growth process which depends on a unit of the software fault-detection period, such as the number of test runs, the number of executed test cases. This paper discusses discrete software reliability measurement based on a discretized nonhomogeneous Poisson process (NHPP) model. Especially, we use a bootstrapping method in our discrete software reliability measurement for discussing the statistical inference on parameters and software reliability assessment measures of our model. Finally we show numerical examples of interval estimations based on our bootstrapping method for the several software reliability assessment measures by using actual data.展开更多
Objective: To solve the problem of parameter estimate in the regression analysis of non-random sample. Methods: Calculating residuals according to the regression function based on original data. Modifying residuals an...Objective: To solve the problem of parameter estimate in the regression analysis of non-random sample. Methods: Calculating residuals according to the regression function based on original data. Modifying residuals and correcting them with mean. Adding mean-corrected residuals on original response and bootstrapping them to get 1000 samples. Fitting regression functions of 1000 resampling samples and calculating the 2.5th percentile and 97.5th percentile of corresponding coefficient. Results: The interval estimates deriving from bootstrap method had more statistical significance than that from usual method. Conclusion: Bootstrapping a regression with residuals is a valid method for estimating parameter in regression analysis.展开更多
基金supported by the National Natural Science Foundation of China(No.71473099)
文摘China implemented the public hospital reform in 2012. This study utilized bootstrapping data envelopment analysis(DEA) to evaluate the technical efficiency(TE) and productivity of county public hospitals in Eastern, Central, and Western China after the 2012 public hospital reform. Data from 127 county public hospitals(39, 45, and 43 in Eastern, Central, and Western China, respectively) were collected during 2012–2015. Changes of TE and productivity over time were estimated by bootstrapping DEA and bootstrapping Malmquist. The disparities in TE and productivity among public hospitals in the three regions of China were compared by Kruskal–Wallis H test and Mann–Whitney U test. The average bias-corrected TE values for the four-year period were 0.6442, 0.5785, 0.6099, and 0.6094 in Eastern, Central, and Western China, and the entire country respectively, with average non-technical efficiency, low pure technical efficiency(PTE), and high scale efficiency found. Productivity increased by 8.12%, 0.25%, 12.11%, and 11.58% in China and its three regions during 2012–2015, and such increase in productivity resulted from progressive technological changes by 16.42%, 6.32%, 21.08%, and 21.42%, respectively. The TE and PTE of the county hospitals significantly differed among the three regions of China. Eastern and Western China showed significantly higher TE and PTE than Central China. More than 60% of county public hospitals in China and its three areas operated at decreasing return scales. There was a considerable space for TE improvement in county hospitals in China and its three regions. During 2012–2015, the hospitals experienced progressive productivity; however, the PTE changed adversely. Moreover, Central China continuously achieved a significantly lower efficiency score than Eastern and Western China. Decision makers and administrators in China should identify the causes of the observed inefficiencies and take appropriate measures to increase the efficiency of county public hospitals in the three areas of China, especially in Central China.
基金Supported by the National Natural Science Foundation of China (No.60503072, No.60575042 and No.60435020).
文摘A new approach of relation extraction is described in this paper. It adopts a bootstrap- ping model with a novel iteration strategy, which generates more precise examples of specific relation. Compared with previous methods, the proposed method has three main advantages: first, it needs less manual intervention; second, more abundant and reasonable information are introduced to represent a relation pattern; third, it reduces the risk of circular dependency occurrence in bootstrapping. Scalable evaluation methodology and metrics are developed for our task with comparable techniques over TianWang 100G corpus. The experimental results show that it can get 90% precision and have excellent expansibility.
基金supported by the National Natural Science Foundation of China (Grant No.:71473183)
文摘Purpose: Our study proposes a bootstrapping-based method to automatically extract data- usage statements from academic texts. Design/methodology/approach: The method for data-usage statements extraction starts with seed entities and iteratively learns patterns and data-usage statements from unlabeled text. In each iteration, new patterns are constructed and added to the pattern list based on their calculated score. Three seed-selection strategies are also proposed in this paper. Findings: The performance of the method is verified by means of experiments on real data collected from computer science journals. The results show that the method can achieve satisfactory performance regarding precision of extraction and extensibility of obtained patterns. Research limitations: While the triple representation of sentences is effective and efficient for extracting data-usage statements, it is unable to handle complex sentences. Additional features that can address complex sentences should thus be explored in the future. Practical implications: Data-usage statements extraction is beneficial for data-repository construction and facilitates research on data-usage tracking, dataset-based scholar search, and dataset evaluation. Originality/value: To the best of our knowledge, this paper is among the first to address the important task of automatically extracting data-usage statements from real data.
文摘This paper aims to detect the short-term as well as long-term change point in the surface air temperature time series for Asansol weather observation station, West Bengal, India. Temperature data for the period from 1941 to 2010 of the said weather observatory have been collected from Indian Meteorological Department, Kolkata. Variations and trends of annual mean temperature, annual mean maximum temperature and annual minimum temperature time series were examined. The cumulative sum charts (CUSUM) and bootstrapping were used for the detection of abrupt changes in the time series data set. Statistically significant abrupt changes and trends have been detected. The major change point in the annual mean temperatures occurred around 1986 (0.57°C) at the period of 25 years in the long-term regional scale. On the other side, the annual mean maximum and annual mean minimum temperatures have distinct change points at level 1. There are abrupt changes in the year 1961 (Confidence interval 1961, 1963) for the annual mean maximum and 1994 (Confidence interval 1993, 1996) for the annual mean minimum temperatures at a confidence level of 100% and 98%, respectively. Before the change, the annual mean maximum and annual mean minimum temperatures were 30.90°C and 23.99°C, respectively, while after the change, the temperatures became 33.93°C and 24.84°C, respectively. Over the entire period of consideration (1941-2010), 11 forward and backward changes were found in total. Out of 11, there are 3 changes (1961, 1986 and 2001) in annual mean temperatures, 4 changes (1957, 1961, 1980 and 1994) in annual mean maximum temperatures, and rest 4 changes (1968, 1981, 1994 and 2001) are associated with annual mean minimum temperature data set.
基金The National Science and Technology Pillar Program of China(No.2015BAF07B00)
文摘In order to predict the extreme load of the mechanical components during the entire life,an automatic method based on the bootstrapping technology(BT)is proposed to determine the most suitable threshold.Based on all the turning points of the load history and a series of thresholds estimated in advance,the generalized Pareto distribution is established to fit the exceedances.The corresponding distribution parameters are estimated with the maximum likelihood method.Then,BT is employed to calculate the mean squared error(MSE)of each estimated threshold based on the exceedances and the specific distribution parameters.Finally,the threshold with the smallest MSE will be the optimal one.Compared to the kurtosis method and the mean excess function method,the average deviation of the probability density function of exceedances determined by BT reduces by 38.52%and 29.25%,respectively.Moreover,the quantile-quantile plot of the exceedances determined by BT is closer to a straight line.The results suggest the improvement of the modeling flexibility and the determined threshold precision.If the exceedances are insufficient,BT will enlarge their amount by resampling to solve the instability problem of the original distribution parameters.
基金Supported by the Postdoctoral Science Foundation of China(No.2014M551999)the Open Foundation of Key Laboratory of Spectral Imaging Technology of the Chinese Academy of Sciences(No.LSIT201711D)
文摘The selection and optimization of model filters affect the precision of motion pattern identification and state estimation in maneuvering target tracking directly.Aiming at improving performance of model filters,a novel maneuvering target tracking algorithm based on central difference Kalman filter in observation bootstrapping strategy is proposed.The framework of interactive multiple model(IMM) is used to realize identification of motion pattern,and a central difference Kalman filter(CDKF) is selected as the model filter of IMM.Considering the advantage of multi-sensor fusion method in improving the stability and reliability of observation information,the hardware cost of the observation system for multiple sensors is adopted,meanwhile,according to the data assimilation technique in Ensemble Kalman filter(En KF),a bootstrapping observation set is constructed by integrating the latest observation and the prior information of observation noise.On that basis,these bootstrapping observations are reasonably used to optimize the filtering performance of CDKF by means of weight fusion way.The object of new algorithm is to improve the tracking precision of observed target by the multi-sensor fusion method without increasing the number of physical sensors.The theoretical analysis and experimental results show the feasibility and efficiency of the proposed algorithm.
文摘Discrete software reliability measurement has a proper characteristic for describing a software reliability growth process which depends on a unit of the software fault-detection period, such as the number of test runs, the number of executed test cases. This paper discusses discrete software reliability measurement based on a discretized nonhomogeneous Poisson process (NHPP) model. Especially, we use a bootstrapping method in our discrete software reliability measurement for discussing the statistical inference on parameters and software reliability assessment measures of our model. Finally we show numerical examples of interval estimations based on our bootstrapping method for the several software reliability assessment measures by using actual data.
文摘Objective: To solve the problem of parameter estimate in the regression analysis of non-random sample. Methods: Calculating residuals according to the regression function based on original data. Modifying residuals and correcting them with mean. Adding mean-corrected residuals on original response and bootstrapping them to get 1000 samples. Fitting regression functions of 1000 resampling samples and calculating the 2.5th percentile and 97.5th percentile of corresponding coefficient. Results: The interval estimates deriving from bootstrap method had more statistical significance than that from usual method. Conclusion: Bootstrapping a regression with residuals is a valid method for estimating parameter in regression analysis.