BACKGROUND Hepatitis B(HB)and hepatitis C(HC)place the largest burden in China,and a goal of eliminating them as a major public health threat by 2030 has been set.Making more informed and accurate forecasts of their s...BACKGROUND Hepatitis B(HB)and hepatitis C(HC)place the largest burden in China,and a goal of eliminating them as a major public health threat by 2030 has been set.Making more informed and accurate forecasts of their spread is essential for developing effective strategies,heightening the requirement for early warning to deal with such a major public health threat.AIM To monitor HB and HC epidemics by the design of a paradigmatic seasonal autoregressive fractionally integrated moving average(SARFIMA)for projections into 2030,and to compare the effectiveness with the seasonal autoregressive integrated moving average(SARIMA).METHODS Monthly HB and HC incidence cases in China were obtained from January 2004 to June 2023.Descriptive analysis and the Hodrick-Prescott method were employed to identify trends and seasonality.Two periods(from January 2004 to June 2022 and from January 2004 to December 2015,respectively)were used as the training sets to develop both models,while the remaining periods served as the test sets to evaluate the forecasting accuracy.RESULTS There were incidents of 23400874 HB cases and 3590867 HC cases from January 2004 to June 2023.Overall,HB remained steady[average annual percentage change(AAPC)=0.44,95%confidence interval(95%CI):-0.94-1.84]while HC was increasing(AAPC=8.91,95%CI:6.98-10.88),and both had a peak in March and a trough in February.In the 12-step-ahead HB forecast,the mean absolute deviation(15211.94),root mean square error(18762.94),mean absolute percentage error(0.17),mean error rate(0.15),and root mean square percentage error(0.25)under the best SARFIMA(3,0,0)(0,0.449,2)12 were smaller than those under the best SARIMA(3,0,0)(0,1,2)12(16867.71,20775.12,0.19,0.17,and 0.27,respectively).Similar results were also observed for the 90-step-ahead HB,12-step-ahead HC,and 90-step-ahead HC forecasts.The predicted HB incidents totaled 9865400(95%CI:7508093-12222709)cases and HC totaled 1659485(95%CI:856681-2462290)cases during 2023-2030.CONCLUSION Under current interventions,China faces enormous challenges to eliminate HB and HC epidemics by 2030,and effective strategies must be reinforced.The integration of SARFIMA into public health for the management of HB and HC epidemics can potentially result in more informed and efficient interventions,surpassing the capabilities of SARIMA.展开更多
The price prediction task is a well-studied problem due to its impact on the business domain.There are several research studies that have been conducted to predict the future price of items by capturing the patterns o...The price prediction task is a well-studied problem due to its impact on the business domain.There are several research studies that have been conducted to predict the future price of items by capturing the patterns of price change,but there is very limited work to study the price prediction of seasonal goods(e.g.,Christmas gifts).Seasonal items’prices have different patterns than normal items;this can be linked to the offers and discounted prices of seasonal items.This lack of research studies motivates the current work to investigate the problem of seasonal items’prices as a time series task.We proposed utilizing two different approaches to address this problem,namely,1)machine learning(ML)-based models and 2)deep learning(DL)-based models.Thus,this research tuned a set of well-known predictive models on a real-life dataset.Those models are ensemble learning-based models,random forest,Ridge,Lasso,and Linear regression.Moreover,two new DL architectures based on gated recurrent unit(GRU)and long short-term memory(LSTM)models are proposed.Then,the performance of the utilized ensemble learning and classic ML models are compared against the proposed two DL architectures on different accuracy metrics,where the evaluation includes both numerical and visual comparisons of the examined models.The obtained results show that the ensemble learning models outperformed the classic machine learning-based models(e.g.,linear regression and random forest)and the DL-based models.展开更多
Time series analysis is a valuable tool in epidemiology that complements the classical epidemiological models in two different ways:Prediction and forecast.Prediction is related to explaining past and current data bas...Time series analysis is a valuable tool in epidemiology that complements the classical epidemiological models in two different ways:Prediction and forecast.Prediction is related to explaining past and current data based on various internal and external influences that may or may not have a causative role.Forecasting is an exploration of the possible future values based on the predictive ability of the model and hypothesized future values of the external and/or internal influences.The time series analysis approach has the advantage of being easier to use(in the cases of more straightforward and linear models such as Auto-Regressive Integrated Moving Average).Still,it is limited in forecasting time,unlike the classical models such as Susceptible-Exposed-Infectious-Removed.Its applicability in forecasting comes from its better accuracy for short-term prediction.In its basic form,it does not assume much theoretical knowledge of the mechanisms of spreading and mutating pathogens or the reaction of people and regulatory structures(governments,companies,etc.).Instead,it estimates from the data directly.Its predictive ability allows testing hypotheses for different factors that positively or negatively contribute to the pandemic spread;be it school closures,emerging variants,etc.It can be used in mortality or hospital risk estimation from new cases,seroprevalence studies,assessing properties of emerging variants,and estimating excess mortality and its relationship with a pandemic.展开更多
Rapidly spreading COVID-19 virus and its variants, especially in metropolitan areas around the world, became a major health public concern. The tendency of COVID-19 pandemic and statistical modelling represents an urg...Rapidly spreading COVID-19 virus and its variants, especially in metropolitan areas around the world, became a major health public concern. The tendency of COVID-19 pandemic and statistical modelling represents an urgent challenge in the United States for which there are few solutions. In this paper, we demonstrate combining Fourier terms for capturing seasonality with ARIMA errors and other dynamics in the data. Therefore, we have analyzed 156 weeks COVID-19 dataset on national level using Dynamic Harmonic Regression model, including simulation analysis and accuracy improvement from 2020 to 2023. Most importantly, we provide new advanced pathways which may serve as targets for developing new solutions and approaches.展开更多
Aiming at the problem of on-line damage diagnosis in structural health monitoring (SHM), an algorithm of feature extraction and damage alarming based on auto-regressive moving-average (ARMA) time series analysis i...Aiming at the problem of on-line damage diagnosis in structural health monitoring (SHM), an algorithm of feature extraction and damage alarming based on auto-regressive moving-average (ARMA) time series analysis is presented. The monitoring data were first modeled as ARMA models, while a principalcomponent matrix derived from the AR coefficients of these models was utilized to establish the Mahalanobisdistance criterion functions. Then, a new damage-sensitive feature index DDSF is proposed. A hypothesis test involving the t-test method is further applied to obtain a decision of damage alarming as the mean value of DDSF had significantly changed after damage. The numerical results of a three-span-girder model shows that the defined index is sensitive to subtle structural damage, and the proposed algorithm can be applied to the on-line damage alarming in SHM.展开更多
In this paper, we propose a cross reference method for nonlinear time series analyzing in semi blind case, that is, the dynamic equations modeling the time series are known but the corresponding parameters are not. ...In this paper, we propose a cross reference method for nonlinear time series analyzing in semi blind case, that is, the dynamic equations modeling the time series are known but the corresponding parameters are not. The tasks of noise reduction and parameter estimation which were fulfilled separately before are combined iteratively. With the positive interaction between the two processing modules, the method is somewhat superior. Some prior work can be viewed as special cases of this general framework. The simulations for noise reduction and parameter estimation of contaminated chaotic time series show improved performance of our method compared with previous work.展开更多
This paper presents a novel approach to identify and correct the gross errors in the microelectromechanical system (MEMS) gyroscope used in ground vehicles by means of time series analysis. According to the characte...This paper presents a novel approach to identify and correct the gross errors in the microelectromechanical system (MEMS) gyroscope used in ground vehicles by means of time series analysis. According to the characteristics of autocorrelation function (ACF) and partial autocorrelation function (PACF), an autoregressive integrated moving average (ARIMA) model is roughly constructed. The rough model is optimized by combining with Akaike's information criterion (A/C), and the parameters are estimated based on the least squares algorithm. After validation testing, the model is utilized to forecast the next output on the basis of the previous measurement. When the difference between the measurement and its prediction exceeds the defined threshold, the measurement is identified as a gross error and remedied by its prediction. A case study on the yaw rate is performed to illustrate the developed algorithm. Experimental results demonstrate that the proposed approach can effectively distinguish gross errors and make some reasonable remedies.展开更多
In order to study the hydrodynamic characteristics of the karst aquifers in northern China,time series analyses(correlation and spectral analysis in addition with hydrograph recession analysis)are applied on Baotu Spr...In order to study the hydrodynamic characteristics of the karst aquifers in northern China,time series analyses(correlation and spectral analysis in addition with hydrograph recession analysis)are applied on Baotu Spring and Heihu Spring in Jinan karst spring system,a typical karst spring system in northern China.Results show that the auto-correlation coefficient of spring water level reaches the value of 0.2 after 123 days and 117 days for Baotu Spring and Heihu Spring,respectively.The regulation time obtained from the simple spectral density function in the same period is 187 days and 175 days for Baotu Spring and Heihu Spring.The auto-correlation coefficient of spring water level reaches the value of 0.2 in 34-82 days,and regulation time ranges among 40-59 days for every single hydrological year.The delay time between precipitation and spring water level obtained from cross correlation function is around 56 days for the period of 2012-2019,and varies among 30-79 days for every single hydrological year.In addition,the spectral bands in cross amplitude functions and gain functions are small with 0.02,and the values in the coherence functions are small.All these behaviors illustrate that Jinan karst spring system has a strong memory effect,large storage capacity,noticeable regulation effect,and time series analysis is a useful tool for studying the hydrodynamic characteristics of karst spring system in northern China.展开更多
Purpose: One of the main indicators of scientific production is the number of papers published in scholarly journals. Turkey ranks 18th place in the world based on the number of scholarly publications. The objective ...Purpose: One of the main indicators of scientific production is the number of papers published in scholarly journals. Turkey ranks 18th place in the world based on the number of scholarly publications. The objective of this paper is to find out if the monetary support program initiated in 1993 by the Turkish Scientific and Technological Research Council (TUBITAK) to incentivize researchers and increase the number, impact, and quality of international publications has been effective in doing so.Design/methodology/approach: We analyzed some 390,000 publications with Turkish affiliations listed in the Web of Science (WoS) database between 1976 and 2015 along with about 157,000 supported ones between 1997 and 2015. We used the interrupted time series (ITS) analysis technique (also known as "quasi-experimental time series analysis" or "intervention analysis") to test if TOBITAK's support program helped increase the number of publications. We defined ARIMA (1,1,0) model for ITS data and observed the impact of TOBiTAK's support program in 1994, 1997, and 2003 (after one, four and 10 years of its start, respectively). The majority of publications (93%) were full papers (articles), which were used as the experimental group while other types of contributions functioned as the control group. We also carried out a multiple regression analysis.Findings: TUBITAK's support program has had negligible effect on the increase of the number of papers with Turkish affiliations. Yet, the number of other types of contributions continued to increase even though they were not well supported, suggesting that TUBITAK's support program is probably not the main factor causing the increase in the number of papers with Turkish affiliations. Research limitations: Interrupted time series analysis shows if the "intervention" has had any significant effect on the dependent variable but it does not explain what caused the increase in the number of papers if it was not the intervention. Moreover, except the"intervention", other "event(s)" that might affect the time series data (e.g., increase in the number of research personnel over the years) should not occur during the period of analysis, a prerequisite that is beyond the control of the researcher. Practical implications: TUBITAK's "cash-for-publication" program did not seem to have direct impact on the increase of the number of papers published by Turkish authors, suggesting that small amounts of payments are not much of an incentive for authors to publish more. It might perhaps be a better strategy to concentrate limited resources on a few high impact projects rather than to disperse them to thousands of authors as "micropayments." Originality/value: Based on 25 years' worth of payments data, this is perhaps one of the first large-scale studies showing that "cash-for-publication" policies or "piece rates" paid to researchers tend to have little or no effect on the increase of researchers' productivity. The main finding of this paper has some implications for countries wherein publication subsidies are used as an incentive to increase the number and quality of papers published in international journals. They should be prepared to consider reviewing their existing support programs (based usually on bibliometric measures such as journal impact factors) and revising their reward policies.展开更多
FMS is a sort of highly automatic machining system, how to ensure partquality is master key to system highly active running. At first, series of machining dimension andprocess capability of flexible manufacturing syst...FMS is a sort of highly automatic machining system, how to ensure partquality is master key to system highly active running. At first, series of machining dimension andprocess capability of flexible manufacturing system(FMS), is analyzed. Result of its, strongself-correlation of data series shows that time series analysis is applicable to data seriesanalyzed. Based on-line modeling and forecasting for data series, principle and method of feedbackcompensation control is proposed. On a foundation of the virtual instrument platform, Labview ofnational instrument (NI), FMS dimension and process capability monitoring system(monitoring system)is developed. In practice, it is proved that part quality and process capability of FMS are greatlyimproved.展开更多
The paper builds up the forecasting model of air temperature according to the data (1994~1998) of Fu Jin area.At the same time,the writer inquires into the relation of water requirement of well irrigation rice (ET) a...The paper builds up the forecasting model of air temperature according to the data (1994~1998) of Fu Jin area.At the same time,the writer inquires into the relation of water requirement of well irrigation rice (ET) and average air temperature (T).Furthermore,the rice irrigation water requirement (ET) of Fu Jin area has been forecast in 1999.Thus,we can apply the model in irrigation management.展开更多
Sea water intrusion is an environmental problem cause by the irrational exploitation of coastal groundwater resources and has attracted the attention of many coastal countries.In this study,we used time series monitor...Sea water intrusion is an environmental problem cause by the irrational exploitation of coastal groundwater resources and has attracted the attention of many coastal countries.In this study,we used time series monitoring data of groundwater levels and tidal waves to analyze the influence of tide flow on groundwater dynamics in the southern Laizhou Bay.The auto-correlation and cross-correlation coefficients between groundwater level and tidal wave level were calculated specifically to measure the boundary conditions along the coastline.In addition,spectrum analysis was employed to assess the periodicity and hysteresis of various tide and groundwater level fluctuations.The results of time series analysis show that groundwater level fluctuation is noticeably influenced by tides,but the influence is limited to a certain distance and cannot reach the saltwater-freshwater interface in the southern Laizhou Bay.There are three main periodic components of groundwater level in tidal effect range(i.e.23.804 h,12.500 h and 12.046 h),the pattern of which is the same as the tides.The affected groundwater level fluctuations lag behind the tides.The dynamic analysis of groundwater indicates that the coastal aquifer has a hydraulic connection with seawater but not in a direct way.Owing to the existence of the groundwater mound between the salty groundwater(brine)and fresh groundwater,the maximum influencing distance of the tide on the groundwater is 8.85 km.Considering that the fresh-saline groundwater interface is about 30 km away from the coastline,modern seawater has a limited contribution to sea-salt water intrusion in Laizhou Bay.The results of this study are expected to provide a reference for the study on sea water intrusion.展开更多
Fault management is crucial to pro vi de quality of service grantees for the future networks, and fault identification is an essential part of it. A novel fault identification algorithm is proposed in this paper, wh...Fault management is crucial to pro vi de quality of service grantees for the future networks, and fault identification is an essential part of it. A novel fault identification algorithm is proposed in this paper, which focuses on the anomaly detection of network traffic. Since the fault identification has been achieved using statistical information in mana gement information base, the algorithm is compatible with the existing simple ne twork management protocol framework. The network traffic time series is verified to be non-stationary. By fitting the adaptive autoregressive model, the series is transformed into a multidimensional vector. The training samples and identif iers are acquired from the network simulation. A k-nearest neighbor classif ier identifies the system faults after being trained. The experiment results are consistent with the given fault scenarios, which prove the accuracy of the algo rithm. The identification errors are discussed to illustrate that the novel faul t identification algorithm is adaptive in the fault scenarios with network traff ic change.展开更多
In the context of rapid digitization in industrial environments,how effective are advanced unsupervised learning models,particularly hybrid autoencoder models,at detecting anomalies in industrial control system(ICS)da...In the context of rapid digitization in industrial environments,how effective are advanced unsupervised learning models,particularly hybrid autoencoder models,at detecting anomalies in industrial control system(ICS)datasets?This study is crucial because it addresses the challenge of identifying rare and complex anomalous patterns in the vast amounts of time series data generated by Internet of Things(IoT)devices,which can significantly improve the reliability and safety of these systems.In this paper,we propose a hybrid autoencoder model,called ConvBiLSTMAE,which combines convolutional neural network(CNN)and bidirectional long short-term memory(BiLSTM)to more effectively train complex temporal data patterns in anomaly detection.On the hardware-in-the-loopbased extended industrial control system dataset,the ConvBiLSTM-AE model demonstrated remarkable anomaly detection performance,achieving F1 scores of 0.78 and 0.41 for the first and second datasets,respectively.The results suggest that hybrid autoencoder models are not only viable,but potentially superior alternatives for unsupervised anomaly detection in complex industrial systems,offering a promising approach to improving their reliability and safety.展开更多
Long-term time series forecasting stands as a crucial research domain within the realm of automated machine learning(AutoML).At present,forecasting,whether rooted in machine learning or statistical learning,typically ...Long-term time series forecasting stands as a crucial research domain within the realm of automated machine learning(AutoML).At present,forecasting,whether rooted in machine learning or statistical learning,typically relies on expert input and necessitates substantial manual involvement.This manual effort spans model development,feature engineering,hyper-parameter tuning,and the intricate construction of time series models.The complexity of these tasks renders complete automation unfeasible,as they inherently demand human intervention at multiple junctures.To surmount these challenges,this article proposes leveraging Long Short-Term Memory,which is the variant of Recurrent Neural Networks,harnessing memory cells and gating mechanisms to facilitate long-term time series prediction.However,forecasting accuracy by particular neural network and traditional models can degrade significantly,when addressing long-term time-series tasks.Therefore,our research demonstrates that this innovative approach outperforms the traditional Autoregressive Integrated Moving Average(ARIMA)method in forecasting long-term univariate time series.ARIMA is a high-quality and competitive model in time series prediction,and yet it requires significant preprocessing efforts.Using multiple accuracy metrics,we have evaluated both ARIMA and proposed method on the simulated time-series data and real data in both short and long term.Furthermore,our findings indicate its superiority over alternative network architectures,including Fully Connected Neural Networks,Convolutional Neural Networks,and Nonpooling Convolutional Neural Networks.Our AutoML approach enables non-professional to attain highly accurate and effective time series forecasting,and can be widely applied to various domains,particularly in business and finance.展开更多
Objective:To explore the modeling of time series of animal bite occurrence in northwest Iran.Methods:In this study,we analyzed surveillance time series data for animal bite cases in the northwest Iran province of Iran...Objective:To explore the modeling of time series of animal bite occurrence in northwest Iran.Methods:In this study,we analyzed surveillance time series data for animal bite cases in the northwest Iran province of Iran from 2011 to 2017.We used decomposition methods to explore seasonality and long-term trends and applied the Autoregressive Integrated Moving Average(ARIMA)model to fit a univariate time series of animal bite incidence.The ARIMA modeling process involved selecting the time series,transforming the series,selecting the appropriate model,estimating parameters,and forecasting.Results:Our results using the Box Jenkins model showed a significant seasonal trend and an overall increase in animal bite incidents during the study period.The best-fitting model for the available data was a seasonal ARIMA model with drift in the form of ARIMA(2,0,0)(1,1,1).This model can be used to forecast the frequency of animal attacks in northwest Iran over the next two years,suggesting that the incidence of animal attacks in the region would continue to increase during this time frame(2018-2019).Conclusion:Our findings suggest that time series analysis is a useful method for investigating animal bite cases and predicting future occurrences.The existence of a seasonal trend in animal bites can also aid in planning healthcare services during different seasons of the year.Therefore,our study highlights the importance of implementing proactive measures to address the growing issue of animal bites in Iran.展开更多
Aiming to realize fast and accurate fault diagnosis in complex network environment, this article proposes a set of anomaly detection algorithm and intelligent fault diagnosis model. Firstly, a novel anomaly detection ...Aiming to realize fast and accurate fault diagnosis in complex network environment, this article proposes a set of anomaly detection algorithm and intelligent fault diagnosis model. Firstly, a novel anomaly detection algorithm based on time series analysis is put forward to improve the generalized likelihood ratio (GLR) test, and thus, detection accuracy is enhanced and the algorithm complexity is reduced. Secondly, the intelligent fault diagnosis model is established by introducing neural network technology, and thereby, the anomaly information of each node in end-to-end network is integrated and processed in parallel to intelligently diagnose the fault cause. Finally, server backup solution in enterprise information network is taken as the simulation scenario. The results demonstrate that the proposed method can not only detect fault occurrence in time, but can also implement online diagnosis for fault cause, and thus, real-time and intelligent fault management process is achieved.展开更多
In forecasting real time environmental factors,large data is needed to analyse the pattern behind the data values.Air pollution is a major threat towards developing countries and it is proliferating every year.Many me...In forecasting real time environmental factors,large data is needed to analyse the pattern behind the data values.Air pollution is a major threat towards developing countries and it is proliferating every year.Many methods in time ser-ies prediction and deep learning models to estimate the severity of air pollution.Each independent variable contributing towards pollution is necessary to analyse the trend behind the air pollution in that particular locality.This approach selects multivariate time series and coalesce a real time updatable autoregressive model to forecast Particulate matter(PM)PM2.5.To perform experimental analysis the data from the Central Pollution Control Board(CPCB)is used.Prediction is car-ried out for Chennai with seven locations and estimated PM’s using the weighted ensemble method.Proposed method for air pollution prediction unveiled effective and moored performance in long term prediction.Dynamic budge with high weighted k-models are used simultaneously and devising an ensemble helps to achieve stable forecasting.Computational time of ensemble decreases with paral-lel processing in each sub model.Weighted ensemble model shows high perfor-mance in long term prediction when compared to the traditional time series models like Vector Auto-Regression(VAR),Autoregressive Integrated with Mov-ing Average(ARIMA),Autoregressive Moving Average with Extended terms(ARMEX).Evaluation metrics like Root Mean Square Error(RMSE),Mean Absolute Error(MAE)and the time to achieve the time series are compared.展开更多
Objective To recognize the spatial and temporal characteristics of iodine deficiency disorders(IDD),China national IDD surveillance data for the years of 1995–2018 were analyzed.Methods Time series analysis was used ...Objective To recognize the spatial and temporal characteristics of iodine deficiency disorders(IDD),China national IDD surveillance data for the years of 1995–2018 were analyzed.Methods Time series analysis was used to describe and predict the IDD related indicators,and spatial analysis was used to analyze the spatial distribution of salt iodine levels.Results In China,the median urinary iodine concentration increased in 1995–1997,then decreased to adequate levels,and are expected to remain appropriate in 2019–2022.The goiter rate continually decreased and is expected to be maintained at a low level.Since 2002,the coverage rates of iodized salt and the consumption rates of qualified iodized salt(the percentage of qualified iodized salt in all tested salt) increased and began to decline in 2012;they are expected to continue to decrease.Spatial epidemiological analysis indicated a positive spatial correlation in 2016–2018 and revealed feature regarding the spatial distribution of salt related indicators in coastal areas and areas near iodine-excess areas.Conclusions Iodine nutrition in China showed gradual improvements.However,a recent decline has been observed in some areas following changes in the iodized salt supply in China.In the future,more regulations regarding salt management should be issued to strengthen IDD control and prevention measures,and avoid the recurrence of IDD.展开更多
Time-series analysis is important to a wide range of disciplines transcending both the physical and social sciences for proactive policy decisions. Statistical models have sound theoretical basis and have been success...Time-series analysis is important to a wide range of disciplines transcending both the physical and social sciences for proactive policy decisions. Statistical models have sound theoretical basis and have been successfully used in a number of problem domains in time series forecasting. Due to power and flexibility, Box-Jenkins ARIMA model has gained enormous popularity in many areas and research practice for the last three decades. More recently, the neural networks have been shown to be a promising alternative tool for modeling and forecasting owing to their ability to capture the nonlinearity in the data. However, despite the popularity and the superiority of ARIMA and ANN models, the empirical forecasting performance has been rather mixed so that no single method is best in every situation. In this study, a hybrid ARIMA and neural networks model to time series forecasting is proposed. The basic idea behind the model combination is to use each model’s unique features to capture different patterns in the data. With three real data sets, empirical results evidently show that the hybrid model outperforms ARIMA and ANN model noticeably in terms of forecasting accuracy used in isolation.展开更多
基金Supported by the Key Scientific Research Project of Universities in Henan Province,No.21A330004Natural Science Foundation in Henan Province,No.222300420265.
文摘BACKGROUND Hepatitis B(HB)and hepatitis C(HC)place the largest burden in China,and a goal of eliminating them as a major public health threat by 2030 has been set.Making more informed and accurate forecasts of their spread is essential for developing effective strategies,heightening the requirement for early warning to deal with such a major public health threat.AIM To monitor HB and HC epidemics by the design of a paradigmatic seasonal autoregressive fractionally integrated moving average(SARFIMA)for projections into 2030,and to compare the effectiveness with the seasonal autoregressive integrated moving average(SARIMA).METHODS Monthly HB and HC incidence cases in China were obtained from January 2004 to June 2023.Descriptive analysis and the Hodrick-Prescott method were employed to identify trends and seasonality.Two periods(from January 2004 to June 2022 and from January 2004 to December 2015,respectively)were used as the training sets to develop both models,while the remaining periods served as the test sets to evaluate the forecasting accuracy.RESULTS There were incidents of 23400874 HB cases and 3590867 HC cases from January 2004 to June 2023.Overall,HB remained steady[average annual percentage change(AAPC)=0.44,95%confidence interval(95%CI):-0.94-1.84]while HC was increasing(AAPC=8.91,95%CI:6.98-10.88),and both had a peak in March and a trough in February.In the 12-step-ahead HB forecast,the mean absolute deviation(15211.94),root mean square error(18762.94),mean absolute percentage error(0.17),mean error rate(0.15),and root mean square percentage error(0.25)under the best SARFIMA(3,0,0)(0,0.449,2)12 were smaller than those under the best SARIMA(3,0,0)(0,1,2)12(16867.71,20775.12,0.19,0.17,and 0.27,respectively).Similar results were also observed for the 90-step-ahead HB,12-step-ahead HC,and 90-step-ahead HC forecasts.The predicted HB incidents totaled 9865400(95%CI:7508093-12222709)cases and HC totaled 1659485(95%CI:856681-2462290)cases during 2023-2030.CONCLUSION Under current interventions,China faces enormous challenges to eliminate HB and HC epidemics by 2030,and effective strategies must be reinforced.The integration of SARFIMA into public health for the management of HB and HC epidemics can potentially result in more informed and efficient interventions,surpassing the capabilities of SARIMA.
文摘The price prediction task is a well-studied problem due to its impact on the business domain.There are several research studies that have been conducted to predict the future price of items by capturing the patterns of price change,but there is very limited work to study the price prediction of seasonal goods(e.g.,Christmas gifts).Seasonal items’prices have different patterns than normal items;this can be linked to the offers and discounted prices of seasonal items.This lack of research studies motivates the current work to investigate the problem of seasonal items’prices as a time series task.We proposed utilizing two different approaches to address this problem,namely,1)machine learning(ML)-based models and 2)deep learning(DL)-based models.Thus,this research tuned a set of well-known predictive models on a real-life dataset.Those models are ensemble learning-based models,random forest,Ridge,Lasso,and Linear regression.Moreover,two new DL architectures based on gated recurrent unit(GRU)and long short-term memory(LSTM)models are proposed.Then,the performance of the utilized ensemble learning and classic ML models are compared against the proposed two DL architectures on different accuracy metrics,where the evaluation includes both numerical and visual comparisons of the examined models.The obtained results show that the ensemble learning models outperformed the classic machine learning-based models(e.g.,linear regression and random forest)and the DL-based models.
基金Supported by European Union-NextGenerationEU,Through the National Recovery and Resilience Plan of the Republic of Bulgaria,No.BG-RRP-2.004-0008-C01.
文摘Time series analysis is a valuable tool in epidemiology that complements the classical epidemiological models in two different ways:Prediction and forecast.Prediction is related to explaining past and current data based on various internal and external influences that may or may not have a causative role.Forecasting is an exploration of the possible future values based on the predictive ability of the model and hypothesized future values of the external and/or internal influences.The time series analysis approach has the advantage of being easier to use(in the cases of more straightforward and linear models such as Auto-Regressive Integrated Moving Average).Still,it is limited in forecasting time,unlike the classical models such as Susceptible-Exposed-Infectious-Removed.Its applicability in forecasting comes from its better accuracy for short-term prediction.In its basic form,it does not assume much theoretical knowledge of the mechanisms of spreading and mutating pathogens or the reaction of people and regulatory structures(governments,companies,etc.).Instead,it estimates from the data directly.Its predictive ability allows testing hypotheses for different factors that positively or negatively contribute to the pandemic spread;be it school closures,emerging variants,etc.It can be used in mortality or hospital risk estimation from new cases,seroprevalence studies,assessing properties of emerging variants,and estimating excess mortality and its relationship with a pandemic.
文摘Rapidly spreading COVID-19 virus and its variants, especially in metropolitan areas around the world, became a major health public concern. The tendency of COVID-19 pandemic and statistical modelling represents an urgent challenge in the United States for which there are few solutions. In this paper, we demonstrate combining Fourier terms for capturing seasonality with ARIMA errors and other dynamics in the data. Therefore, we have analyzed 156 weeks COVID-19 dataset on national level using Dynamic Harmonic Regression model, including simulation analysis and accuracy improvement from 2020 to 2023. Most importantly, we provide new advanced pathways which may serve as targets for developing new solutions and approaches.
基金The National High Technology Research and Devel-opment Program of China (863Program) (No2006AA04Z416)the National Natural Science Foundation of China (No50538020)
文摘Aiming at the problem of on-line damage diagnosis in structural health monitoring (SHM), an algorithm of feature extraction and damage alarming based on auto-regressive moving-average (ARMA) time series analysis is presented. The monitoring data were first modeled as ARMA models, while a principalcomponent matrix derived from the AR coefficients of these models was utilized to establish the Mahalanobisdistance criterion functions. Then, a new damage-sensitive feature index DDSF is proposed. A hypothesis test involving the t-test method is further applied to obtain a decision of damage alarming as the mean value of DDSF had significantly changed after damage. The numerical results of a three-span-girder model shows that the defined index is sensitive to subtle structural damage, and the proposed algorithm can be applied to the on-line damage alarming in SHM.
文摘In this paper, we propose a cross reference method for nonlinear time series analyzing in semi blind case, that is, the dynamic equations modeling the time series are known but the corresponding parameters are not. The tasks of noise reduction and parameter estimation which were fulfilled separately before are combined iteratively. With the positive interaction between the two processing modules, the method is somewhat superior. Some prior work can be viewed as special cases of this general framework. The simulations for noise reduction and parameter estimation of contaminated chaotic time series show improved performance of our method compared with previous work.
基金The National Natural Science Foundation of China(No.61273236)the Natural Science Foundation of Jiangsu Province(No.BK2010239)the Ph.D.Programs Foundation of Ministry of Education of China(No.200802861061)
文摘This paper presents a novel approach to identify and correct the gross errors in the microelectromechanical system (MEMS) gyroscope used in ground vehicles by means of time series analysis. According to the characteristics of autocorrelation function (ACF) and partial autocorrelation function (PACF), an autoregressive integrated moving average (ARIMA) model is roughly constructed. The rough model is optimized by combining with Akaike's information criterion (A/C), and the parameters are estimated based on the least squares algorithm. After validation testing, the model is utilized to forecast the next output on the basis of the previous measurement. When the difference between the measurement and its prediction exceeds the defined threshold, the measurement is identified as a gross error and remedied by its prediction. A case study on the yaw rate is performed to illustrate the developed algorithm. Experimental results demonstrate that the proposed approach can effectively distinguish gross errors and make some reasonable remedies.
基金This study is supported by the geological survey project:National Glacier and Desertification Remote Sensing Geological Survey(DD20190515)Youth Innovation Fund of China Aero Geophysical Prospecting and Remote Sensing Center for Natural Resources(2020YFL18).
文摘In order to study the hydrodynamic characteristics of the karst aquifers in northern China,time series analyses(correlation and spectral analysis in addition with hydrograph recession analysis)are applied on Baotu Spring and Heihu Spring in Jinan karst spring system,a typical karst spring system in northern China.Results show that the auto-correlation coefficient of spring water level reaches the value of 0.2 after 123 days and 117 days for Baotu Spring and Heihu Spring,respectively.The regulation time obtained from the simple spectral density function in the same period is 187 days and 175 days for Baotu Spring and Heihu Spring.The auto-correlation coefficient of spring water level reaches the value of 0.2 in 34-82 days,and regulation time ranges among 40-59 days for every single hydrological year.The delay time between precipitation and spring water level obtained from cross correlation function is around 56 days for the period of 2012-2019,and varies among 30-79 days for every single hydrological year.In addition,the spectral bands in cross amplitude functions and gain functions are small with 0.02,and the values in the coherence functions are small.All these behaviors illustrate that Jinan karst spring system has a strong memory effect,large storage capacity,noticeable regulation effect,and time series analysis is a useful tool for studying the hydrodynamic characteristics of karst spring system in northern China.
文摘Purpose: One of the main indicators of scientific production is the number of papers published in scholarly journals. Turkey ranks 18th place in the world based on the number of scholarly publications. The objective of this paper is to find out if the monetary support program initiated in 1993 by the Turkish Scientific and Technological Research Council (TUBITAK) to incentivize researchers and increase the number, impact, and quality of international publications has been effective in doing so.Design/methodology/approach: We analyzed some 390,000 publications with Turkish affiliations listed in the Web of Science (WoS) database between 1976 and 2015 along with about 157,000 supported ones between 1997 and 2015. We used the interrupted time series (ITS) analysis technique (also known as "quasi-experimental time series analysis" or "intervention analysis") to test if TOBITAK's support program helped increase the number of publications. We defined ARIMA (1,1,0) model for ITS data and observed the impact of TOBiTAK's support program in 1994, 1997, and 2003 (after one, four and 10 years of its start, respectively). The majority of publications (93%) were full papers (articles), which were used as the experimental group while other types of contributions functioned as the control group. We also carried out a multiple regression analysis.Findings: TUBITAK's support program has had negligible effect on the increase of the number of papers with Turkish affiliations. Yet, the number of other types of contributions continued to increase even though they were not well supported, suggesting that TUBITAK's support program is probably not the main factor causing the increase in the number of papers with Turkish affiliations. Research limitations: Interrupted time series analysis shows if the "intervention" has had any significant effect on the dependent variable but it does not explain what caused the increase in the number of papers if it was not the intervention. Moreover, except the"intervention", other "event(s)" that might affect the time series data (e.g., increase in the number of research personnel over the years) should not occur during the period of analysis, a prerequisite that is beyond the control of the researcher. Practical implications: TUBITAK's "cash-for-publication" program did not seem to have direct impact on the increase of the number of papers published by Turkish authors, suggesting that small amounts of payments are not much of an incentive for authors to publish more. It might perhaps be a better strategy to concentrate limited resources on a few high impact projects rather than to disperse them to thousands of authors as "micropayments." Originality/value: Based on 25 years' worth of payments data, this is perhaps one of the first large-scale studies showing that "cash-for-publication" policies or "piece rates" paid to researchers tend to have little or no effect on the increase of researchers' productivity. The main finding of this paper has some implications for countries wherein publication subsidies are used as an incentive to increase the number and quality of papers published in international journals. They should be prepared to consider reviewing their existing support programs (based usually on bibliometric measures such as journal impact factors) and revising their reward policies.
基金This project is supported by Weaponry Advanced Fund Item of China (No.2000JS38.5.1 OT2001)
文摘FMS is a sort of highly automatic machining system, how to ensure partquality is master key to system highly active running. At first, series of machining dimension andprocess capability of flexible manufacturing system(FMS), is analyzed. Result of its, strongself-correlation of data series shows that time series analysis is applicable to data seriesanalyzed. Based on-line modeling and forecasting for data series, principle and method of feedbackcompensation control is proposed. On a foundation of the virtual instrument platform, Labview ofnational instrument (NI), FMS dimension and process capability monitoring system(monitoring system)is developed. In practice, it is proved that part quality and process capability of FMS are greatlyimproved.
基金Funditem:China Postdoctoral Science Fund(2 0 0 0 ).The Youth Fund of Sichuan U niversity.(43 2 0 2 8)
文摘The paper builds up the forecasting model of air temperature according to the data (1994~1998) of Fu Jin area.At the same time,the writer inquires into the relation of water requirement of well irrigation rice (ET) and average air temperature (T).Furthermore,the rice irrigation water requirement (ET) of Fu Jin area has been forecast in 1999.Thus,we can apply the model in irrigation management.
基金funded by the“Investigation and Evaluation of Geologic Environment for Laizhou Bay”of the China Geological Survey project(12120113003800)financially supported by National Natural Science Foundation of China(No.42102298)the program of China Geology Survey(No.DD20221759).
文摘Sea water intrusion is an environmental problem cause by the irrational exploitation of coastal groundwater resources and has attracted the attention of many coastal countries.In this study,we used time series monitoring data of groundwater levels and tidal waves to analyze the influence of tide flow on groundwater dynamics in the southern Laizhou Bay.The auto-correlation and cross-correlation coefficients between groundwater level and tidal wave level were calculated specifically to measure the boundary conditions along the coastline.In addition,spectrum analysis was employed to assess the periodicity and hysteresis of various tide and groundwater level fluctuations.The results of time series analysis show that groundwater level fluctuation is noticeably influenced by tides,but the influence is limited to a certain distance and cannot reach the saltwater-freshwater interface in the southern Laizhou Bay.There are three main periodic components of groundwater level in tidal effect range(i.e.23.804 h,12.500 h and 12.046 h),the pattern of which is the same as the tides.The affected groundwater level fluctuations lag behind the tides.The dynamic analysis of groundwater indicates that the coastal aquifer has a hydraulic connection with seawater but not in a direct way.Owing to the existence of the groundwater mound between the salty groundwater(brine)and fresh groundwater,the maximum influencing distance of the tide on the groundwater is 8.85 km.Considering that the fresh-saline groundwater interface is about 30 km away from the coastline,modern seawater has a limited contribution to sea-salt water intrusion in Laizhou Bay.The results of this study are expected to provide a reference for the study on sea water intrusion.
文摘Fault management is crucial to pro vi de quality of service grantees for the future networks, and fault identification is an essential part of it. A novel fault identification algorithm is proposed in this paper, which focuses on the anomaly detection of network traffic. Since the fault identification has been achieved using statistical information in mana gement information base, the algorithm is compatible with the existing simple ne twork management protocol framework. The network traffic time series is verified to be non-stationary. By fitting the adaptive autoregressive model, the series is transformed into a multidimensional vector. The training samples and identif iers are acquired from the network simulation. A k-nearest neighbor classif ier identifies the system faults after being trained. The experiment results are consistent with the given fault scenarios, which prove the accuracy of the algo rithm. The identification errors are discussed to illustrate that the novel faul t identification algorithm is adaptive in the fault scenarios with network traff ic change.
基金supported by the Culture,Sports,and Tourism R&D Program through the Korea Creative Content Agency grant funded by the Ministry of Culture,Sports,and Tourism in 2024(Project Name:Development of Distribution and Management Platform Technology and Human Resource Development for Blockchain-Based SW Copyright Protection,Project Number:RS-2023-00228867,Contribution Rate:100%)and also supported by the Soonchunhyang University Research Fund.
文摘In the context of rapid digitization in industrial environments,how effective are advanced unsupervised learning models,particularly hybrid autoencoder models,at detecting anomalies in industrial control system(ICS)datasets?This study is crucial because it addresses the challenge of identifying rare and complex anomalous patterns in the vast amounts of time series data generated by Internet of Things(IoT)devices,which can significantly improve the reliability and safety of these systems.In this paper,we propose a hybrid autoencoder model,called ConvBiLSTMAE,which combines convolutional neural network(CNN)and bidirectional long short-term memory(BiLSTM)to more effectively train complex temporal data patterns in anomaly detection.On the hardware-in-the-loopbased extended industrial control system dataset,the ConvBiLSTM-AE model demonstrated remarkable anomaly detection performance,achieving F1 scores of 0.78 and 0.41 for the first and second datasets,respectively.The results suggest that hybrid autoencoder models are not only viable,but potentially superior alternatives for unsupervised anomaly detection in complex industrial systems,offering a promising approach to improving their reliability and safety.
文摘Long-term time series forecasting stands as a crucial research domain within the realm of automated machine learning(AutoML).At present,forecasting,whether rooted in machine learning or statistical learning,typically relies on expert input and necessitates substantial manual involvement.This manual effort spans model development,feature engineering,hyper-parameter tuning,and the intricate construction of time series models.The complexity of these tasks renders complete automation unfeasible,as they inherently demand human intervention at multiple junctures.To surmount these challenges,this article proposes leveraging Long Short-Term Memory,which is the variant of Recurrent Neural Networks,harnessing memory cells and gating mechanisms to facilitate long-term time series prediction.However,forecasting accuracy by particular neural network and traditional models can degrade significantly,when addressing long-term time-series tasks.Therefore,our research demonstrates that this innovative approach outperforms the traditional Autoregressive Integrated Moving Average(ARIMA)method in forecasting long-term univariate time series.ARIMA is a high-quality and competitive model in time series prediction,and yet it requires significant preprocessing efforts.Using multiple accuracy metrics,we have evaluated both ARIMA and proposed method on the simulated time-series data and real data in both short and long term.Furthermore,our findings indicate its superiority over alternative network architectures,including Fully Connected Neural Networks,Convolutional Neural Networks,and Nonpooling Convolutional Neural Networks.Our AutoML approach enables non-professional to attain highly accurate and effective time series forecasting,and can be widely applied to various domains,particularly in business and finance.
文摘Objective:To explore the modeling of time series of animal bite occurrence in northwest Iran.Methods:In this study,we analyzed surveillance time series data for animal bite cases in the northwest Iran province of Iran from 2011 to 2017.We used decomposition methods to explore seasonality and long-term trends and applied the Autoregressive Integrated Moving Average(ARIMA)model to fit a univariate time series of animal bite incidence.The ARIMA modeling process involved selecting the time series,transforming the series,selecting the appropriate model,estimating parameters,and forecasting.Results:Our results using the Box Jenkins model showed a significant seasonal trend and an overall increase in animal bite incidents during the study period.The best-fitting model for the available data was a seasonal ARIMA model with drift in the form of ARIMA(2,0,0)(1,1,1).This model can be used to forecast the frequency of animal attacks in northwest Iran over the next two years,suggesting that the incidence of animal attacks in the region would continue to increase during this time frame(2018-2019).Conclusion:Our findings suggest that time series analysis is a useful method for investigating animal bite cases and predicting future occurrences.The existence of a seasonal trend in animal bites can also aid in planning healthcare services during different seasons of the year.Therefore,our study highlights the importance of implementing proactive measures to address the growing issue of animal bites in Iran.
文摘Aiming to realize fast and accurate fault diagnosis in complex network environment, this article proposes a set of anomaly detection algorithm and intelligent fault diagnosis model. Firstly, a novel anomaly detection algorithm based on time series analysis is put forward to improve the generalized likelihood ratio (GLR) test, and thus, detection accuracy is enhanced and the algorithm complexity is reduced. Secondly, the intelligent fault diagnosis model is established by introducing neural network technology, and thereby, the anomaly information of each node in end-to-end network is integrated and processed in parallel to intelligently diagnose the fault cause. Finally, server backup solution in enterprise information network is taken as the simulation scenario. The results demonstrate that the proposed method can not only detect fault occurrence in time, but can also implement online diagnosis for fault cause, and thus, real-time and intelligent fault management process is achieved.
文摘In forecasting real time environmental factors,large data is needed to analyse the pattern behind the data values.Air pollution is a major threat towards developing countries and it is proliferating every year.Many methods in time ser-ies prediction and deep learning models to estimate the severity of air pollution.Each independent variable contributing towards pollution is necessary to analyse the trend behind the air pollution in that particular locality.This approach selects multivariate time series and coalesce a real time updatable autoregressive model to forecast Particulate matter(PM)PM2.5.To perform experimental analysis the data from the Central Pollution Control Board(CPCB)is used.Prediction is car-ried out for Chennai with seven locations and estimated PM’s using the weighted ensemble method.Proposed method for air pollution prediction unveiled effective and moored performance in long term prediction.Dynamic budge with high weighted k-models are used simultaneously and devising an ensemble helps to achieve stable forecasting.Computational time of ensemble decreases with paral-lel processing in each sub model.Weighted ensemble model shows high perfor-mance in long term prediction when compared to the traditional time series models like Vector Auto-Regression(VAR),Autoregressive Integrated with Mov-ing Average(ARIMA),Autoregressive Moving Average with Extended terms(ARMEX).Evaluation metrics like Root Mean Square Error(RMSE),Mean Absolute Error(MAE)and the time to achieve the time series are compared.
基金partly supported by the National Natural Science Foundation of China [81773370 and 82173638]the Natural Science Foundation of Heilongjiang Province [TD2019H001]
文摘Objective To recognize the spatial and temporal characteristics of iodine deficiency disorders(IDD),China national IDD surveillance data for the years of 1995–2018 were analyzed.Methods Time series analysis was used to describe and predict the IDD related indicators,and spatial analysis was used to analyze the spatial distribution of salt iodine levels.Results In China,the median urinary iodine concentration increased in 1995–1997,then decreased to adequate levels,and are expected to remain appropriate in 2019–2022.The goiter rate continually decreased and is expected to be maintained at a low level.Since 2002,the coverage rates of iodized salt and the consumption rates of qualified iodized salt(the percentage of qualified iodized salt in all tested salt) increased and began to decline in 2012;they are expected to continue to decrease.Spatial epidemiological analysis indicated a positive spatial correlation in 2016–2018 and revealed feature regarding the spatial distribution of salt related indicators in coastal areas and areas near iodine-excess areas.Conclusions Iodine nutrition in China showed gradual improvements.However,a recent decline has been observed in some areas following changes in the iodized salt supply in China.In the future,more regulations regarding salt management should be issued to strengthen IDD control and prevention measures,and avoid the recurrence of IDD.
文摘Time-series analysis is important to a wide range of disciplines transcending both the physical and social sciences for proactive policy decisions. Statistical models have sound theoretical basis and have been successfully used in a number of problem domains in time series forecasting. Due to power and flexibility, Box-Jenkins ARIMA model has gained enormous popularity in many areas and research practice for the last three decades. More recently, the neural networks have been shown to be a promising alternative tool for modeling and forecasting owing to their ability to capture the nonlinearity in the data. However, despite the popularity and the superiority of ARIMA and ANN models, the empirical forecasting performance has been rather mixed so that no single method is best in every situation. In this study, a hybrid ARIMA and neural networks model to time series forecasting is proposed. The basic idea behind the model combination is to use each model’s unique features to capture different patterns in the data. With three real data sets, empirical results evidently show that the hybrid model outperforms ARIMA and ANN model noticeably in terms of forecasting accuracy used in isolation.