BACKGROUND Hepatitis B(HB)and hepatitis C(HC)place the largest burden in China,and a goal of eliminating them as a major public health threat by 2030 has been set.Making more informed and accurate forecasts of their s...BACKGROUND Hepatitis B(HB)and hepatitis C(HC)place the largest burden in China,and a goal of eliminating them as a major public health threat by 2030 has been set.Making more informed and accurate forecasts of their spread is essential for developing effective strategies,heightening the requirement for early warning to deal with such a major public health threat.AIM To monitor HB and HC epidemics by the design of a paradigmatic seasonal autoregressive fractionally integrated moving average(SARFIMA)for projections into 2030,and to compare the effectiveness with the seasonal autoregressive integrated moving average(SARIMA).METHODS Monthly HB and HC incidence cases in China were obtained from January 2004 to June 2023.Descriptive analysis and the Hodrick-Prescott method were employed to identify trends and seasonality.Two periods(from January 2004 to June 2022 and from January 2004 to December 2015,respectively)were used as the training sets to develop both models,while the remaining periods served as the test sets to evaluate the forecasting accuracy.RESULTS There were incidents of 23400874 HB cases and 3590867 HC cases from January 2004 to June 2023.Overall,HB remained steady[average annual percentage change(AAPC)=0.44,95%confidence interval(95%CI):-0.94-1.84]while HC was increasing(AAPC=8.91,95%CI:6.98-10.88),and both had a peak in March and a trough in February.In the 12-step-ahead HB forecast,the mean absolute deviation(15211.94),root mean square error(18762.94),mean absolute percentage error(0.17),mean error rate(0.15),and root mean square percentage error(0.25)under the best SARFIMA(3,0,0)(0,0.449,2)12 were smaller than those under the best SARIMA(3,0,0)(0,1,2)12(16867.71,20775.12,0.19,0.17,and 0.27,respectively).Similar results were also observed for the 90-step-ahead HB,12-step-ahead HC,and 90-step-ahead HC forecasts.The predicted HB incidents totaled 9865400(95%CI:7508093-12222709)cases and HC totaled 1659485(95%CI:856681-2462290)cases during 2023-2030.CONCLUSION Under current interventions,China faces enormous challenges to eliminate HB and HC epidemics by 2030,and effective strategies must be reinforced.The integration of SARFIMA into public health for the management of HB and HC epidemics can potentially result in more informed and efficient interventions,surpassing the capabilities of SARIMA.展开更多
China has resolved its overall regional poverty in 2020 by attaining moderate societal prosperity.The country has entered a new development stage designed to achieve its second centenary goal.However,ecological fragil...China has resolved its overall regional poverty in 2020 by attaining moderate societal prosperity.The country has entered a new development stage designed to achieve its second centenary goal.However,ecological fragility and risk susceptibility have increased the risk of returning to ecological poverty.In this paper,the Liupan Mountain Region of China was used as a case study,and the counties were used as the scale to reveal the spatiotempora differentiation and influcing factors of the risk of returning to poverty in study area.The indicator data for returning to ecological poverty from 2011-2020 were collected and summarized in three dimensions:ecological,economic and social.The autoregressive integrated moving average model(ARIMA)time series and exponential smoothing method(ES)were used to predict the multidimensional indicators of returning to ecological poverty for 61 counties(districts)in the Liupan Mountain Region for 2021-2030.The back propagation neural network(BPNN)and geographic information system(GIS)were used to generate the spatial distribution and time variation for the index of the risk of returning to ecological poverty(RREP index).The results show that 1)ecological factors were the main factors in the risk of returning to ecological poverty in Liupan Mountain Region.2)The RREP index for the 61 counties(districts)exhibited a downward trend from 2021-2030.The RREP index declined more in medium-and high-risk areas than in low-risk areas.From 2021 to 2025,the RREP index exhibited a slight downward trend.From 2026 to2030,the RREP index was expected to decline faster,especially from 2029-2030.3)Based on the RREP index,it can be roughly divided into three types,namely,the high-risk areas,the medium-risk areas,and the low-risk areas.The natural resource conditions in lowrisk areas of returning to ecological poverty,were better than those in medium-and high-risk areas.展开更多
Long-term time series forecasting stands as a crucial research domain within the realm of automated machine learning(AutoML).At present,forecasting,whether rooted in machine learning or statistical learning,typically ...Long-term time series forecasting stands as a crucial research domain within the realm of automated machine learning(AutoML).At present,forecasting,whether rooted in machine learning or statistical learning,typically relies on expert input and necessitates substantial manual involvement.This manual effort spans model development,feature engineering,hyper-parameter tuning,and the intricate construction of time series models.The complexity of these tasks renders complete automation unfeasible,as they inherently demand human intervention at multiple junctures.To surmount these challenges,this article proposes leveraging Long Short-Term Memory,which is the variant of Recurrent Neural Networks,harnessing memory cells and gating mechanisms to facilitate long-term time series prediction.However,forecasting accuracy by particular neural network and traditional models can degrade significantly,when addressing long-term time-series tasks.Therefore,our research demonstrates that this innovative approach outperforms the traditional Autoregressive Integrated Moving Average(ARIMA)method in forecasting long-term univariate time series.ARIMA is a high-quality and competitive model in time series prediction,and yet it requires significant preprocessing efforts.Using multiple accuracy metrics,we have evaluated both ARIMA and proposed method on the simulated time-series data and real data in both short and long term.Furthermore,our findings indicate its superiority over alternative network architectures,including Fully Connected Neural Networks,Convolutional Neural Networks,and Nonpooling Convolutional Neural Networks.Our AutoML approach enables non-professional to attain highly accurate and effective time series forecasting,and can be widely applied to various domains,particularly in business and finance.展开更多
气温衍生品是天气衍生品交易中最活跃的合约之一,确定合理预测气温动态变化的模型,是气温衍生品开发设计的基础。考虑到气温在时间变化上具有趋势性、季节性和周期性等特点,文中使用了以O-U均值回复过程为基础的Continuous Time Autoreg...气温衍生品是天气衍生品交易中最活跃的合约之一,确定合理预测气温动态变化的模型,是气温衍生品开发设计的基础。考虑到气温在时间变化上具有趋势性、季节性和周期性等特点,文中使用了以O-U均值回复过程为基础的Continuous Time Autoregressive Model(CAR)模型、Seasonal Autoregressive Integrated Moving Average(SARIMA)模型和小波神经网络算法,并选择漠河、北京、乌鲁木齐、芜湖、昆明和海口具有地域性代表的城市气温进行拟合,使用无偏绝对百分比误差、绝对百分比误差和平均绝对比例误差检验指标检验了模型的预测精度。研究结果表明,小波神经网络算法在预测6个城市的无偏绝对百分比误差、绝对百分比误差和平均绝对比例误差的值最小;同时,相比CAR模型、SARIMA模型,其预测效果最优。因此,小波神经网络算法能够很好地拟合气温数据的变化,可以为我国气温天气衍生品的定价提供一定的指导。展开更多
Background:Improving financial time series forecasting is one of the most challenging and vital issues facing numerous financial analysts and decision makers.Given its direct impact on related decisions,various attemp...Background:Improving financial time series forecasting is one of the most challenging and vital issues facing numerous financial analysts and decision makers.Given its direct impact on related decisions,various attempts have been made to achieve more accurate and reliable forecasting results,of which the combining of individual models remains a widely applied approach.In general,individual models are combined under two main strategies:series and parallel.While it has been proven that these strategies can improve overall forecasting accuracy,the literature on time series forecasting remains vague on the choice of an appropriate strategy to generate a more accurate hybrid model.Methods:Therefore,this study’s key aim is to evaluate the performance of series and parallel strategies to determine a more accurate one.Results:Accordingly,the predictive capabilities of five hybrid models are constructed on the basis of series and parallel strategies compared with each other and with their base models to forecast stock price.To do so,autoregressive integrated moving average(ARIMA)and multilayer perceptrons(MLPs)are used to construct two series hybrid models,ARIMA-MLP and MLP-ARIMA,and three parallel hybrid models,simple average,linear regression,and genetic algorithm models.Conclusion:The empirical forecasting results for two benchmark datasets,that is,the closing of the Shenzhen Integrated Index(SZII)and that of Standard and Poor’s 500(S&P 500),indicate that although all hybrid models perform better than at least one of their individual components,the series combination strategy produces more accurate hybrid models for financial time series forecasting.展开更多
Time series analysis has two goals, modeling random mechanisms and predicting future series using historical data. In the present work, a uni-variate time series autoregressive integrated moving average (ARIMA) mode...Time series analysis has two goals, modeling random mechanisms and predicting future series using historical data. In the present work, a uni-variate time series autoregressive integrated moving average (ARIMA) model has been developed for (a) simulating and forecasting mean rainfall, obtained using Theissen weights; over the Mahanadi River Basin in India, and (b) simula^ag and forecasting mean rainfall at 38 rain-gauge stations in district towns across the basin. For the analysis, monthly rainfall data of each district town for the years 1901-2002 (102 years) were used. Theissen weights were obtained over the basin and mean monthly rainfall was estimated. The trend and seasonality observed in ACF and PACF plots of rainfall data were removed using power transformation (a=0.5) and first order seasonal differencing prior to the development of the ARIMA model. Interestingly, the AR1MA model (1,0,0)(0,1,1)12 developed here was found to be most suitable for simulating and forecasting mean rainfall over the Mahanadi River Basin and for all 38 district town rain-gauge stations, separately. The Akaike Information Criterion (AIC), good- ness of fit (Chi-square), R2 (coefficient of determination), MSE (mean square error) and MAE (mea absolute error) were used to test the validity and applicability of the developed ARIMA model at different stages. This model is considered appropriate to forecast the monthly rainfall for the upcoming 12 years in each district town to assist decision makers and policy makers establish priorities for water demand, storage, distribution, and disaster management.展开更多
We introduce a novel approach to multifractal data in order to achieve transcended modeling and forecasting performances by extracting time series out of local Hurst exponent calculations at a specified scale.First,th...We introduce a novel approach to multifractal data in order to achieve transcended modeling and forecasting performances by extracting time series out of local Hurst exponent calculations at a specified scale.First,the long range and co-movement dependencies of the time series are scrutinized on time-frequency space using multiple wavelet coherence analysis.Then,the multifractal behaviors of the series are verified by multifractal de-trended fluctuation analysis and its local Hurst exponents are calculated.Additionally,root mean squares of residuals at the specified scale are procured from an intermediate step during local Hurst exponent calculations.These internally calculated series have been used to estimate the process with vector autoregressive fractionally integrated moving average(VARFIMA)model and forecasted accordingly.In our study,the daily prices of gold,silver and platinum are used for assessment.The results have shown that all metals do behave in phase movement on long term periods and possess multifractal features.Furthermore,the intermediate time series obtained during local Hurst exponent calculations still appertain the co-movement as well as multifractal characteristics of the raw data and may be successfully re-scaled,modeled and forecasted by using VARFIMA model.Conclusively,VARFIMA model have notably surpassed its univariate counterpart(ARFIMA)in all efficacious trials while re-emphasizing the importance of comovement procurement in modeling.Our study’s novelty lies in using a multifractal de-trended fluctuation analysis,along with multiple wavelet coherence analysis,for forecasting purposes to an extent not seen before.The results will be of particular significance to finance researchers and practitioners.展开更多
Aiming at the problem of abnormal data generated by a power transformer on-line monitoring system due to the influences of transformer operation state change,external environmental interference,communication interrupt...Aiming at the problem of abnormal data generated by a power transformer on-line monitoring system due to the influences of transformer operation state change,external environmental interference,communication interruption,and other factors,a method of anomaly recognition and differentiation for monitoring data was proposed.Firstly,the empirical wavelet transform(EWT)and the autoregressive integrated moving average(ARIMA)model were used for time series modelling of monitoring data to obtain the residual sequence reflecting the anomaly monitoring data value,and then the isolation forest algorithm was used to identify the abnormal information,and the monitoring sequence was segmented according to the recognition results.Secondly,the segmented sequence was symbolised by the improved multi-dimensional SAX vector representation method,and the assessment of the anomaly pattern was made by calculating the similarity score of the adjacent symbol vectors,and the monitoring sequence correlation was further used to verify the assessment.Finally,the case study result shows that the proposed method can reliably recognise abnormal data and accurately distinguish between invalid and valid anomaly patterns.展开更多
Wholesale and retail markets for electricity and power require consumers to forecast electricity consumption at different time intervals. The study aims to</span><span style="font-family:Verdana;"&g...Wholesale and retail markets for electricity and power require consumers to forecast electricity consumption at different time intervals. The study aims to</span><span style="font-family:Verdana;"> increase economic efficiency of the enterprise through the introduction of algorithm for forecasting electric energy consumption unchanged in technological process. Qualitative forecast allows you to essentially reduce costs of electrical </span><span style="font-family:Verdana;">energy, because power cannot be stockpiled. Therefore, when buying excess electrical power, costs can increase either by selling it on the balancing energy </span><span style="font-family:Verdana;">market or by maintaining reserve capacity. If the purchased power is insufficient, the costs increase is due to the purchase of additional capacity. This paper illustrates three methods of forecasting electric energy consumption: autoregressive integrated moving average method, artificial neural networks and classification and regression trees. Actual data from consuming of electrical energy was </span><span style="font-family:Verdana;">used to make day, week and month ahead prediction. The prediction effect of</span><span> </span><span style="font-family:Verdana;">prediction model was proved in Statistica simulation environment. Analysis of estimation of the economic efficiency of prediction methods demonstrated that the use of the artificial neural networks method for short-term forecast </span><span style="font-family:Verdana;">allowed reducing the cost of electricity more efficiently. However, for mid-</span></span><span style="font-family:""> </span><span style="font-family:Verdana;">range predictions, the classification and regression tree was the most efficient method for a Jerky Enterprise. The results indicate that calculation error reduction allows decreases expenses for the purchase of electric energy.展开更多
There has been a moderate increase in newly diagnosed HIV-infected Minna populace, which calls for serious attention.<span style="font-family:;" "=""> </span><span style="f...There has been a moderate increase in newly diagnosed HIV-infected Minna populace, which calls for serious attention.<span style="font-family:;" "=""> </span><span style="font-family:Verdana;">This study</span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">used time series data based on monthly HIV cases from January 2007 to December 2018 taken from the statistical data document on HIV prevalence recorded in General Hospital Minna, Niger State.</span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">The methodology employed to analyze the data is base</span><span style="font-family:Verdana;">d</span><span style="font-family:;" "=""><span style="font-family:Verdana;"> on mathematical models of ARMA, ARIMA and SARIMA which were computed and diagnosed. From the results of parameter estimation of </span><span style="font-family:Verdana;">the models, ARMA(2, 1) model was the best model among the other ARMA models using information criteria (AIC). Diagnostic test was run on the ARMA(2, 1) model where the results show that the model was adequate and normally distributed using Box-Lung test and Q</span></span><span style="font-family:Verdana;">-</span><span style="font-family:Verdana;">Q plot respectively. Fur</span><span style="font-family:Verdana;">thermore, ARIMA of first and second differences w</span><span style="font-family:Verdana;">as</span><span style="font-family:Verdana;"> estimated and ARIMA(1,</span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">0,</span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">1) was the best model from the result of the AIC and diagnostic test carried out which revealed that the model was adequate and normally distributed using Box-Lung and Q-Q plot respectively. Furthermore, the results obtained in the ARMA and ARIMA models were used to arrive at a combined model given as ARIMA(1, 0, 1) </span><span style="font-family:;" "=""><span style="font-family:Verdana;">×</span><span><span style="font-family:Verdana;"> SARIMA(1, 0, 1)</span><sub><span style="font-family:Verdana;">12</span></sub></span></span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">which was subsequently estimated and found to be adequate from the result of the Box-Lung and Q-Q plot respectively. Post forecasting estimation and performance evolution were evaluated using the RMSE and MAE. The results showed that, ARIMA(1, 0, 1) </span><span style="font-family:;" "=""><span style="font-family:Verdana;">×</span><span><span style="font-family:Verdana;"> SARIMA(1, 0, 1)</span><sub><span style="font-family:Verdana;">12</span></sub><span style="font-family:Verdana;"> is the best forecasting model followed by ARIMA(1, 0, 2) on monthly HIV prevalence in Minna, Niger state.</span></span></span>展开更多
Air pollution is a severe environmental problem in urban areas.Accurate air quality prediction can help governments and individuals make proper decisions to cope with potential air pollution.As a classic time series f...Air pollution is a severe environmental problem in urban areas.Accurate air quality prediction can help governments and individuals make proper decisions to cope with potential air pollution.As a classic time series forecasting model,the AutoRegressive Integrated Moving Average(ARIMA)has been widely adopted in air quality prediction.However,because of the volatility of air quality and the lack of additional context information,i.e.,the spatial relationships among monitor stations,traditional ARIMA models suffer from unstable prediction performance.Though some deep networks can achieve higher accuracy,a mass of training data,heavy computing,and time cost are required.In this paper,we propose a hybrid model to simultaneously predict seven air pollution indicators from multiple monitoring stations.The proposed model consists of three components:(1)an extended ARIMA to predict matrix series of multiple air quality indicators from several adjacent monitoring stations;(2)the Empirical Mode Decomposition(EMD)to decompose the air quality time series data into multiple smooth sub-series;and(3)the truncated Singular Value Decomposition(SvD)to compress and denoise the expanded matrix.Experimental results on the public dataset show that our proposed model outperforms the state-of-art air quality forecasting models in both accuracy and time cost.展开更多
Electricity prices have complex features,such as high frequency,multiple seasonality,and nonlinearity.These factors will make the prediction of electricity prices difficult.However,accurate electricity price predictio...Electricity prices have complex features,such as high frequency,multiple seasonality,and nonlinearity.These factors will make the prediction of electricity prices difficult.However,accurate electricity price prediction is important for energy producers and consumers to develop bidding strategies.To improve the accuracy of prediction by using each algorithms’advantages,this paper proposes a hybrid model that uses the Empirical Mode Decomposition(EMD),Autoregressive Integrated Moving Average(ARIMA),and Temporal Convolutional Network(TCN).EMD is used to decompose the electricity prices into low and high frequency components.Low frequency components are forecasted by the ARIMA model and the high frequency series are predicted by the TCN model.Experimental results using the realistic electricity price data from Pennsylvania-New Jersey-Maryland(PJM)electricity markets show that the proposed method has a higher prediction accuracy than other single methods and hybrid methods.展开更多
This paper presents a control strategy for residential battery energy storage systems,which is aware of volatile electricity markets and uncertain daily cycling loads.The economic benefits of energy trading for prosum...This paper presents a control strategy for residential battery energy storage systems,which is aware of volatile electricity markets and uncertain daily cycling loads.The economic benefits of energy trading for prosumers are achieved through a novel modification of a conventional model predictive control(MPC).The proposed control strategy guarantees an optimal global solution for the applied control action.A new cost function is introduced to model the effects of volatility on customer benefits more effectively.Specifically,the newly presented cost function models a probabilistic relation between the power exchanged with the grid,the net load,and the electricity market.The probabilistic calculation of the cost function shows the dependence on the mathematical expectation of market price and net load.Computational techniques for calculating this value are presented.The proposed strategy differs from the stochastic and robust MPC in that the cost is calculated across the market price and net load variations rather than across model constraints and parameter variations.展开更多
Time series forecasting research area mainly focuses on developing effective forecasting models toimprove prediction accuracy. An ensemble model composed of autoregressive integrated movingaverage (ARIMA), artificia...Time series forecasting research area mainly focuses on developing effective forecasting models toimprove prediction accuracy. An ensemble model composed of autoregressive integrated movingaverage (ARIMA), artificial neural network (ANN), restricted Boltzmann machines (RBM), anddiscrete wavelet transform (DWT) is presented in this paper. In the proposed model, DWT firstdecomposes time series into approximation and detail. Then Khashei and Bijari's model, which is anensemble model of ARIMA and ANN, is applied to the approximation and detail to extract their bothlinear and nonlinear components and fit the relationship between the components as a function insteadof additive relationship. Furthermore, RBM is used to perform pre-training for generating initialweights and biases based on inputs feature for ANN. Finally, the forecasted approximation and detailare combined to obtain final forecasting. The forecasting capability of the proposed model is testedwith three well-known time series: sunspot, Canadian lynx, exchange rate time series. The predictionperformance is compared to the other six forecasting models. The results indicate that the proposedmodel gives the best performance in all three data sets and all three measures (i.e. MSE, MAE andMAPE).展开更多
Traffic prediction plays an integral role in telecommunication network planning and network optimization. In this paper, we investigate the traffic forecasting for data services in 3G mobile networks. Although the Box...Traffic prediction plays an integral role in telecommunication network planning and network optimization. In this paper, we investigate the traffic forecasting for data services in 3G mobile networks. Although the Box-Jenkins model has been proven to be appropriate for voice traffic (since the arrival of calls follows a Poisson distribution), it has been demonstrated that the Internet traffic exhibits statistical self-similarity and has to be modeled using the Fractional AutoRegressive Integrated Moving Average (FARIMA) process. However, a few studies have concluded that the FARIMA process may fail in modeling the Internet traffic. To this end, we conducted experiments on the modeling of benchmark Internet traffic and found that the FARIMA process fails because of the significant multifractal characteristic inherent in the traffic series. Thereafter, we investigate the traffic series of data services in a 3G mobile network from a province in China. Rich multifractal spectra are found in this series. Based on this observation, an integrated method combining the AutoRegressive Moving Average (ARMA) and FARIMA processes is applied. The obtained experimental results verify the effectiveness of the integrated prediction method.展开更多
Purpose-The purpose of this paper is to propose a data prediction framework for scenarios which require forecasting demand for large-scale data sources,e.g.,sensor networks,securities exchange,electric power secondary...Purpose-The purpose of this paper is to propose a data prediction framework for scenarios which require forecasting demand for large-scale data sources,e.g.,sensor networks,securities exchange,electric power secondary system,etc.Concretely,the proposed framework should handle several difficult requirements including the management of gigantic data sources,the need for a fast self-adaptive algorithm,the relatively accurate prediction of multiple time series,and the real-time demand.Design/methodology/approach-First,the autoregressive integrated moving average-based prediction algorithm is introduced.Second,the processing framework is designed,which includes a time-series data storage model based on the HBase,and a real-time distributed prediction platform based on Storm.Then,the work principle of this platform is described.Finally,a proof-of-concept testbed is illustrated to verify the proposed framework.Findings-Several tests based on Power Grid monitoring data are provided for the proposed framework.The experimental results indicate that prediction data are basically consistent with actual data,processing efficiency is relatively high,and resources consumption is reasonable.Originality/value-This paper provides a distributed real-time data prediction framework for large-scale time-series data,which can exactly achieve the requirement of the effective management,prediction efficiency,accuracy,and high concurrency for massive data sources.展开更多
The mode of telecommunication network management is changing from“network oriented”to“subscriber oriented”.Aimed at enhancing subscribers’feeling,proactive performance monitoring(PPM)can enable a fast fault corre...The mode of telecommunication network management is changing from“network oriented”to“subscriber oriented”.Aimed at enhancing subscribers’feeling,proactive performance monitoring(PPM)can enable a fast fault correction by detecting anomalies designating performance degradation.In this paper,a novel anomaly detection approach is the proposed taking advantage of time series prediction and the associated confidence interval based on multiplicative autoregressive integrated moving average(ARIMA).Furthermore,under the assumption that the training residual is a white noise process following a normal distribution,the associated confidence interval of prediction can be figured out under any given confidence degree 1–αby constructing random variables satisfying t distribution.Experimental results verify the method’s effectiveness.展开更多
基金Supported by the Key Scientific Research Project of Universities in Henan Province,No.21A330004Natural Science Foundation in Henan Province,No.222300420265.
文摘BACKGROUND Hepatitis B(HB)and hepatitis C(HC)place the largest burden in China,and a goal of eliminating them as a major public health threat by 2030 has been set.Making more informed and accurate forecasts of their spread is essential for developing effective strategies,heightening the requirement for early warning to deal with such a major public health threat.AIM To monitor HB and HC epidemics by the design of a paradigmatic seasonal autoregressive fractionally integrated moving average(SARFIMA)for projections into 2030,and to compare the effectiveness with the seasonal autoregressive integrated moving average(SARIMA).METHODS Monthly HB and HC incidence cases in China were obtained from January 2004 to June 2023.Descriptive analysis and the Hodrick-Prescott method were employed to identify trends and seasonality.Two periods(from January 2004 to June 2022 and from January 2004 to December 2015,respectively)were used as the training sets to develop both models,while the remaining periods served as the test sets to evaluate the forecasting accuracy.RESULTS There were incidents of 23400874 HB cases and 3590867 HC cases from January 2004 to June 2023.Overall,HB remained steady[average annual percentage change(AAPC)=0.44,95%confidence interval(95%CI):-0.94-1.84]while HC was increasing(AAPC=8.91,95%CI:6.98-10.88),and both had a peak in March and a trough in February.In the 12-step-ahead HB forecast,the mean absolute deviation(15211.94),root mean square error(18762.94),mean absolute percentage error(0.17),mean error rate(0.15),and root mean square percentage error(0.25)under the best SARFIMA(3,0,0)(0,0.449,2)12 were smaller than those under the best SARIMA(3,0,0)(0,1,2)12(16867.71,20775.12,0.19,0.17,and 0.27,respectively).Similar results were also observed for the 90-step-ahead HB,12-step-ahead HC,and 90-step-ahead HC forecasts.The predicted HB incidents totaled 9865400(95%CI:7508093-12222709)cases and HC totaled 1659485(95%CI:856681-2462290)cases during 2023-2030.CONCLUSION Under current interventions,China faces enormous challenges to eliminate HB and HC epidemics by 2030,and effective strategies must be reinforced.The integration of SARFIMA into public health for the management of HB and HC epidemics can potentially result in more informed and efficient interventions,surpassing the capabilities of SARIMA.
基金Under the auspices of National Natural Science Foundation of China(No.42071230)。
文摘China has resolved its overall regional poverty in 2020 by attaining moderate societal prosperity.The country has entered a new development stage designed to achieve its second centenary goal.However,ecological fragility and risk susceptibility have increased the risk of returning to ecological poverty.In this paper,the Liupan Mountain Region of China was used as a case study,and the counties were used as the scale to reveal the spatiotempora differentiation and influcing factors of the risk of returning to poverty in study area.The indicator data for returning to ecological poverty from 2011-2020 were collected and summarized in three dimensions:ecological,economic and social.The autoregressive integrated moving average model(ARIMA)time series and exponential smoothing method(ES)were used to predict the multidimensional indicators of returning to ecological poverty for 61 counties(districts)in the Liupan Mountain Region for 2021-2030.The back propagation neural network(BPNN)and geographic information system(GIS)were used to generate the spatial distribution and time variation for the index of the risk of returning to ecological poverty(RREP index).The results show that 1)ecological factors were the main factors in the risk of returning to ecological poverty in Liupan Mountain Region.2)The RREP index for the 61 counties(districts)exhibited a downward trend from 2021-2030.The RREP index declined more in medium-and high-risk areas than in low-risk areas.From 2021 to 2025,the RREP index exhibited a slight downward trend.From 2026 to2030,the RREP index was expected to decline faster,especially from 2029-2030.3)Based on the RREP index,it can be roughly divided into three types,namely,the high-risk areas,the medium-risk areas,and the low-risk areas.The natural resource conditions in lowrisk areas of returning to ecological poverty,were better than those in medium-and high-risk areas.
文摘Long-term time series forecasting stands as a crucial research domain within the realm of automated machine learning(AutoML).At present,forecasting,whether rooted in machine learning or statistical learning,typically relies on expert input and necessitates substantial manual involvement.This manual effort spans model development,feature engineering,hyper-parameter tuning,and the intricate construction of time series models.The complexity of these tasks renders complete automation unfeasible,as they inherently demand human intervention at multiple junctures.To surmount these challenges,this article proposes leveraging Long Short-Term Memory,which is the variant of Recurrent Neural Networks,harnessing memory cells and gating mechanisms to facilitate long-term time series prediction.However,forecasting accuracy by particular neural network and traditional models can degrade significantly,when addressing long-term time-series tasks.Therefore,our research demonstrates that this innovative approach outperforms the traditional Autoregressive Integrated Moving Average(ARIMA)method in forecasting long-term univariate time series.ARIMA is a high-quality and competitive model in time series prediction,and yet it requires significant preprocessing efforts.Using multiple accuracy metrics,we have evaluated both ARIMA and proposed method on the simulated time-series data and real data in both short and long term.Furthermore,our findings indicate its superiority over alternative network architectures,including Fully Connected Neural Networks,Convolutional Neural Networks,and Nonpooling Convolutional Neural Networks.Our AutoML approach enables non-professional to attain highly accurate and effective time series forecasting,and can be widely applied to various domains,particularly in business and finance.
文摘Background:Improving financial time series forecasting is one of the most challenging and vital issues facing numerous financial analysts and decision makers.Given its direct impact on related decisions,various attempts have been made to achieve more accurate and reliable forecasting results,of which the combining of individual models remains a widely applied approach.In general,individual models are combined under two main strategies:series and parallel.While it has been proven that these strategies can improve overall forecasting accuracy,the literature on time series forecasting remains vague on the choice of an appropriate strategy to generate a more accurate hybrid model.Methods:Therefore,this study’s key aim is to evaluate the performance of series and parallel strategies to determine a more accurate one.Results:Accordingly,the predictive capabilities of five hybrid models are constructed on the basis of series and parallel strategies compared with each other and with their base models to forecast stock price.To do so,autoregressive integrated moving average(ARIMA)and multilayer perceptrons(MLPs)are used to construct two series hybrid models,ARIMA-MLP and MLP-ARIMA,and three parallel hybrid models,simple average,linear regression,and genetic algorithm models.Conclusion:The empirical forecasting results for two benchmark datasets,that is,the closing of the Shenzhen Integrated Index(SZII)and that of Standard and Poor’s 500(S&P 500),indicate that although all hybrid models perform better than at least one of their individual components,the series combination strategy produces more accurate hybrid models for financial time series forecasting.
文摘Time series analysis has two goals, modeling random mechanisms and predicting future series using historical data. In the present work, a uni-variate time series autoregressive integrated moving average (ARIMA) model has been developed for (a) simulating and forecasting mean rainfall, obtained using Theissen weights; over the Mahanadi River Basin in India, and (b) simula^ag and forecasting mean rainfall at 38 rain-gauge stations in district towns across the basin. For the analysis, monthly rainfall data of each district town for the years 1901-2002 (102 years) were used. Theissen weights were obtained over the basin and mean monthly rainfall was estimated. The trend and seasonality observed in ACF and PACF plots of rainfall data were removed using power transformation (a=0.5) and first order seasonal differencing prior to the development of the ARIMA model. Interestingly, the AR1MA model (1,0,0)(0,1,1)12 developed here was found to be most suitable for simulating and forecasting mean rainfall over the Mahanadi River Basin and for all 38 district town rain-gauge stations, separately. The Akaike Information Criterion (AIC), good- ness of fit (Chi-square), R2 (coefficient of determination), MSE (mean square error) and MAE (mea absolute error) were used to test the validity and applicability of the developed ARIMA model at different stages. This model is considered appropriate to forecast the monthly rainfall for the upcoming 12 years in each district town to assist decision makers and policy makers establish priorities for water demand, storage, distribution, and disaster management.
文摘We introduce a novel approach to multifractal data in order to achieve transcended modeling and forecasting performances by extracting time series out of local Hurst exponent calculations at a specified scale.First,the long range and co-movement dependencies of the time series are scrutinized on time-frequency space using multiple wavelet coherence analysis.Then,the multifractal behaviors of the series are verified by multifractal de-trended fluctuation analysis and its local Hurst exponents are calculated.Additionally,root mean squares of residuals at the specified scale are procured from an intermediate step during local Hurst exponent calculations.These internally calculated series have been used to estimate the process with vector autoregressive fractionally integrated moving average(VARFIMA)model and forecasted accordingly.In our study,the daily prices of gold,silver and platinum are used for assessment.The results have shown that all metals do behave in phase movement on long term periods and possess multifractal features.Furthermore,the intermediate time series obtained during local Hurst exponent calculations still appertain the co-movement as well as multifractal characteristics of the raw data and may be successfully re-scaled,modeled and forecasted by using VARFIMA model.Conclusively,VARFIMA model have notably surpassed its univariate counterpart(ARFIMA)in all efficacious trials while re-emphasizing the importance of comovement procurement in modeling.Our study’s novelty lies in using a multifractal de-trended fluctuation analysis,along with multiple wavelet coherence analysis,for forecasting purposes to an extent not seen before.The results will be of particular significance to finance researchers and practitioners.
基金supported by State Grid Hebei Electric Power Co.,Ltd.(kj2020-040).
文摘Aiming at the problem of abnormal data generated by a power transformer on-line monitoring system due to the influences of transformer operation state change,external environmental interference,communication interruption,and other factors,a method of anomaly recognition and differentiation for monitoring data was proposed.Firstly,the empirical wavelet transform(EWT)and the autoregressive integrated moving average(ARIMA)model were used for time series modelling of monitoring data to obtain the residual sequence reflecting the anomaly monitoring data value,and then the isolation forest algorithm was used to identify the abnormal information,and the monitoring sequence was segmented according to the recognition results.Secondly,the segmented sequence was symbolised by the improved multi-dimensional SAX vector representation method,and the assessment of the anomaly pattern was made by calculating the similarity score of the adjacent symbol vectors,and the monitoring sequence correlation was further used to verify the assessment.Finally,the case study result shows that the proposed method can reliably recognise abnormal data and accurately distinguish between invalid and valid anomaly patterns.
文摘Wholesale and retail markets for electricity and power require consumers to forecast electricity consumption at different time intervals. The study aims to</span><span style="font-family:Verdana;"> increase economic efficiency of the enterprise through the introduction of algorithm for forecasting electric energy consumption unchanged in technological process. Qualitative forecast allows you to essentially reduce costs of electrical </span><span style="font-family:Verdana;">energy, because power cannot be stockpiled. Therefore, when buying excess electrical power, costs can increase either by selling it on the balancing energy </span><span style="font-family:Verdana;">market or by maintaining reserve capacity. If the purchased power is insufficient, the costs increase is due to the purchase of additional capacity. This paper illustrates three methods of forecasting electric energy consumption: autoregressive integrated moving average method, artificial neural networks and classification and regression trees. Actual data from consuming of electrical energy was </span><span style="font-family:Verdana;">used to make day, week and month ahead prediction. The prediction effect of</span><span> </span><span style="font-family:Verdana;">prediction model was proved in Statistica simulation environment. Analysis of estimation of the economic efficiency of prediction methods demonstrated that the use of the artificial neural networks method for short-term forecast </span><span style="font-family:Verdana;">allowed reducing the cost of electricity more efficiently. However, for mid-</span></span><span style="font-family:""> </span><span style="font-family:Verdana;">range predictions, the classification and regression tree was the most efficient method for a Jerky Enterprise. The results indicate that calculation error reduction allows decreases expenses for the purchase of electric energy.
文摘There has been a moderate increase in newly diagnosed HIV-infected Minna populace, which calls for serious attention.<span style="font-family:;" "=""> </span><span style="font-family:Verdana;">This study</span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">used time series data based on monthly HIV cases from January 2007 to December 2018 taken from the statistical data document on HIV prevalence recorded in General Hospital Minna, Niger State.</span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">The methodology employed to analyze the data is base</span><span style="font-family:Verdana;">d</span><span style="font-family:;" "=""><span style="font-family:Verdana;"> on mathematical models of ARMA, ARIMA and SARIMA which were computed and diagnosed. From the results of parameter estimation of </span><span style="font-family:Verdana;">the models, ARMA(2, 1) model was the best model among the other ARMA models using information criteria (AIC). Diagnostic test was run on the ARMA(2, 1) model where the results show that the model was adequate and normally distributed using Box-Lung test and Q</span></span><span style="font-family:Verdana;">-</span><span style="font-family:Verdana;">Q plot respectively. Fur</span><span style="font-family:Verdana;">thermore, ARIMA of first and second differences w</span><span style="font-family:Verdana;">as</span><span style="font-family:Verdana;"> estimated and ARIMA(1,</span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">0,</span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">1) was the best model from the result of the AIC and diagnostic test carried out which revealed that the model was adequate and normally distributed using Box-Lung and Q-Q plot respectively. Furthermore, the results obtained in the ARMA and ARIMA models were used to arrive at a combined model given as ARIMA(1, 0, 1) </span><span style="font-family:;" "=""><span style="font-family:Verdana;">×</span><span><span style="font-family:Verdana;"> SARIMA(1, 0, 1)</span><sub><span style="font-family:Verdana;">12</span></sub></span></span><span style="font-family:;" "=""> </span><span style="font-family:Verdana;">which was subsequently estimated and found to be adequate from the result of the Box-Lung and Q-Q plot respectively. Post forecasting estimation and performance evolution were evaluated using the RMSE and MAE. The results showed that, ARIMA(1, 0, 1) </span><span style="font-family:;" "=""><span style="font-family:Verdana;">×</span><span><span style="font-family:Verdana;"> SARIMA(1, 0, 1)</span><sub><span style="font-family:Verdana;">12</span></sub><span style="font-family:Verdana;"> is the best forecasting model followed by ARIMA(1, 0, 2) on monthly HIV prevalence in Minna, Niger state.</span></span></span>
文摘Air pollution is a severe environmental problem in urban areas.Accurate air quality prediction can help governments and individuals make proper decisions to cope with potential air pollution.As a classic time series forecasting model,the AutoRegressive Integrated Moving Average(ARIMA)has been widely adopted in air quality prediction.However,because of the volatility of air quality and the lack of additional context information,i.e.,the spatial relationships among monitor stations,traditional ARIMA models suffer from unstable prediction performance.Though some deep networks can achieve higher accuracy,a mass of training data,heavy computing,and time cost are required.In this paper,we propose a hybrid model to simultaneously predict seven air pollution indicators from multiple monitoring stations.The proposed model consists of three components:(1)an extended ARIMA to predict matrix series of multiple air quality indicators from several adjacent monitoring stations;(2)the Empirical Mode Decomposition(EMD)to decompose the air quality time series data into multiple smooth sub-series;and(3)the truncated Singular Value Decomposition(SvD)to compress and denoise the expanded matrix.Experimental results on the public dataset show that our proposed model outperforms the state-of-art air quality forecasting models in both accuracy and time cost.
基金supported by the Sichuan Science and Technology Program under Grant 2020JDJQ0037 and 2020YFG0312.
文摘Electricity prices have complex features,such as high frequency,multiple seasonality,and nonlinearity.These factors will make the prediction of electricity prices difficult.However,accurate electricity price prediction is important for energy producers and consumers to develop bidding strategies.To improve the accuracy of prediction by using each algorithms’advantages,this paper proposes a hybrid model that uses the Empirical Mode Decomposition(EMD),Autoregressive Integrated Moving Average(ARIMA),and Temporal Convolutional Network(TCN).EMD is used to decompose the electricity prices into low and high frequency components.Low frequency components are forecasted by the ARIMA model and the high frequency series are predicted by the TCN model.Experimental results using the realistic electricity price data from Pennsylvania-New Jersey-Maryland(PJM)electricity markets show that the proposed method has a higher prediction accuracy than other single methods and hybrid methods.
基金supported by Australian Research Council (ARC)Discovery Project (No.160102571)。
文摘This paper presents a control strategy for residential battery energy storage systems,which is aware of volatile electricity markets and uncertain daily cycling loads.The economic benefits of energy trading for prosumers are achieved through a novel modification of a conventional model predictive control(MPC).The proposed control strategy guarantees an optimal global solution for the applied control action.A new cost function is introduced to model the effects of volatility on customer benefits more effectively.Specifically,the newly presented cost function models a probabilistic relation between the power exchanged with the grid,the net load,and the electricity market.The probabilistic calculation of the cost function shows the dependence on the mathematical expectation of market price and net load.Computational techniques for calculating this value are presented.The proposed strategy differs from the stochastic and robust MPC in that the cost is calculated across the market price and net load variations rather than across model constraints and parameter variations.
文摘Time series forecasting research area mainly focuses on developing effective forecasting models toimprove prediction accuracy. An ensemble model composed of autoregressive integrated movingaverage (ARIMA), artificial neural network (ANN), restricted Boltzmann machines (RBM), anddiscrete wavelet transform (DWT) is presented in this paper. In the proposed model, DWT firstdecomposes time series into approximation and detail. Then Khashei and Bijari's model, which is anensemble model of ARIMA and ANN, is applied to the approximation and detail to extract their bothlinear and nonlinear components and fit the relationship between the components as a function insteadof additive relationship. Furthermore, RBM is used to perform pre-training for generating initialweights and biases based on inputs feature for ANN. Finally, the forecasted approximation and detailare combined to obtain final forecasting. The forecasting capability of the proposed model is testedwith three well-known time series: sunspot, Canadian lynx, exchange rate time series. The predictionperformance is compared to the other six forecasting models. The results indicate that the proposedmodel gives the best performance in all three data sets and all three measures (i.e. MSE, MAE andMAPE).
基金the National Key project of Scientific and Technical Supporting Programs of China (No. 2009BAH39B03)the National Natural Science Foundation of China (No. 61072060)+4 种基金the National High-Tech Research and Development (863) Program of China (No. 2011AA100706)the Program for New Century Excellent Talents in University (No. NECET-08-0738)the Research Fund for the Doctoral Program of Higher Education (No. 20110005120007)the Co-construction Program with Beijing Municipal Commission of EducationEngineering Research Center of Information Networks, Ministry of Education
文摘Traffic prediction plays an integral role in telecommunication network planning and network optimization. In this paper, we investigate the traffic forecasting for data services in 3G mobile networks. Although the Box-Jenkins model has been proven to be appropriate for voice traffic (since the arrival of calls follows a Poisson distribution), it has been demonstrated that the Internet traffic exhibits statistical self-similarity and has to be modeled using the Fractional AutoRegressive Integrated Moving Average (FARIMA) process. However, a few studies have concluded that the FARIMA process may fail in modeling the Internet traffic. To this end, we conducted experiments on the modeling of benchmark Internet traffic and found that the FARIMA process fails because of the significant multifractal characteristic inherent in the traffic series. Thereafter, we investigate the traffic series of data services in a 3G mobile network from a province in China. Rich multifractal spectra are found in this series. Based on this observation, an integrated method combining the AutoRegressive Moving Average (ARMA) and FARIMA processes is applied. The obtained experimental results verify the effectiveness of the integrated prediction method.
基金supported by“the Fundamental Research Funds for the Central Universities(2015XS72).”。
文摘Purpose-The purpose of this paper is to propose a data prediction framework for scenarios which require forecasting demand for large-scale data sources,e.g.,sensor networks,securities exchange,electric power secondary system,etc.Concretely,the proposed framework should handle several difficult requirements including the management of gigantic data sources,the need for a fast self-adaptive algorithm,the relatively accurate prediction of multiple time series,and the real-time demand.Design/methodology/approach-First,the autoregressive integrated moving average-based prediction algorithm is introduced.Second,the processing framework is designed,which includes a time-series data storage model based on the HBase,and a real-time distributed prediction platform based on Storm.Then,the work principle of this platform is described.Finally,a proof-of-concept testbed is illustrated to verify the proposed framework.Findings-Several tests based on Power Grid monitoring data are provided for the proposed framework.The experimental results indicate that prediction data are basically consistent with actual data,processing efficiency is relatively high,and resources consumption is reasonable.Originality/value-This paper provides a distributed real-time data prediction framework for large-scale time-series data,which can exactly achieve the requirement of the effective management,prediction efficiency,accuracy,and high concurrency for massive data sources.
基金supported by the National Key Technologies R&D program of China during the 11th Five-Year Plan Period (No.2006BAH02A03).
文摘The mode of telecommunication network management is changing from“network oriented”to“subscriber oriented”.Aimed at enhancing subscribers’feeling,proactive performance monitoring(PPM)can enable a fast fault correction by detecting anomalies designating performance degradation.In this paper,a novel anomaly detection approach is the proposed taking advantage of time series prediction and the associated confidence interval based on multiplicative autoregressive integrated moving average(ARIMA).Furthermore,under the assumption that the training residual is a white noise process following a normal distribution,the associated confidence interval of prediction can be figured out under any given confidence degree 1–αby constructing random variables satisfying t distribution.Experimental results verify the method’s effectiveness.