Coronavirus disease 2019(COVID-19)is an emerging infectious disease,and it is important to detect early and monitor the disease trend for policymakers to make informed decisions.We explored the predictive utility of B...Coronavirus disease 2019(COVID-19)is an emerging infectious disease,and it is important to detect early and monitor the disease trend for policymakers to make informed decisions.We explored the predictive utility of Baidu Search Index and Baidu Information Index for early warning of COVID-19 and identified search keywords for further monitoring of epidemic trends in Guangxi.A time-series analysis and Spearman correlation between the daily number of cases and both the Baidu Search Index and Baidu Information Index were performed for seven keywords related to COVID-19 from January 8 to March 9,2020.The time series showed that the temporal distributions of the search terms“coronavirus,”“pneumonia”and“mask”in the Baidu Search Index were consistent and had 2 to 3 days'lead time to the reported cases;the correlation coefficients were higher than 0.81.The Baidu Search Index volume in 14 prefectures of Guangxi was closely related with the number of reported cases;it was not associated with the local GDP.The Baidu Information Index search terms“coronavirus”and“pneumonia”were used as frequently as 192,405.0 and 110,488.6 per million population,respectively,and they were also significantly associated with the number of reported cases(rs>0.6),but they fluctuated more than for the Baidu Search Index and had 0 to 14 days'lag time to the reported cases.The Baidu Search Index with search terms“coronavirus,”“pneumonia”and“mask”can be used for early warning and monitoring of the epidemic trend of COVID-19 in Guangxi,with 2 to 3 days'lead time.展开更多
Influenza is a kind of infectious disease, which spreads quickly and widely. The outbreak of influenza has brought huge losses to society. In this paper, four major categories of flu keywords, “prevention phase”, “...Influenza is a kind of infectious disease, which spreads quickly and widely. The outbreak of influenza has brought huge losses to society. In this paper, four major categories of flu keywords, “prevention phase”, “symptom phase”, “treatment phase”, and “commonly-used phrase” were set. Python web crawler was used to obtain relevant influenza data from the National Influenza Center’s influenza surveillance weekly report and Baidu Index. The establishment of support vector regression (SVR), least absolute shrinkage and selection operator (LASSO), convolutional neural networks (CNN) prediction models through machine learning, took into account the seasonal characteristics of the influenza, also established the time series model (ARMA). The results show that, it is feasible to predict influenza based on web search data. Machine learning shows a certain forecast effect in the prediction of influenza based on web search data. In the future, it will have certain reference value in influenza prediction. The ARMA(3,0) model predicts better results and has greater generalization. Finally, the lack of research in this paper and future research directions are given.展开更多
基金supported by the Health and Emergency Skills Training Center of Guangxi(HESTCG202104)National Natural Science Foundation of China(11971479)Guangxi Bagui Honor Scholarship and Chinese State Key Laboratory of Infectious Disease Prevention and Control.
文摘Coronavirus disease 2019(COVID-19)is an emerging infectious disease,and it is important to detect early and monitor the disease trend for policymakers to make informed decisions.We explored the predictive utility of Baidu Search Index and Baidu Information Index for early warning of COVID-19 and identified search keywords for further monitoring of epidemic trends in Guangxi.A time-series analysis and Spearman correlation between the daily number of cases and both the Baidu Search Index and Baidu Information Index were performed for seven keywords related to COVID-19 from January 8 to March 9,2020.The time series showed that the temporal distributions of the search terms“coronavirus,”“pneumonia”and“mask”in the Baidu Search Index were consistent and had 2 to 3 days'lead time to the reported cases;the correlation coefficients were higher than 0.81.The Baidu Search Index volume in 14 prefectures of Guangxi was closely related with the number of reported cases;it was not associated with the local GDP.The Baidu Information Index search terms“coronavirus”and“pneumonia”were used as frequently as 192,405.0 and 110,488.6 per million population,respectively,and they were also significantly associated with the number of reported cases(rs>0.6),but they fluctuated more than for the Baidu Search Index and had 0 to 14 days'lag time to the reported cases.The Baidu Search Index with search terms“coronavirus,”“pneumonia”and“mask”can be used for early warning and monitoring of the epidemic trend of COVID-19 in Guangxi,with 2 to 3 days'lead time.
文摘Influenza is a kind of infectious disease, which spreads quickly and widely. The outbreak of influenza has brought huge losses to society. In this paper, four major categories of flu keywords, “prevention phase”, “symptom phase”, “treatment phase”, and “commonly-used phrase” were set. Python web crawler was used to obtain relevant influenza data from the National Influenza Center’s influenza surveillance weekly report and Baidu Index. The establishment of support vector regression (SVR), least absolute shrinkage and selection operator (LASSO), convolutional neural networks (CNN) prediction models through machine learning, took into account the seasonal characteristics of the influenza, also established the time series model (ARMA). The results show that, it is feasible to predict influenza based on web search data. Machine learning shows a certain forecast effect in the prediction of influenza based on web search data. In the future, it will have certain reference value in influenza prediction. The ARMA(3,0) model predicts better results and has greater generalization. Finally, the lack of research in this paper and future research directions are given.