Influenza is a kind of infectious disease, which spreads quickly and widely. The outbreak of influenza has brought huge losses to society. In this paper, four major categories of flu keywords, “prevention phase”, “...Influenza is a kind of infectious disease, which spreads quickly and widely. The outbreak of influenza has brought huge losses to society. In this paper, four major categories of flu keywords, “prevention phase”, “symptom phase”, “treatment phase”, and “commonly-used phrase” were set. Python web crawler was used to obtain relevant influenza data from the National Influenza Center’s influenza surveillance weekly report and Baidu Index. The establishment of support vector regression (SVR), least absolute shrinkage and selection operator (LASSO), convolutional neural networks (CNN) prediction models through machine learning, took into account the seasonal characteristics of the influenza, also established the time series model (ARMA). The results show that, it is feasible to predict influenza based on web search data. Machine learning shows a certain forecast effect in the prediction of influenza based on web search data. In the future, it will have certain reference value in influenza prediction. The ARMA(3,0) model predicts better results and has greater generalization. Finally, the lack of research in this paper and future research directions are given.展开更多
Web search query data are obtained to reflect social spots and serve as novel economic indicators. When faced with high-dimensional query data, selecting keywords that have plausible predictive ability and can reduce ...Web search query data are obtained to reflect social spots and serve as novel economic indicators. When faced with high-dimensional query data, selecting keywords that have plausible predictive ability and can reduce dimensionality is critical. This paper presents a new integrative method that combines Hurst Exponent (HE) and Time Difference Correlation (TDC) analysis to select keywords with powerful predictive ability. The method is called the HE-TDC screening method and requires keywords with predictive ability to satisfy two characteristics, namely, high correlation and fluctuation memorability similar to the predicting target series. An empirical study is employed to predict the volume of tourism visitors in the Jiuzhai Valley scenic area. The study shows that keywords selected using HE-TDC method produce a model with better robustness and predictive ability.展开更多
As a new knowledge mining means,Web mining provides a new solution for the utilization of Web information resources This article describes the application of Web mining technologies in search engines,and discusses how...As a new knowledge mining means,Web mining provides a new solution for the utilization of Web information resources This article describes the application of Web mining technologies in search engines,and discusses how to mine the latest technologies for search engines so as to improve their retrieving展开更多
为了更好地解决分布式遥感数据检索的问题,提出了基于REST Web Services的分布式检索实现方法。根据分布式数据检索的需求和遥感数据检索系统的特点,研究了分布式数据检索的实现技术、REST的基本概念、目标与设计原则、主要思想以及实...为了更好地解决分布式遥感数据检索的问题,提出了基于REST Web Services的分布式检索实现方法。根据分布式数据检索的需求和遥感数据检索系统的特点,研究了分布式数据检索的实现技术、REST的基本概念、目标与设计原则、主要思想以及实现方式,在此基本上设计了基于REST Web Services分布式遥感数据检索原型系统,并实现了基于REST Web Services分布式遥感数据检索原型系统。展开更多
文摘Influenza is a kind of infectious disease, which spreads quickly and widely. The outbreak of influenza has brought huge losses to society. In this paper, four major categories of flu keywords, “prevention phase”, “symptom phase”, “treatment phase”, and “commonly-used phrase” were set. Python web crawler was used to obtain relevant influenza data from the National Influenza Center’s influenza surveillance weekly report and Baidu Index. The establishment of support vector regression (SVR), least absolute shrinkage and selection operator (LASSO), convolutional neural networks (CNN) prediction models through machine learning, took into account the seasonal characteristics of the influenza, also established the time series model (ARMA). The results show that, it is feasible to predict influenza based on web search data. Machine learning shows a certain forecast effect in the prediction of influenza based on web search data. In the future, it will have certain reference value in influenza prediction. The ARMA(3,0) model predicts better results and has greater generalization. Finally, the lack of research in this paper and future research directions are given.
文摘Web search query data are obtained to reflect social spots and serve as novel economic indicators. When faced with high-dimensional query data, selecting keywords that have plausible predictive ability and can reduce dimensionality is critical. This paper presents a new integrative method that combines Hurst Exponent (HE) and Time Difference Correlation (TDC) analysis to select keywords with powerful predictive ability. The method is called the HE-TDC screening method and requires keywords with predictive ability to satisfy two characteristics, namely, high correlation and fluctuation memorability similar to the predicting target series. An empirical study is employed to predict the volume of tourism visitors in the Jiuzhai Valley scenic area. The study shows that keywords selected using HE-TDC method produce a model with better robustness and predictive ability.
文摘As a new knowledge mining means,Web mining provides a new solution for the utilization of Web information resources This article describes the application of Web mining technologies in search engines,and discusses how to mine the latest technologies for search engines so as to improve their retrieving
文摘为了更好地解决分布式遥感数据检索的问题,提出了基于REST Web Services的分布式检索实现方法。根据分布式数据检索的需求和遥感数据检索系统的特点,研究了分布式数据检索的实现技术、REST的基本概念、目标与设计原则、主要思想以及实现方式,在此基本上设计了基于REST Web Services分布式遥感数据检索原型系统,并实现了基于REST Web Services分布式遥感数据检索原型系统。