The quantitative precipitation forecast(QPF)performance by numerical weather prediction(NWP)methods depends fundamentally on the adopted physical parameterization schemes(PS).However,due to the complexity of the physi...The quantitative precipitation forecast(QPF)performance by numerical weather prediction(NWP)methods depends fundamentally on the adopted physical parameterization schemes(PS).However,due to the complexity of the physical mechanisms of precipitation processes,the uncertainties of PSs result in a lower QPF performance than their prediction of the basic meteorological variables such as air temperature,wind,geopotential height,and humidity.This study proposes a deep learning model named QPFNet,which uses basic meteorological variables in the ERA5 dataset by fitting a non-linear mapping relationship between the basic variables and precipitation.Basic variables forecasted by the highest-resolution model(HRES)of the European Centre for Medium-Range Weather Forecasts(ECMWF)were fed into QPFNet to forecast precipitation.Evaluation results show that QPFNet achieved better QPF performance than ECMWF HRES itself.The threat score for 3-h accumulated precipitation with depths of 0.1,3,10,and 20 mm increased by 19.7%,15.2%,43.2%,and 87.1%,respectively,indicating the proposed performance QPFNet improved with increasing levels of precipitation.The sensitivities of these meteorological variables for QPF in different pressure layers were analyzed based on the output of the QPFNet,and its performance limitations are also discussed.Using DL to extract features from basic meteorological variables can provide an important reference for QPF,and avoid some uncertainties of PSs.展开更多
An intrusion detection system collects and analyzes information from different areas within a computer or a network to identify possible security threats that include threats from both outside as well as inside of the...An intrusion detection system collects and analyzes information from different areas within a computer or a network to identify possible security threats that include threats from both outside as well as inside of the organization. It deals with large amount of data, which contains various ir-relevant and redundant features and results in increased processing time and low detection rate. Therefore, feature selection should be treated as an indispensable pre-processing step to improve the overall system performance significantly while mining on huge datasets. In this context, in this paper, we focus on a two-step approach of feature selection based on Random Forest. The first step selects the features with higher variable importance score and guides the initialization of search process for the second step whose outputs the final feature subset for classification and in-terpretation. The effectiveness of this algorithm is demonstrated on KDD’99 intrusion detection datasets, which are based on DARPA 98 dataset, provides labeled data for researchers working in the field of intrusion detection. The important deficiency in the KDD’99 data set is the huge number of redundant records as observed earlier. Therefore, we have derived a data set RRE-KDD by eliminating redundant record from KDD’99 train and test dataset, so the classifiers and feature selection method will not be biased towards more frequent records. This RRE-KDD consists of both KDD99Train+ and KDD99Test+ dataset for training and testing purposes, respectively. The experimental results show that the Random Forest based proposed approach can select most im-portant and relevant features useful for classification, which, in turn, reduces not only the number of input features and time but also increases the classification accuracy.展开更多
基金the financial support of the National Key Research and Development Program (Grant No. 2017YFC1502000)the National Natural Science Foundation of China (Key Program, 91937301)
文摘The quantitative precipitation forecast(QPF)performance by numerical weather prediction(NWP)methods depends fundamentally on the adopted physical parameterization schemes(PS).However,due to the complexity of the physical mechanisms of precipitation processes,the uncertainties of PSs result in a lower QPF performance than their prediction of the basic meteorological variables such as air temperature,wind,geopotential height,and humidity.This study proposes a deep learning model named QPFNet,which uses basic meteorological variables in the ERA5 dataset by fitting a non-linear mapping relationship between the basic variables and precipitation.Basic variables forecasted by the highest-resolution model(HRES)of the European Centre for Medium-Range Weather Forecasts(ECMWF)were fed into QPFNet to forecast precipitation.Evaluation results show that QPFNet achieved better QPF performance than ECMWF HRES itself.The threat score for 3-h accumulated precipitation with depths of 0.1,3,10,and 20 mm increased by 19.7%,15.2%,43.2%,and 87.1%,respectively,indicating the proposed performance QPFNet improved with increasing levels of precipitation.The sensitivities of these meteorological variables for QPF in different pressure layers were analyzed based on the output of the QPFNet,and its performance limitations are also discussed.Using DL to extract features from basic meteorological variables can provide an important reference for QPF,and avoid some uncertainties of PSs.
文摘An intrusion detection system collects and analyzes information from different areas within a computer or a network to identify possible security threats that include threats from both outside as well as inside of the organization. It deals with large amount of data, which contains various ir-relevant and redundant features and results in increased processing time and low detection rate. Therefore, feature selection should be treated as an indispensable pre-processing step to improve the overall system performance significantly while mining on huge datasets. In this context, in this paper, we focus on a two-step approach of feature selection based on Random Forest. The first step selects the features with higher variable importance score and guides the initialization of search process for the second step whose outputs the final feature subset for classification and in-terpretation. The effectiveness of this algorithm is demonstrated on KDD’99 intrusion detection datasets, which are based on DARPA 98 dataset, provides labeled data for researchers working in the field of intrusion detection. The important deficiency in the KDD’99 data set is the huge number of redundant records as observed earlier. Therefore, we have derived a data set RRE-KDD by eliminating redundant record from KDD’99 train and test dataset, so the classifiers and feature selection method will not be biased towards more frequent records. This RRE-KDD consists of both KDD99Train+ and KDD99Test+ dataset for training and testing purposes, respectively. The experimental results show that the Random Forest based proposed approach can select most im-portant and relevant features useful for classification, which, in turn, reduces not only the number of input features and time but also increases the classification accuracy.