Nowadays,air pollution is a big environmental problem in develop-ing countries.In this problem,particulate matter 2.5(PM2.5)in the air is an air pollutant.When its concentration in the air is high in developing countr...Nowadays,air pollution is a big environmental problem in develop-ing countries.In this problem,particulate matter 2.5(PM2.5)in the air is an air pollutant.When its concentration in the air is high in developing countries like Vietnam,it will harm everyone’s health.Accurate prediction of PM2.5 concentrations can help to make the correct decision in protecting the health of the citizen.This study develops a hybrid deep learning approach named PM25-CBL model for PM2.5 concentration prediction in Ho Chi Minh City,Vietnam.Firstly,this study analyzes the effects of variables on PM2.5 concentrations in Air Quality HCMC dataset.Only variables that affect the results will be selected for PM2.5 concentration prediction.Secondly,an efficient PM25-CBL model that integrates a convolutional neural network(CNN)andBidirectionalLongShort-TermMemory(Bi-LSTM)isdeveloped.This model consists of three following modules:CNN,Bi-LSTM,and Fully connected modules.Finally,this study conducts the experiment to compare the performance of our approach and several state-of-the-art deep learning models for time series prediction such as LSTM,Bi-LSTM,the combination of CNN and LSTM(CNN-LSTM),and ARIMA.The empirical results confirm that PM25-CBL model outperforms other methods for Air Quality HCMC dataset in terms of several metrics including Mean Squared Error(MSE),Root Mean Squared Error(RMSE),Mean Absolute Error(MAE),and Mean Absolute Percentage Error(MAPE).展开更多
Text summarization aims to generate a concise version of the original text.The longer the summary text is,themore detailed it will be fromthe original text,and this depends on the intended use.Therefore,the problem of...Text summarization aims to generate a concise version of the original text.The longer the summary text is,themore detailed it will be fromthe original text,and this depends on the intended use.Therefore,the problem of generating summary texts with desired lengths is a vital task to put the research into practice.To solve this problem,in this paper,we propose a new method to integrate the desired length of the summarized text into the encoder-decoder model for the abstractive text summarization problem.This length parameter is integrated into the encoding phase at each self-attention step and the decoding process by preserving the remaining length for calculating headattention in the generation process and using it as length embeddings added to theword embeddings.We conducted experiments for the proposed model on the two data sets,Cable News Network(CNN)Daily and NEWSROOM,with different desired output lengths.The obtained results show the proposed model’s effectiveness compared with related studies.展开更多
文摘Nowadays,air pollution is a big environmental problem in develop-ing countries.In this problem,particulate matter 2.5(PM2.5)in the air is an air pollutant.When its concentration in the air is high in developing countries like Vietnam,it will harm everyone’s health.Accurate prediction of PM2.5 concentrations can help to make the correct decision in protecting the health of the citizen.This study develops a hybrid deep learning approach named PM25-CBL model for PM2.5 concentration prediction in Ho Chi Minh City,Vietnam.Firstly,this study analyzes the effects of variables on PM2.5 concentrations in Air Quality HCMC dataset.Only variables that affect the results will be selected for PM2.5 concentration prediction.Secondly,an efficient PM25-CBL model that integrates a convolutional neural network(CNN)andBidirectionalLongShort-TermMemory(Bi-LSTM)isdeveloped.This model consists of three following modules:CNN,Bi-LSTM,and Fully connected modules.Finally,this study conducts the experiment to compare the performance of our approach and several state-of-the-art deep learning models for time series prediction such as LSTM,Bi-LSTM,the combination of CNN and LSTM(CNN-LSTM),and ARIMA.The empirical results confirm that PM25-CBL model outperforms other methods for Air Quality HCMC dataset in terms of several metrics including Mean Squared Error(MSE),Root Mean Squared Error(RMSE),Mean Absolute Error(MAE),and Mean Absolute Percentage Error(MAPE).
基金funded by Vietnam National Foundation for Science and Technology Development(NAFOSTED)under Grant Number 102.05-2020.26。
文摘Text summarization aims to generate a concise version of the original text.The longer the summary text is,themore detailed it will be fromthe original text,and this depends on the intended use.Therefore,the problem of generating summary texts with desired lengths is a vital task to put the research into practice.To solve this problem,in this paper,we propose a new method to integrate the desired length of the summarized text into the encoder-decoder model for the abstractive text summarization problem.This length parameter is integrated into the encoding phase at each self-attention step and the decoding process by preserving the remaining length for calculating headattention in the generation process and using it as length embeddings added to theword embeddings.We conducted experiments for the proposed model on the two data sets,Cable News Network(CNN)Daily and NEWSROOM,with different desired output lengths.The obtained results show the proposed model’s effectiveness compared with related studies.