The increasing penetration rate of electric kickboard vehicles has been popularized and promoted primarily because of its clean and efficient features.Electric kickboards are gradually growing in popularity in tourist...The increasing penetration rate of electric kickboard vehicles has been popularized and promoted primarily because of its clean and efficient features.Electric kickboards are gradually growing in popularity in tourist and education-centric localities.In the upcoming arrival of electric kickboard vehicles,deploying a customer rental service is essential.Due to its freefloating nature,the shared electric kickboard is a common and practical means of transportation.Relocation plans for shared electric kickboards are required to increase the quality of service,and forecasting demand for their use in a specific region is crucial.Predicting demand accurately with small data is troublesome.Extensive data is necessary for training machine learning algorithms for effective prediction.Data generation is a method for expanding the amount of data that will be further accessible for training.In this work,we proposed a model that takes time-series customers’electric kickboard demand data as input,pre-processes it,and generates synthetic data according to the original data distribution using generative adversarial networks(GAN).The electric kickboard mobility demand prediction error was reduced when we combined synthetic data with the original data.We proposed Tabular-GAN-Modified-WGAN-GP for generating synthetic data for better prediction results.We modified The Wasserstein GAN-gradient penalty(GP)with the RMSprop optimizer and then employed Spectral Normalization(SN)to improve training stability and faster convergence.Finally,we applied a regression-based blending ensemble technique that can help us to improve performance of demand prediction.We used various evaluation criteria and visual representations to compare our proposed model’s performance.Synthetic data generated by our suggested GAN model is also evaluated.The TGAN-Modified-WGAN-GP model mitigates the overfitting and mode collapse problem,and it also converges faster than previous GAN models for synthetic data creation.The presented model’s performance is compared to existing ensemble and baseline models.The experimental findings imply that combining synthetic and actual data can significantly reduce prediction error rates in the mean absolute percentage error(MAPE)of 4.476 and increase prediction accuracy.展开更多
Sarcasm detection in text data is an increasingly vital area of research due to the prevalence of sarcastic content in online communication.This study addresses challenges associated with small datasets and class imba...Sarcasm detection in text data is an increasingly vital area of research due to the prevalence of sarcastic content in online communication.This study addresses challenges associated with small datasets and class imbalances in sarcasm detection by employing comprehensive data pre-processing and Generative Adversial Network(GAN)based augmentation on diverse datasets,including iSarcasm,SemEval-18,and Ghosh.This research offers a novel pipeline for augmenting sarcasm data with Reverse Generative Adversarial Network(RGAN).The proposed RGAN method works by inverting labels between original and synthetic data during the training process.This inversion of labels provides feedback to the generator for generating high-quality data closely resembling the original distribution.Notably,the proposed RGAN model exhibits performance on par with standard GAN,showcasing its robust efficacy in augmenting text data.The exploration of various datasets highlights the nuanced impact of augmentation on model performance,with cautionary insights into maintaining a delicate balance between synthetic and original data.The methodological framework encompasses comprehensive data pre-processing and GAN-based augmentation,with a meticulous comparison against Natural Language Processing Augmentation(NLPAug)as an alternative augmentation technique.Overall,the F1-score of our proposed technique outperforms that of the synonym replacement augmentation technique using NLPAug.The increase in F1-score in experiments using RGAN ranged from 0.066%to 1.054%,and the use of standard GAN resulted in a 2.88%increase in F1-score.The proposed RGAN model outperformed the NLPAug method and demonstrated comparable performance to standard GAN,emphasizing its efficacy in text data augmentation.展开更多
随着可再生能源逐步渗透,电力系统随机性不断加强,其不确定性为调度、规划、运行带来了更大的挑战,因此需要研究针对不确定进行建模的方法。提出一种基于生成对抗网络的负荷场景随机生成方法,该方法基于深度卷积生成对抗网络架构,以JS...随着可再生能源逐步渗透,电力系统随机性不断加强,其不确定性为调度、规划、运行带来了更大的挑战,因此需要研究针对不确定进行建模的方法。提出一种基于生成对抗网络的负荷场景随机生成方法,该方法基于深度卷积生成对抗网络架构,以JS散度作为目标函数,对生成器以及判别器交替进行训练。针对生成负荷序列质量的衡量,从数据多样性以及锐度2个方面,提出TSTR(train on synthetic test on real)以及TRTS(test on real train on synthetic)2个指标,基于支撑向量回归模型进行判断,实验结果表明,随着训练的进行,生成器产生的数据质量逐渐提高,且当训练完成时可以产生满足多样性以及锐度要求的数据。展开更多
The production of true color images requires observational data in the red,green,and blue(RGB)bands.The Advanced Geostationary Radiation Imager(AGRI)onboard China’s Fengyun-4(FY-4)series of geostationary satellites o...The production of true color images requires observational data in the red,green,and blue(RGB)bands.The Advanced Geostationary Radiation Imager(AGRI)onboard China’s Fengyun-4(FY-4)series of geostationary satellites only has blue and red bands,and we therefore have to synthesize a green band to produce RGB true color images.We used random forest regression and conditional generative adversarial networks to train the green band model using Himawari-8 Advanced Himawari Imager data.The model was then used to simulate the green channel reflectance of the FY-4 AGRI.A single-scattering radiative transfer model was used to eliminate the contribution of Rayleigh scattering from the atmosphere and a logarithmic enhancement was applied to process the true color image.The conditional generative adversarial network model was better than random forest regression for the green band model in terms of statistical significance(e.g.,a higher determination coefficient,peak signal-to-noise ratio,and structural similarity index).The sharpness of the images was significantly improved after applying a correction for Rayleigh scattering,and the images were able to show natural phenomena more vividly.The AGRI true color images could be used to monitor dust storms,forest fires,typhoons,volcanic eruptions,and other natural events.展开更多
基金This work was supported by Korea Institute for Advancement of Technology(KIAT)grant funded by the Korea Government(MOTIE)(P0016977,The Establishment Project of Industry-University Fusion District).
文摘The increasing penetration rate of electric kickboard vehicles has been popularized and promoted primarily because of its clean and efficient features.Electric kickboards are gradually growing in popularity in tourist and education-centric localities.In the upcoming arrival of electric kickboard vehicles,deploying a customer rental service is essential.Due to its freefloating nature,the shared electric kickboard is a common and practical means of transportation.Relocation plans for shared electric kickboards are required to increase the quality of service,and forecasting demand for their use in a specific region is crucial.Predicting demand accurately with small data is troublesome.Extensive data is necessary for training machine learning algorithms for effective prediction.Data generation is a method for expanding the amount of data that will be further accessible for training.In this work,we proposed a model that takes time-series customers’electric kickboard demand data as input,pre-processes it,and generates synthetic data according to the original data distribution using generative adversarial networks(GAN).The electric kickboard mobility demand prediction error was reduced when we combined synthetic data with the original data.We proposed Tabular-GAN-Modified-WGAN-GP for generating synthetic data for better prediction results.We modified The Wasserstein GAN-gradient penalty(GP)with the RMSprop optimizer and then employed Spectral Normalization(SN)to improve training stability and faster convergence.Finally,we applied a regression-based blending ensemble technique that can help us to improve performance of demand prediction.We used various evaluation criteria and visual representations to compare our proposed model’s performance.Synthetic data generated by our suggested GAN model is also evaluated.The TGAN-Modified-WGAN-GP model mitigates the overfitting and mode collapse problem,and it also converges faster than previous GAN models for synthetic data creation.The presented model’s performance is compared to existing ensemble and baseline models.The experimental findings imply that combining synthetic and actual data can significantly reduce prediction error rates in the mean absolute percentage error(MAPE)of 4.476 and increase prediction accuracy.
文摘Sarcasm detection in text data is an increasingly vital area of research due to the prevalence of sarcastic content in online communication.This study addresses challenges associated with small datasets and class imbalances in sarcasm detection by employing comprehensive data pre-processing and Generative Adversial Network(GAN)based augmentation on diverse datasets,including iSarcasm,SemEval-18,and Ghosh.This research offers a novel pipeline for augmenting sarcasm data with Reverse Generative Adversarial Network(RGAN).The proposed RGAN method works by inverting labels between original and synthetic data during the training process.This inversion of labels provides feedback to the generator for generating high-quality data closely resembling the original distribution.Notably,the proposed RGAN model exhibits performance on par with standard GAN,showcasing its robust efficacy in augmenting text data.The exploration of various datasets highlights the nuanced impact of augmentation on model performance,with cautionary insights into maintaining a delicate balance between synthetic and original data.The methodological framework encompasses comprehensive data pre-processing and GAN-based augmentation,with a meticulous comparison against Natural Language Processing Augmentation(NLPAug)as an alternative augmentation technique.Overall,the F1-score of our proposed technique outperforms that of the synonym replacement augmentation technique using NLPAug.The increase in F1-score in experiments using RGAN ranged from 0.066%to 1.054%,and the use of standard GAN resulted in a 2.88%increase in F1-score.The proposed RGAN model outperformed the NLPAug method and demonstrated comparable performance to standard GAN,emphasizing its efficacy in text data augmentation.
文摘随着可再生能源逐步渗透,电力系统随机性不断加强,其不确定性为调度、规划、运行带来了更大的挑战,因此需要研究针对不确定进行建模的方法。提出一种基于生成对抗网络的负荷场景随机生成方法,该方法基于深度卷积生成对抗网络架构,以JS散度作为目标函数,对生成器以及判别器交替进行训练。针对生成负荷序列质量的衡量,从数据多样性以及锐度2个方面,提出TSTR(train on synthetic test on real)以及TRTS(test on real train on synthetic)2个指标,基于支撑向量回归模型进行判断,实验结果表明,随着训练的进行,生成器产生的数据质量逐渐提高,且当训练完成时可以产生满足多样性以及锐度要求的数据。
基金Supported by the National Key Research and Development Program of China(2018YFC150650)National Satellite Meteorological Center Mountain Flood Geological Disaster Prevention Meteorological Guarantee Project 2020 Construction Project(IN_JS_202004)。
文摘The production of true color images requires observational data in the red,green,and blue(RGB)bands.The Advanced Geostationary Radiation Imager(AGRI)onboard China’s Fengyun-4(FY-4)series of geostationary satellites only has blue and red bands,and we therefore have to synthesize a green band to produce RGB true color images.We used random forest regression and conditional generative adversarial networks to train the green band model using Himawari-8 Advanced Himawari Imager data.The model was then used to simulate the green channel reflectance of the FY-4 AGRI.A single-scattering radiative transfer model was used to eliminate the contribution of Rayleigh scattering from the atmosphere and a logarithmic enhancement was applied to process the true color image.The conditional generative adversarial network model was better than random forest regression for the green band model in terms of statistical significance(e.g.,a higher determination coefficient,peak signal-to-noise ratio,and structural similarity index).The sharpness of the images was significantly improved after applying a correction for Rayleigh scattering,and the images were able to show natural phenomena more vividly.The AGRI true color images could be used to monitor dust storms,forest fires,typhoons,volcanic eruptions,and other natural events.