基于生成对抗网络的数据不确定性量化方法

Generative adversarial network based data uncertainty quantification method

下载PDF

导出

摘要针对直接使用高维、高频、含有噪声的现实世界数据进行数据处理时会导致估计器不可靠的问题,提出一种基于生成对抗网络(GAN)的数据不确定性量化方法。首先,通过GAN重构原始数据分布,构建噪声空间到原始数据空间的映射分布;其次,使用马尔可夫链蒙特卡洛(MCMC)方法抽取样本,从而得到基于原始数据分布的新样本;然后,基于指定的函数定义样本的不确定性置信区间;最后,使用置信区间对原始数据进行不确定性估计,并选择置信区间内的数据作为估计器使用的数据。实验结果表明,与使用原始数据相比,使用置信区间内的数据进行估计器训练达到性能上限所需要的样本数减少了50%;同时,对比原始训练数据,置信区间内的数据在达到相同测试精度时所需要的样本数平均降低了30%。 To solve the problem that the direct use of high-dimensional,high-frequency,noise-containing real-world data to perform data processing leads to unreliable estimators,a data uncertainty quantification method based on Generative Adversarial Network(GAN)was proposed.Firstly,the original data distribution was reconstructed by GAN to construct a mapping distribution from the noise space to the space of the original data.Secondly,the samples were extracted by Markov Chain Monte Carlo(MCMC)method to obtain new samples based on the original data distribution.Thirdly,confidence intervals for the uncertainty of the samples were defined based on the specified functions.Finally,the confidence intervals were used to estimate the uncertainty of the original data,and within the data the confidence intervals was selected as the data used by the estimator.Experimental results show that 50%fewer samples are required to train the estimator to reach the upper limit by using the data within the confidence intervals compared to the samples required by using the original data.At the same time,compared to the original data,the data within the confidence intervals requires 30%fewer samples on average to achieve the same test accuracy.

作者王昊王子成张超马韵升 WANG Hao;WANG Zicheng;ZHANG Chao;MA Yunsheng(School of Mathematical Sciences,Dalian University of Technology,Dalian Liaoning 116024,China;Shandong Chambroad Holding Group Company Limited,Binzhou Shandong 256500,China)

机构地区大连理工大学数学科学学院山东京博控股集团有限公司

出处《计算机应用》 CSCD 北大核心 2023年第4期1094-1101,共8页 journal of Computer Applications

基金国家重点研发计划项目(2020YFB1711104)。

关键词生成对抗网络不确定性量化马尔可夫链蒙特卡洛方法置信区间不确定性估计 Generative Adversarial Network(GAN) uncertainty quantification Markov Chain Monte Carlo(MCMC)method confidence interval uncertainty estimation

分类号 TP399 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献15

1翟俊海,张素芳,王聪,沈矗,刘晓萌.基于MapReduce的大数据主动学习[J].计算机应用,2018,38(10):2759-2763. 被引量：5
2Xu Han,Binyang Li,Zhuoran Wang.An Attention-Based Neural Framework for Uncertainty Identification on Social Media Texts[J].Tsinghua Science and Technology,2020,25(1):117-126. 被引量：5
3梁天锡,彭忠明,沈展鹏,徐勇,张元章.基于裕量与不确定性量化的系统可靠性评估[J].科学技术与工程,2017,17(3):121-129. 被引量：4
4熊芬芬,陈江涛,任成坤,张立,李泽贤.不确定性传播的混沌多项式方法研究进展[J].中国舰船研究,2021,16(4):19-36. 被引量：7
5王纲胜,夏军,陈军锋.模型多参数灵敏度与不确定性分析[J].地理研究,2010,29(2):263-270. 被引量：28
6李艳,郭劼,范斌.元学习的不确定性特征构建及初步分析[J].计算机应用,2022,42(2):343-348. 被引量：4
7管其杰,张挺,李德亚,周绍景,杜奕.基于多分辨率生成对抗网络的空间数据不确定性重建方法[J].计算机应用,2021,41(8):2306-2311. 被引量：1
8卜宇,任晓芳,唐学军,孙挺.不确定性估计结合主动外观模型三维特征提取的人脸识别方法[J].计算机应用,2016,36(7):1971-1975. 被引量：1
9夏彬,白宇轩,殷俊杰.基于生成对抗网络的系统日志级异常检测算法[J].计算机应用,2020,40(10):2960-2966. 被引量：10
10孙鹤立,孙玉柱,张晓云.基于生成对抗网络的事件描述生成[J].计算机应用,2021,41(5):1256-1261. 被引量：1

二级参考文献82

1南方哲,钱育蓉,行艳妮,赵京霞.基于深度学习的单图像超分辨率重建研究综述[J].计算机应用研究,2020,37(2):321-326. 被引量：24
2刘全,王瑞利,林忠.非嵌入式多项式混沌方法在拉氏计算中的应用[J].固体力学学报,2013,34(S1):224-233. 被引量：12
3胡旺,李志蜀.一种更简化而高效的粒子群优化算法[J].软件学报,2007,18(4):861-868. 被引量：331
4Gupta Hoshin V, Sorooshian Soroosh, Hogue Terri S,et al. Advances in automatic calibration of watershed Mod- els. In.. Duan Q. et al. (Eds), Calibration of Watershed Models, Water Sci. and Appl. 6, AGU, Washington, DC, 2003, 9-28.
5Schultz G A, Engman E T. (Eds.) Remote Sensing in Hydrology and Water Management. Springer Verlag Berlin Heidelberg, Germany, 2000.
6Rozos Evangelos, Efstratiadis Andreas, Nalbantis Ioannis,et al. Calibration of a semi-distributed model for conjunctive simulation of surface and groundwater flows. Hydrological Sciences Journal, 2004, 49(5) : 819-842.
7Duan Q Y. Global optimization for watershed model calibration. In: Duan Q. et al. (Eds), Calibration of Watershed Models, Water Sci. and Appl. 6, AGU, Washington, DC;, 2003: 89-102.
8Duan Q, Sorooshian S, Gupta V K. Effective and Efficient Global Optimization for Conceptual Rainfall-Runoff Models. WaterResour. Res., 1992, 28(4): 1015-1031.
9Duan Q, Sorooshian S,Gupta V K. Optimal Use of the SCE UA Global Optimization Method for Calibrating Wa- tershed Models. J. of Hydrol. , 1994, 158: 265-284.
10Duan Q, Gupta V K,Sorooshian S. A Shuffled Complex Evolution Approach for Effective and Efficient Global Optimization. J. Optim. Theo. and Its Appl., 1993, 76(3): 501-521.

共引文献74

1Guanlin Zhai,Yan Yang,Heng Wang,Shengdong Du.Multi-Attention Fusion Modeling for Sentiment Analysis of Educational Big Data[J].Big Data Mining and Analytics,2020,3(4):311-319. 被引量：4
2孔凡哲,宋晓猛,占车生,叶爱中.水文模型参数敏感性快速定量评估的RSMSobol方法[J].地理学报,2011,66(9):1270-1280. 被引量：20
3SONG XiaoMeng,ZHAN CheSheng,XIA Jun.Integration of a statistical emulator approach with the SCE-UA method for parameter optimization of a hydrological model[J].Chinese Science Bulletin,2012,57(26):3397-3403. 被引量：13
4杨红,丁骏,王春峰,陈健,刘成秀,戴桂香,赵瀛.象山港围隔生态系水质模型研究[J].海洋科学,2012,36(7):14-22. 被引量：5
5宋晓猛,占车生,夏军.集成统计仿真技术和SCE-UA方法的水文模型参数优化[J].科学通报,2012,57(26):2530-2536. 被引量：5
6陈炳峰,徐岩,于海生,刘春生,曲立才.徐深气田火山岩气藏密井网精细解剖与三维地质建模[J].大庆石油地质与开发,2013,32(1):65-70. 被引量：10
7陈芬,陈兴伟,谢剑斌.HEC-HMS模型次洪模拟的参数敏感性分析及应用[J].水资源与水工程学报,2012,23(5):119-122. 被引量：17
8倪祥龙,康建设,王广彦,白永生.黑箱模型输出不确定性的敏感性分析[J].计算机仿真,2014,31(4):22-26. 被引量：3
9张质明,王晓燕,李明涛.基于全局敏感性分析方法的WASP模型不确定性分析[J].中国环境科学,2014,34(5):1336-1346. 被引量：21
10王瑞利,江松.多物理耦合非线性偏微分方程与数值解不确定度量化数学方法[J].中国科学：数学,2015,45(6):723-738. 被引量：19

1杨璐.小儿手足口病感染防控中应用个性化护理配合健康教育措施的重要价值[J].中文科技期刊数据库（全文版）医药卫生,2022(1):0199-0202.
2刘竹.全球碳排放的近实时定量方法[J].科学通报,2023,68(7):830-840. 被引量：5
3马梦宇,胡春玲.参数边缘耦合条件下的基因调控网络建模研究[J].软件工程,2023,26(4):24-27.
4曲树青.系统护理干预在病毒性脑炎患儿治疗中的应用[J].中文科技期刊数据库（全文版）医药卫生,2021(1):0190-0191.
5邓晓辉.腹腔镜与结肠镜双镜联合在结直肠息肉切除术中的应用效果[J].中文科技期刊数据库（全文版）医药卫生,2020(11):0225-0225.
6张冠宇,曹鸿猷,杨宏印,陈渝鹏.基于改进多保真度模型的T梁桥自振频率的不确定性量化方法研究[J].武汉理工大学学报,2023,45(2):66-74.
7郝云权,赵大志,李伟斌,孔满昭,刘森云.POD-BPNN预测模型及结冰条件不确定性量化[J].南京航空航天大学学报,2023,55(2):302-310.
8刘凡,李利祥,赵岩.移动荷载作用下具有不确定参数桥梁动力响应分析[J].应用数学和力学,2023,44(3):241-247.
9江淇.重型颅脑损伤患者脑脊液乳酸与血清IL-6的相关性探究[J].中文科技期刊数据库（全文版）医药卫生,2020(10):0030-0031.
10李玉,楚武利,姬田园.叶片安装角偏差对动叶性能影响的不确定性研究[J].西安交通大学学报,2023,57(4):49-59. 被引量：1

计算机应用

2023年第4期

浏览历史

内容加载中请稍等...

基于生成对抗网络的数据不确定性量化方法

参考文献15

二级参考文献82

共引文献74

相关作者

相关机构

相关主题

浏览历史