摘要
针对小样本数据在深度学习中训练难的问题,为提高DCGAN训练效率,提出了一种改进的DCGAN算法对小样本数据进行增强。首先,使用Wasserstein距离替换原模型中的损失模型;其次,在生成网络和判别网络中加入谱归一化,以得到稳定的网络结构;最后,通过极大似然估计算法和实验估算得到样本的最佳噪声输入维度,从而提高生成样本的多样性。在MNIST、Celeb A和Cartoon这三个数据集上的实验结果表明:改进后的DCGAN所生成样本的清晰度以及识别率比改进前均得到了明显提高,其中平均识别率在这几个数据集上分别提高了8.1%、16.4%和16.7%,几种清晰度评价指标在各数据集上均有不同程度的提高。可见该方法能够有效地实现小样本数据增强。
In order to solve the training difficulty of small sample data in deep learning and increase the training efficiency of DCGAN(Deep Convolutional Generative Adversarial Network),an improved DCGAN algorithm was proposed to perform the augmentation of small sample data.In the method,Wasserstein distance was used to replace the loss model in the original model at first.Then,spectral normalization was added in the generation network,and discrimination network to acquire a stable network structure.Finally,the optimal noise input dimension of sample was obtained by the maximum likelihood estimation and experimental estimation,so that the generated samples became more diversified.Experimental result on three datasets MNIST,CelebA and Cartoon indicated that the improved DCGAN could generate samples with higher definition and recognition rate compared to that before improvement.In particular,the average recognition rate on these datasets were improved by 8.1%,16.4%and 16.7%respectively,and several definition evaluation indices on the datasets were increased with different degrees,suggesting that the method can realize the small sample data augmentation effectively.
作者
甘岚
沈鸿飞
王瑶
张跃进
GAN Lan;SHEN Hongfei;WANG Yao;ZHANG Yuejin(School of Information Engineering,East China Jiaotong University,Nanchang Jiangxi 330013,China)
出处
《计算机应用》
CSCD
北大核心
2021年第5期1305-1313,共9页
journal of Computer Applications
基金
国家自然科学基金资助项目(11862006)。
关键词
小样本
数据增强
DCGAN
Wasserstein距离
谱归一化
内在维数
small sample
data augmentation
Deep Convolutional Generative Adversarial Network(DCGAN)
Wasserstein distance
spectral normalization
intrinsic dimension