Abstract
Deep learning based on neural networks has made progress in a wide variety of domains; however, it lacks protection for sensitive information. The large amount of data used for training can easily leak private information, so an attacker can readily reconstruct the input from latent representations of natural language. Privacy-preserving deep learning aims to solve these problems. In this paper, we first introduce how to reduce the number of training samples in order to reduce the amount of sensitive information, and then describe how to represent the data without bias with respect to specific attributes. We also review the research results and corresponding algorithms in other directions of privacy protection, and summarize their common ideas and open problems. Finally, we discuss the datasets commonly used in privacy protection research.
Funding
Supported by the NSFC [Grant Nos. 61772281, 61703212, 61602254],
the Jiangsu Province Natural Science Foundation [Grant No. BK2160968],
the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD), and the Jiangsu Collaborative Innovation Center on Atmospheric Environment and Equipment Technology (CICAEET).