摘要
The virtual-to-real paradigm,i.e.,training models on virtual data and then applying them to solve real-world problems,has attracted more and more attention from various domains by successfully alleviating the data shortage problem in machine learning.To summarize the advances in recent years,this survey comprehensively reviews the literature,from the viewport of parallel intelligence.First,an extended parallel learning framework is proposed to cover main domains including computer vision,natural language processing,robotics,and autonomous driving.Second,a multi-dimensional taxonomy is designed to organize the literature in a hierarchical structure.Third,the related virtual-toreal works are analyzed and compared according to the three principles of parallel learning known as description,prediction,and prescription,which cover the methods for constructing virtual worlds,generating labeled data,domain transferring,model training and testing,as well as optimizing the strategies to guide the task-oriented data generator for better learning performance.Key issues remained in virtual-to-real are discussed.Furthermore,the future research directions from the viewpoint of parallel learning are suggested.
基金
partially supported by the National Key Research and Development Program of China(2020YFB2104001)
the National Natural Science Foundation of China(62271485,61903363,U1811463)
Open Project of the State Key Laboratory for Management and Control of Complex Systems(20220117)。