In recent years,gesture recognition has been widely used in the fields of intelligent driving,virtual reality,and human-computer interaction.With the development of artificial intelligence,deep learning has achieved r...In recent years,gesture recognition has been widely used in the fields of intelligent driving,virtual reality,and human-computer interaction.With the development of artificial intelligence,deep learning has achieved remarkable success in computer vision.To help researchers better understanding the development status of gesture recognition in video,this article provides a detailed survey of the latest developments in gesture recognition technology for videos based on deep learning.The reviewed methods are broadly categorized into three groups based on the type of neural networks used for recognition:two stream convolutional neural networks,3D convolutional neural networks,and Long-short Term Memory(LSTM)networks.In this review,we discuss the advantages and limitations of existing technologies,focusing on the feature extraction method of the spatiotemporal structure information in a video sequence,and consider future research directions.展开更多
基金the National Key R&D Program of China(2018YFC0807500)the National Natural Science Foundation of China(61772396,61772392,62002271,61902296)+3 种基金the Fundamental Research Funds for the Central Universities(JBF180301,XJS210310,XJS190307)Xi'an Key Laboratory of Big Data and Intelligent Vision(201805053ZD4CG37)the National Natural Science Foundation of Shaanxi Province(2020JQ-330,2020JM-195)the China Postdoctoral Science Foundation(2019M663640).
文摘In recent years,gesture recognition has been widely used in the fields of intelligent driving,virtual reality,and human-computer interaction.With the development of artificial intelligence,deep learning has achieved remarkable success in computer vision.To help researchers better understanding the development status of gesture recognition in video,this article provides a detailed survey of the latest developments in gesture recognition technology for videos based on deep learning.The reviewed methods are broadly categorized into three groups based on the type of neural networks used for recognition:two stream convolutional neural networks,3D convolutional neural networks,and Long-short Term Memory(LSTM)networks.In this review,we discuss the advantages and limitations of existing technologies,focusing on the feature extraction method of the spatiotemporal structure information in a video sequence,and consider future research directions.