Adaptive cross-fusion learning for multi-modal gesture recognition

Abstract: Background: Gesture recognition has attracted significant attention because of its wide range of potential applications. Although multi-modal gesture recognition has made significant progress in recent years, a popular method is still to simply fuse prediction scores at the end of each branch, which often ignores complementary features among different modalities at an early stage and does not fuse them into a more discriminative feature. Methods: This paper proposes an Adaptive Cross-modal Weighting (ACmW) scheme to exploit complementary features from RGB-D data. The scheme learns relations among different modalities by combining the features of different data streams. The proposed ACmW module contains two key functions: (1) fusing complementary features from multiple streams through an adaptive one-dimensional convolution; and (2) modeling the correlation of multi-stream complementary features in the time dimension. Through the effective combination of these two functional modules, the proposed ACmW can automatically analyze the relationship between the complementary features from different streams and fuse them in the spatial and temporal dimensions. Results: Extensive experiments validate the effectiveness of the proposed method and show that it outperforms state-of-the-art methods on IsoGD and NVGesture.
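
The abstract does not specify the ACmW architecture, so the following is only a toy illustration of the contrast it draws: score-level (late) fusion versus feature-level fusion with a one-dimensional convolution across streams, followed by a simple weighting over time. Every function name, shape, and kernel value here is a hypothetical choice for illustration, and the kernel is fixed rather than learned or adaptive; it should not be read as the authors' implementation.

```python
# Toy contrast between late (score-level) fusion and feature-level fusion.
# All names, shapes, and weights are hypothetical; this is NOT the ACmW module.
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def late_fusion(scores_rgb, scores_depth):
    """Average per-class scores produced independently by each branch."""
    return [(a + b) / 2 for a, b in zip(scores_rgb, scores_depth)]


def conv1d_fuse(feat_rgb, feat_depth, kernel):
    """Fuse two feature vectors with a 1D convolution along the feature axis.

    The two streams act as two input channels; kernel[s][k] is the weight
    for stream s at offset k. Zero padding keeps the output length unchanged.
    """
    streams = [feat_rgb, feat_depth]
    size = len(kernel[0])
    pad = size // 2
    n = len(feat_rgb)
    fused = []
    for i in range(n):
        acc = 0.0
        for s, feat in enumerate(streams):
            for k in range(size):
                j = i + k - pad
                if 0 <= j < n:
                    acc += kernel[s][k] * feat[j]
        fused.append(acc)
    return fused


def temporal_weights(fused_per_frame):
    """Softmax over frames of the mean fused activation (one weight per frame)."""
    means = [sum(f) / len(f) for f in fused_per_frame]
    return softmax(means)


if __name__ == "__main__":
    kernel = [[0.25, 0.5, 0.25], [0.25, 0.5, 0.25]]  # fixed, illustrative weights
    rgb_frames = [[1.0, 2.0, 3.0], [0.5, 0.5, 0.5]]
    depth_frames = [[1.0, 1.0, 1.0], [2.0, 0.0, 2.0]]
    fused = [conv1d_fuse(r, d, kernel) for r, d in zip(rgb_frames, depth_frames)]
    print(fused)
    print(temporal_weights(fused))
```

In this sketch, late fusion only ever sees the final class scores of each branch, whereas `conv1d_fuse` mixes the two streams' features before any classification, which is the kind of early interaction the abstract argues is missing from score-level fusion.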

Authors: Benjia ZHOU, Jun WAN, Yanyan LIANG, Guodong GUO

Affiliations: Macao University of Science and Technology; National Laboratory of Pattern Recognition; Baidu Research

Source: Virtual Reality & Intelligent Hardware (虚拟现实与智能硬件（中英文）), 2021, Issue 3, pp. 235-247 (13 pages)

Funding: the Chinese National Natural Science Foundation Projects (61961160704, 61876179); the Key Project of the General Logistics Department (ASW17C001); the Science and Technology Development Fund of Macao (0010/2019/AFJ, 0025/2019/AKP).

Keywords: Gesture recognition; Multi-modal fusion; RGB-D

Classification: TP391.41 [Automation and Computer Technology: Computer Application Technology]
