Temporal action proposal generation aims to output the starting and ending times of each potential action for long videos and often suffers from high computation cost.To address the issue,we propose a new temporal con...Temporal action proposal generation aims to output the starting and ending times of each potential action for long videos and often suffers from high computation cost.To address the issue,we propose a new temporal convolution network called Multipath Temporal ConvNet(MTCN).In our work,one novel high performance ring parallel architecture based is further introduced into temporal action proposal generation in order to respond to the requirements of large memory occupation and a large number of videos.Remarkably,the total data transmission is reduced by adding a connection between multiple-computing load in the newly developed architecture.Compared to the traditional Parameter Server architecture,our parallel architecture has higher efficiency on temporal action detection tasks with multiple GPUs.We conduct experiments on ActivityNet-1.3 and THUMOS14,where our method outperforms-other state-of-art temporal action detection methods with high recall and high temporal precision.In addition,a time metric is further proposed here to evaluate the speed performancein the distributed training process.展开更多
基金supported by the National Key Research and Development Program of China(2016YFE0204200)the National Natural Science Foundation of China(Grant Nos,61972016,62032016)+2 种基金Bejing Natural Science Foundation(L191007)the Fundamental Research Funds for the Central Universities(YWF-21-BJ-J-313 and YWF-20-BJ-J-612)Open Research Fund of Digital Fujian Environment Monitoring Internet of Things Laboratory Foundation(202004).
文摘Temporal action proposal generation aims to output the starting and ending times of each potential action for long videos and often suffers from high computation cost.To address the issue,we propose a new temporal convolution network called Multipath Temporal ConvNet(MTCN).In our work,one novel high performance ring parallel architecture based is further introduced into temporal action proposal generation in order to respond to the requirements of large memory occupation and a large number of videos.Remarkably,the total data transmission is reduced by adding a connection between multiple-computing load in the newly developed architecture.Compared to the traditional Parameter Server architecture,our parallel architecture has higher efficiency on temporal action detection tasks with multiple GPUs.We conduct experiments on ActivityNet-1.3 and THUMOS14,where our method outperforms-other state-of-art temporal action detection methods with high recall and high temporal precision.In addition,a time metric is further proposed here to evaluate the speed performancein the distributed training process.