Improvement of Action Recognition Using Web Images (利用网络图像增强行为识别)

Abstract: As commercial visual search engines mature, Web data may become the next major data source for scaling up visual recognition. We observe that Web images retrieved by querying an action name depict discriminative action scenes, and that this discriminative information complements the temporal information available in videos. On this basis, we propose a method that uses a large number of Web images to enhance action recognition. The framework extracts dense trajectory features from action videos, combines them with Web image features, and trains a support vector machine classifier on the combined features. Because this is a cross-domain learning problem, a cross-domain dictionary learning algorithm is introduced to process the Web images and address the domain difference between the Web image domain and the video domain. Since Web images are easy to obtain online, the method enhances action recognition at almost zero cost. Experiments on the KTH and YouTube datasets show that the method effectively improves the accuracy of human action recognition.
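The fuse-then-classify step described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the feature vectors are toy stand-ins for dense trajectory and Web image descriptors, the cross-domain dictionary learning step is not reproduced, and the linear SVM trained by subgradient descent is an assumed choice, since the abstract does not specify the kernel or solver. All function names are hypothetical.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit L2 norm (returned unchanged if all zeros)."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)

def fuse(video_feat, image_feat):
    """Concatenate the normalized video and image descriptors (assumed fusion rule)."""
    return l2_normalize(video_feat) + l2_normalize(image_feat)

def train_linear_svm(samples, labels, lam=0.01, lr=0.1, epochs=200):
    """Full-batch subgradient descent on the L2-regularized hinge loss."""
    dim, n = len(samples[0]), len(samples)
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w = [lam * wi for wi in w]   # gradient of the regularizer
        grad_b = 0.0
        for x, y in zip(samples, labels):
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            if margin < 1.0:              # only margin violators contribute
                for j in range(dim):
                    grad_w[j] -= y * x[j] / n
                grad_b -= y / n
        w = [wi - lr * gi for wi, gi in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def predict(w, b, x):
    """Return +1 or -1 according to the side of the separating hyperplane."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1

# Toy data: two action classes, 2-D "video" and 2-D "image" descriptors each.
video_feats = [[3, 1], [4, 1], [1, 3], [1, 4]]
image_feats = [[2, 0], [5, 1], [0, 2], [1, 5]]
labels = [1, 1, -1, -1]

train_x = [fuse(v, i) for v, i in zip(video_feats, image_feats)]
w, b = train_linear_svm(train_x, labels)

# Classify a previously unseen (video, image) descriptor pair.
print(predict(w, b, fuse([5, 2], [3, 1])))
```

Normalizing each descriptor before concatenation keeps either modality from dominating the classifier purely because of scale; in practice the image descriptors would first be mapped into the video domain, which is the role the abstract assigns to cross-domain dictionary learning.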

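Author: WEN Hao (闻号), School of Electronics and Information Engineering, Anhui University, Hefei 230601, China

Affiliation: School of Electronics and Information Engineering, Anhui University

Source: Computer Technology and Development (《计算机技术与发展》), 2019, No. 1, pp. 31-34 (4 pages)

Funding: Natural Science Foundation of Anhui Province (1508085MF120)

Keywords: Web learning; transfer learning; action recognition; dense trajectory; dictionary learning

Classification: TP39 [Automation and Computer Technology: Computer Application Technology]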

