基于Dropout深度网络的两步图像标注算法被引量：3

Two Steps Image Annotation Algorithm Based on Deep Network with Dropout

下载PDF

导出

摘要基于文本的图像检索技术强烈依赖于图像标签,深度学习可以用来实现图像标签的自动生成。多分类器融合是一种有效提升分类器精度的方法。为了提升深度学习模型的泛化性能,提出了Dropout算法。该方法的本质是在训练过程中随机地丢弃若干神经元,等价于同时训练多个子网络。由于图像标签的多样性,提出了两步标签融合算法:第一步,根据多个不同网络的输出将图像标签词汇分为基准词汇、备选词汇和无关词汇;第二步,选出备选词汇中与基准词汇强相关的词汇,基准词汇和被选出的词汇可作为图像的标签。最后,算法选取3个常用的数据集对提出的算法模型进行验证,实验结果表明,多分类器融合算法可以有效地解决图像自动标注问题。 The performance of text-based image retrieval is highly dependent on manual tagging, and the deep learning can be used to realize image keywords generated automatically. Combining the predictions of many different large neural nets is an effective way for improving the classification accuracy. Firstly, for improving the generalization performance of the deep learning model, this paper proposes the Dropout algorithm. Dropout is a technique for addressing this problem by randomly dropping units（along with their connections） from the neural network during training. So the algorithm is equivalent to train many neural networks for prediction. Next, by the reason of the diverse keywords of image, this paper proposes a two steps algorithm for image annotation. First step, the keywords are divided into three parts： base keywords, candidate keywords and irrelevant keywords depending on the output of all neural networks.Second step, the keywords are chosen in candidate set depending on their correlation with base keywords. At last,the base keywords and chosen keywords are labeled for images. Conducting extensive experiments on three popular data sets, the results demonstrate that the proposed framework can achieve favorable performance for image annotation.

作者杨阳张文生杨雪冰

机构地区中国科学院自动化研究所

出处《计算机科学与探索》 CSCD 北大核心 2015年第12期1494-1505,共12页 Journal of Frontiers of Computer Science and Technology

基金国家自然科学基金~~

关键词图像自动标注深度学习集成学习机器学习 image auto-annotation deep learning assemble learning machine learning

分类号 TP39 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献19

1卢汉清,刘静.基于图学习的自动图像标注[J].计算机学报,2008,31(9):1629-1639. 被引量：42
2Lavrenko V, Manmatha R, Jeon J. A model for learning the semantics of pictures[C]//Advances in Neural Information Processing Systems 16: Proceedings of the 17th Conference on Neural Information Processing Systems, British Columbia, Canada, Dec 11-13, 2003. Cambridge, MA, USA: MIT Press, 2004: 553-560.
3Feng S L, Manmatha R, Lavrenko V. Multiple Bernoulli relevance models for image and video annotation[C]//Procee-dings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Jun 27-Jul 2, 2004. Washington, DC, USA: IEEE Computer Society, 2004: 1002-1009.
4Zhang Shaoting, Huang Junzhou, Huang Yuchi, et al. Automatic image annotation using group sparsity[C]//Proceedings of the 23rd IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, USA, Jun 13-18, 2010. Washington, DC, USA: IEEE Computer Society, 2010: 3312-3319.
5Verma Y, Jawahar C V. Exploring SVM for image annotation in presence of confusing labels[C]//Proceedings of the 24th British Machine Vision Conference, Bristol, UK, Sep 9-13,2013.
6Li Jia, Wang J Z. Automatic linguistic indexing of pictures by a statistical modeling approach[J], IEEE Transactionson Pattern Analysis and Machine Intelligence, 2003, 25(9): 1075-1088.
7Cameiro G, Chan A B, Moreno P J, et al. Supervised learning of semantic classes for image annotation and retrieval[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(3): 394-410.
8Makadia A, Pavlovic V, Kumar S. A new baseline for image annotation[C]//Proceedings of the 10th European Conference on Computer Vision, Marseille, France, Oct 12-18, 2008. Berlin, Heidelberg: Springer, 2008: 316-329.
9Guillaumin M, Mensink T, Verbeek J. TagProp: discriminative metric learning in nearest neighbor models for image auto-annotation[C]//Proceedings of the 12th International Conference on Computer Vision, Kyoto, Japan, Nov 10-12,2008. Berlin, Heidelberg: Springer, 2009: 309-316.
10Zhou Ning, Cheung W K, Qiu Guoping, et al. A hybrid probabilistic model for unified collaborative and content-based image tagging[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33(7): 1281-1294.

二级参考文献17

1Lavrenko V, Manmatha R, Jeon J. A model for learning the semantics of pictures//Proceedings of Advance in Neutral Information Processing, 2003
2Feng S L, Manmatha R, Lavrenko V. Multiple Bernoulli relevance models for image and video annotation//Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2004, 2:1002-1009
3Zhou D, Bousquet O, Lal T N, Weston J, Seholkopf B. Ranking on data manifolds//Proeeedings of the 18th Annual Conferenee on Neural Information Proeessing System. 2003:169-176
4Zhou D, Bousquet O, Lal T N, Weston J, Scholkopf B. Learning with local and global consistency//Proceedings of the 18th Annual Conference on Neural Information Processing System. 2003:237-244
5Jeon J, Lavrenko V, Manmatha R. Automatic image annotation and retrieval using cross-media relevance models//Proceedings of the 26th Annual International ACM SIGIR. 2003:119-126
6Kang F, Jin R, Sukthankar R. Correlated label propagation with application to multi-label learning//Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2006:1719-1726
7Jin Y, Khan L, Wang L. Image annotations by combining multiple evidence WordNet//Proceedings of the 13th Annual ACM International Conference on Multimedia. 2005: 706- 715
8Meila M, Shi J. A random walks view of spectral segmentation//Proceedings of the 8th International Workshop on Artificial Intelligence and Statistic. 2001
9Thomas H C, Leiserson Charles E, Rivest Ronald L, Stein Clifford. Introduction to Algorithms, Chapter 23: Minimum Spanning Trees. MIT Press and McGraw-Hill, 2001: 561- 579
10Salton G, Buckley C. Term weighting approaches in automatic text retrieval. Information Processing and Management, 1988, 24(5): 513-523

共引文献41

1吴效莹,李士勇.自动图像标注算法研究[J].科技风,2009(20):40-41.
2虎晓红,钱旭,王珂.图学习的区域图像标注方法[J].计算机应用,2009,29(9):2393-2394. 被引量：1
3谢书娟.SVM理论在图书馆馆藏图像标引方面的应用[J].甘肃科技,2010,26(1):118-119. 被引量：1
4高隽,谢昭,张骏,吴克伟.图像语义分析与理解综述[J].模式识别与人工智能,2010,23(2):191-202. 被引量：20
5李大湘,彭进业,卜起荣.基于QPSO-MIL算法的图像标注[J].计算机科学,2010,37(6):278-282. 被引量：2
6李东艳,李绍滋,柯逍.基于外部数据库的图像自动标注改善模型[J].计算机应用,2010,30(10):2610-2613. 被引量：1
7张华,梁宇生.基于实例图像自动语义标注方法的研究[J].山东农业大学学报（自然科学版）,2011,42(2):255-258. 被引量：1
8鲍泓,徐光美,冯松鹤,须德.自动图像标注技术研究进展[J].计算机科学,2011,38(7):35-40. 被引量：21
9刘峥,马军.一种基于图划分和图像搜索引擎的图像标注改善算法[J].计算机研究与发展,2011,48(7):1246-1254. 被引量：4
10邓剑勋,熊忠阳,曾代敏.基于AFSVM-MIL算法的图像标注[J].计算机应用研究,2011,28(10):3917-3919.

同被引文献20

1张宗航,孙超,郭国强.不确实海洋环境中的旁瓣结构声源测距方法[J].电声技术,2009,33(7):76-80. 被引量：2
2梁淑芬,刘银华,李立琛.基于LBP和深度学习的非限制条件下人脸识别算法[J].通信学报,2014,35(6):154-160. 被引量：52
3杨立学,陈克安,李双,周静,黄文超,侯峰.飞机舱内声品质的音色参数表达[J].西北工业大学学报,2015,33(3):444-450. 被引量：6
4吴伟,聂建云,高光来.一种基于改进的支持向量机多分类器图像标注方法[J].计算机工程与科学,2015,37(7):1338-1343. 被引量：9
5李晗,陈克安,田旭华.基于平板冲击声的声源特性表征及自动识别[J].应用声学,2016,35(4):294-301. 被引量：3
6黎健成,袁春,宋友.基于卷积神经网络的多标签图像自动标注[J].计算机科学,2016,43(7):41-45. 被引量：20
7臧淼,徐惠民,张永梅.基于距离约束稀疏/组稀疏编码的自动图像标注[J].四川大学学报（工程科学版）,2016,48(5):78-83. 被引量：4
8高耀东,侯凌燕,杨大利.基于多标签学习的卷积神经网络的图像标注方法[J].计算机应用,2017,37(1):228-232. 被引量：20
9柯逍,周铭柯,牛玉贞.融合深度特征和语义邻域的自动图像标注[J].模式识别与人工智能,2017,30(3):193-203. 被引量：11
10周铭柯,柯逍,杜明智.基于数据均衡的增进式深度自动图像标注[J].软件学报,2017,28(7):1862-1880. 被引量：7

引证文献3

1江晓林,项羽,高升.局部图结构与卷积神经网络的人脸识别[J].黑龙江科技大学学报,2019,29(6):757-762.
2曹建芳,赵爱迪,张自邦.融合阈值寻优的卷积神经网络在图像标注中的应用[J].计算机应用,2020,40(6):1587-1592. 被引量：3
3肖旭,王同,王文博,苏林,马力,任群言.基于多域特征提取和深度学习的声源被动测距[J].应用声学,2021,40(1):121-130.

二级引证文献3

1胡鹏宇.一种虚拟辅导员APP模式的创新型研究与设计[J].信息记录材料,2021,22(6):79-81. 被引量：1
2龚向阳,杨跃平,张明达,王思谨,江炯.基于深度残差LSTM的视频异常行为识别算法[J].电子设计工程,2022,30(19):164-168. 被引量：3
3王锦.面向分布式多传感器的FOA大数据融合算法研究[J].北部湾大学学报,2024,39(4):60-67.

1范晓杰,宣士斌,唐凤.基于Dropout卷积神经网络的行为识别[J].广西民族大学学报（自然科学版）,2017,23(1):76-82. 被引量：8
2姜枫,张丽红.基于随机Dropout卷积神经网络的人体行为识别方法研究[J].测试技术学报,2016,30(1):17-22. 被引量：9
3张欣,梁宗保.多分类器融合算法研究与应用[J].湘潭大学自然科学学报,2011,33(2):99-103. 被引量：5
4苑强,李纳新.数字手写体的深度信念网络识别方法[J].工业技术创新,2016,3(5):921-924.
5王瑞波,李济洪,李国臣,杨耀文.基于Dropout正则化的汉语框架语义角色识别[J].中文信息学报,2017,31(1):147-154. 被引量：16
6薛皓天,杨晶东,谈凯德.一种改进的BP神经网络在手写体识别上的应用[J].电子科技,2015,28(5):20-23. 被引量：8
7李江,冉君军,张克非.一种基于降噪自编码器的人脸表情识别方法[J].计算机应用研究,2016,33(12):3843-3846. 被引量：8
8刘汝杰,袁保宗,唐晓芳.一种新的基于聚类的多分类器融合算法[J].计算机研究与发展,2001,38(10):1236-1241. 被引量：12
9沈承恩,何军,邓扬.基于改进堆叠自动编码机的垃圾邮件分类[J].计算机应用,2016,36(1):158-162. 被引量：7
10杜昌顺,黄磊.分段卷积神经网络在文本情感分析中的应用[J].计算机工程与科学,2017,39(1):173-179. 被引量：30

计算机科学与探索

2015年第12期

浏览历史

内容加载中请稍等...

基于Dropout深度网络的两步图像标注算法被引量：3

参考文献19

二级参考文献17

共引文献41

同被引文献20

引证文献3

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于Dropout深度网络的两步图像标注算法 被引量：3

参考文献19

二级参考文献17

共引文献41

同被引文献20

引证文献3

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于Dropout深度网络的两步图像标注算法被引量：3