期刊文献+

基于改进的Mask R-CNN的鱼类识别算法研究

Research on Fish Recognition Algorithm Based on Improved Mask R-CNN
下载PDF
导出
摘要 水下鱼类是重要的地球生物资源。针对现有的深度学习模型在水下鱼类图像识别场景中识别效果不佳的问题,提出了一种基于改进的Mask R-CNN的鱼类识别算法模型。首先,采用深度残差网络和特征金字塔结构对水下鱼类图像进行特征提取;其次,选用区域候选网络针对特征图生成感兴趣区域;然后,通过改进的Soft NMS算法对感兴趣区域进行后处理以减少对鱼类目标候选框的误检率;最后,在头部网络中添加级联结构对特征区域进行微调以提升鱼类识别精度。在Fish4knowledge数据集上的对比实验结果表明,改进的鱼类识别算法的平均精度均值为87.4%,相对于基线算法模型精度提升了3.6%。所提算法能够有效提高水下鱼类识别精度,同时减少误检率,提升泛化性能,对我国水下鱼类资源的开发利用具有重要的学术价值和经济价值。 Underwater fishes are important living resources on the earth.Aiming at the poor detection ability of existing deep learning models in underwater fish images detection environments,a fish detection algorithm model based on improved Mask R-CNN is proposed with the use of the state-of-art in deep learning.Firstly,the deep residual network and feature pyramid are used to extract the features of underwater fish images.Secondly,the region proposal network is selected to generate the region of in-terest for the feature maps.Then,the improved Soft NMS algorithm is used to the region of interest that is post-processed to reduce the false detection rate of the fish object candidate frame.Finally,a cascade structure is added to the head network to fine-tune the feature area to improve the accuracy of fish recognition.The results of comparative experiments on the Fish4knowledge dataset show that,the average mean accuracy of the algorithm is 87.4%,which is an improvement of 3.6%compared to the single algorithm mod-el accuracy.The proposed algorithm can effectively improve the recognition accuracy of underwater fish while reducing the false de-tection rate and improving the generalization performance,which has important academic and economic value for the development and utilization of underwater fish resources in my country.
作者 闫党康 YAN Dangkang(School of Information,North China University of Technology,Beijing 100144)
出处 《计算机与数字工程》 2023年第6期1238-1243,共6页 Computer & Digital Engineering
关键词 鱼类识别 Mask R-CNN Soft NMS 级联结构 迁移学习 fish detection Mask R-CNN Soft NMS cascade structure transfer learning
  • 相关文献

参考文献7

二级参考文献89

  • 1LECUN Y, BOTTOU L, BENGIO Y, et al. Gradient-based learning applied to document recognition [J]. Proceedings of the IEEE, 1998, 86(11): 2278-2324.
  • 2HINTON G E, OSINDERO S, TEH Y W. A fast learning algorithm for deep belief nets [J]. Neural Computation, 2006, 18(7): 1527-1554.
  • 3LEE H, GROSSE R, RANGANATH R, et al. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations [C]// ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning. New York: ACM, 2009: 609-616.
  • 4HUANG G B, LEE H, ERIK G. Learning hierarchical representations for face verification with convolutional deep belief networks [C]// CVPR '12: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2012: 2518-2525.
  • 5KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks [C]// Proceedings of Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2012: 1106-1114.
  • 6GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation [C]// Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2014: 580-587.
  • 7LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation [C]// Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2015: 3431-3440.
  • 8SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition [EB/OL]. [2015-11-04]. http://www.robots.ox.ac.uk:5000/~vgg/publications/2015/Simonyan15/simonyan15.pdf.
  • 9SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions [C]// Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2015: 1-8.
  • 10HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition [EB/OL]. [2016-01-04]. https://www.researchgate.net/publication/286512696_Deep_Residual_Learning_for_Image_Recognition.

共引文献847

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部