基于改进的Mask R-CNN的鱼类识别算法研究

Research on Fish Recognition Algorithm Based on Improved Mask R-CNN

下载PDF

导出

摘要水下鱼类是重要的地球生物资源。针对现有的深度学习模型在水下鱼类图像识别场景中识别效果不佳的问题,提出了一种基于改进的Mask R-CNN的鱼类识别算法模型。首先,采用深度残差网络和特征金字塔结构对水下鱼类图像进行特征提取;其次,选用区域候选网络针对特征图生成感兴趣区域;然后,通过改进的Soft NMS算法对感兴趣区域进行后处理以减少对鱼类目标候选框的误检率;最后,在头部网络中添加级联结构对特征区域进行微调以提升鱼类识别精度。在Fish4knowledge数据集上的对比实验结果表明,改进的鱼类识别算法的平均精度均值为87.4%,相对于基线算法模型精度提升了3.6%。所提算法能够有效提高水下鱼类识别精度,同时减少误检率,提升泛化性能,对我国水下鱼类资源的开发利用具有重要的学术价值和经济价值。 Underwater fishes are important living resources on the earth.Aiming at the poor detection ability of existing deep learning models in underwater fish images detection environments,a fish detection algorithm model based on improved Mask R-CNN is proposed with the use of the state-of-art in deep learning.Firstly,the deep residual network and feature pyramid are used to extract the features of underwater fish images.Secondly,the region proposal network is selected to generate the region of in-terest for the feature maps.Then,the improved Soft NMS algorithm is used to the region of interest that is post-processed to reduce the false detection rate of the fish object candidate frame.Finally,a cascade structure is added to the head network to fine-tune the feature area to improve the accuracy of fish recognition.The results of comparative experiments on the Fish4knowledge dataset show that,the average mean accuracy of the algorithm is 87.4%,which is an improvement of 3.6%compared to the single algorithm mod-el accuracy.The proposed algorithm can effectively improve the recognition accuracy of underwater fish while reducing the false de-tection rate and improving the generalization performance,which has important academic and economic value for the development and utilization of underwater fish resources in my country.

作者闫党康 YAN Dangkang(School of Information,North China University of Technology,Beijing 100144)

机构地区北方工业大学信息学院

出处《计算机与数字工程》 2023年第6期1238-1243,共6页 Computer & Digital Engineering

关键词鱼类识别 Mask R-CNN Soft NMS 级联结构迁移学习 fish detection Mask R-CNN Soft NMS cascade structure transfer learning

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献7

1赵文清,严海,邵绪强.改进的非极大值抑制算法的目标检测[J].中国图象图形学报,2018,23(11):1676-1685. 被引量：53
2李均鹏,祝开艳,杨澍.基于迁移学习的复杂场景海洋鱼类识别方法[J].计算机应用与软件,2019,36(9):168-174. 被引量：15
3李彦冬,郝宗波,雷航.卷积神经网络研究综述[J].计算机应用,2016,36(9):2508-2515. 被引量：546
4龚锐,丁胜,章超华,苏浩.基于深度学习的轻量级和多姿态人脸识别方法[J].计算机应用,2020,40(3):704-709. 被引量：25
5朱繁,王洪元,张继.基于改进的Mask R-CNN的行人细粒度检测算法[J].计算机应用,2019,39(11):3210-3215. 被引量：10
6徐义鎏,贺鹏,任东,王慧,董婷,邵攀.基于改进faster RCNN的木材运输车辆检测[J].计算机应用,2020,40(S01):209-214. 被引量：7
7赵永强,饶元,董世鹏,张君毅.深度学习目标检测方法综述[J].中国图象图形学报,2020,25(4):629-654. 被引量：202

二级参考文献89

1LECUN Y, BOTTOU L, BENGIO Y, et al. Gradient-based learning applied to document recognition [J]. Proceedings of the IEEE, 1998, 86(11): 2278-2324.
2HINTON G E, OSINDERO S, TEH Y W. A fast learning algorithm for deep belief nets [J]. Neural Computation, 2006, 18(7): 1527-1554.
3LEE H, GROSSE R, RANGANATH R, et al. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations [C]// ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning. New York: ACM, 2009: 609-616.
4HUANG G B, LEE H, ERIK G. Learning hierarchical representations for face verification with convolutional deep belief networks [C]// CVPR '12: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2012: 2518-2525.
5KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks [C]// Proceedings of Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2012: 1106-1114.
6GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation [C]// Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2014: 580-587.
7LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation [C]// Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2015: 3431-3440.
8SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition [EB/OL]. [2015-11-04]. http://www.robots.ox.ac.uk:5000/~vgg/publications/2015/Simonyan15/simonyan15.pdf.
9SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions [C]// Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2015: 1-8.
10HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition [EB/OL]. [2016-01-04]. https://www.researchgate.net/publication/286512696_Deep_Residual_Learning_for_Image_Recognition.

共引文献847

1程林,柏杨,都昌平,薛翔天,章品正,於文雪,王世杰,陈阳.基于深度学习的X光地铁危险物品检测算法[J].中国体视学与图像分析,2021,26(3):301-309. 被引量：2
2徐哲壮,黄平,陈丹,吴开田,李建坤.融合机器视觉与邻近度估计的相似工业设备识别策略研究[J].仪器仪表学报,2023,44(1):283-290. 被引量：2
3赵朗月,吴一全.基于机器视觉的表面缺陷检测方法研究进展[J].仪器仪表学报,2022,43(1):198-219. 被引量：67
4杨大为,蔡宇.基于迁移学习与改进型AlexNet的蝴蝶分类算法[J].信息与控制,2023,52(4):514-524. 被引量：1
5任杰,李钢,赵燕姣,姚琼辛,田培辰.基于改进Faster RCNN的城市道路货车检测[J].计算机系统应用,2022,31(12):316-321. 被引量：1
6黎国溥,陈升东,王亮,邹凯,袁峰.基于改进YOLOv5的车辆端目标检测[J].计算机系统应用,2022,31(12):127-134. 被引量：5
7侯帅鹏,石英,华逸伦,苏涛.基于改进SSD的行人检测模型[J].武汉理工大学学报,2019,41(7):95-102. 被引量：1
8王君,蒲磊,何新宇,曹麟阁,梁文威,梁薇薇,成斌,徐铭江.多生物特征融合的矿井人员身份识别[J].科技通报,2021,37(3):44-49. 被引量：5
9苟玉晓,江永全,杨燕,周冠禄,林凯.基于全卷积神经网络的公交专用道识别[J].计算机应用研究,2020,37(S01):406-407.
10杨颖.基于MobileNet-SSD的蝶类昆虫识别算法[J].智能计算机与应用,2021,11(4):156-158. 被引量：2

1梁媛,崔磊,文典,张宏伟,唐剑韬,谭平.水电开发前后金沙江中游鱼类资源演变研究[J].水力发电,2023,49(9):6-11. 被引量：1
2张俊科,吴敬兵,吴晓晓.基于YOLOv5的岸边集装箱桥式起重机钢丝绳损伤检测方法[J].起重运输机械,2023(16):30-36.
3余思成,彭力.基于边界敏感网络的时序行为定位研究[J].计算机与数字工程,2023,51(6):1352-1358.
4张碧函,王亚静,侯晓龙,李瑶,郭立斌,慕建东,杨超臣,肖国华.小滦河鱼类资源调查与生物多样性评价[J].河北渔业,2023(9):32-35.

计算机与数字工程

2023年第6期

浏览历史

内容加载中请稍等...

基于改进的Mask R-CNN的鱼类识别算法研究

参考文献7

二级参考文献89

共引文献847

相关作者

相关机构

相关主题

浏览历史