期刊文献+
共找到389篇文章
< 1 2 20 >
每页显示 20 50 100
Diagnosis of Middle Ear Diseases Based on Convolutional Neural Network
1
作者 Yunyoung Nam Seong Jun Choi +1 位作者 Jihwan Shin Jinseok Lee 《Computer Systems Science & Engineering》 SCIE EI 2023年第8期1521-1532,共12页
An otoscope is traditionally used to examine the eardrum and ear canal.A diagnosis of otitis media(OM)relies on the experience of clinicians.If an examiner lacks experience,the examination may be difficult and time-co... An otoscope is traditionally used to examine the eardrum and ear canal.A diagnosis of otitis media(OM)relies on the experience of clinicians.If an examiner lacks experience,the examination may be difficult and time-consuming.This paper presents an ear disease classification method using middle ear images based on a convolutional neural network(CNN).Especially the segmentation and classification networks are used to classify an otoscopic image into six classes:normal,acute otitis media(AOM),otitis media with effusion(OME),chronic otitis media(COM),congenital cholesteatoma(CC)and traumatic perforations(TMPs).The Mask R-CNN is utilized for the segmentation network to extract the region of interest(ROI)from otoscopic images.The extracted ROIs are used as guiding features for the classification.The classification is based on transfer learning with an ensemble of two CNN classifiers:EfficientNetB0 and Inception-V3.The proposed model was trained with a 5-fold cross-validation technique.The proposed method was evaluated and achieved a classification accuracy of 97.29%. 展开更多
关键词 Otitis media convolutional neural network acute otitis media otitis media with effusion chronic otitis media congenital cholesteatoma traumatic perforation mask r-cnn
下载PDF
Hybrid Convolutional Neural Network for Plant Diseases Prediction
2
作者 S.Poornima N.Sripriya +2 位作者 Adel Fahad Alrasheedi S.S.Askar Mohamed Abouhawwash 《Intelligent Automation & Soft Computing》 SCIE 2023年第5期2393-2409,共17页
Plant diseases prediction is the essential technique to prevent the yield loss and gain high production of agricultural products.The monitoring of plant health continuously and detecting the diseases is a significant f... Plant diseases prediction is the essential technique to prevent the yield loss and gain high production of agricultural products.The monitoring of plant health continuously and detecting the diseases is a significant for sustainable agri-culture.Manual system to monitor the diseases in plant is time consuming and report a lot of errors.There is high demand for technology to detect the plant dis-eases automatically.Recently image processing approach and deep learning approach are highly invited in detection of plant diseases.The diseases like late blight,bacterial spots,spots on Septoria leaf and yellow leaf curved are widely found in plants.These are the main reasons to affects the plants life and yield.To identify the diseases earliest,our research presents the hybrid method by com-bining the region based convolutional neural network(RCNN)and region based fully convolutional networks(RFCN)for classifying the diseases.First the leaf images of plants are collected and preprocessed to remove noisy data in image.Further data normalization,augmentation and removal of background noises are done.The images are divided as testing and training,training images are fed as input to deep learning architecture.First,we identify the region of interest(RoI)by using selective search.In every region,feature of convolutional neural network(CNN)is extracted independently for further classification.The plants such as tomato,potato and bell pepper are taken for this experiment.The plant input image is analyzed and classify as healthy plant or unhealthy plant.If the image is detected as unhealthy,then type of diseases the plant is affected will be displayed.Our proposed technique achieves 98.5%of accuracy in predicting the plant diseases. 展开更多
关键词 Disease detection people detection image classification deep learning region based convolutional neural network
下载PDF
Grid Side Distributed Energy Storage Cloud Group End Region Hierarchical Time-Sharing Configuration Algorithm Based onMulti-Scale and Multi Feature Convolution Neural Network
3
作者 Wen Long Bin Zhu +3 位作者 Huaizheng Li Yan Zhu Zhiqiang Chen Gang Cheng 《Energy Engineering》 EI 2023年第5期1253-1269,共17页
There is instability in the distributed energy storage cloud group end region on the power grid side.In order to avoid large-scale fluctuating charging and discharging in the power grid environment and make the capaci... There is instability in the distributed energy storage cloud group end region on the power grid side.In order to avoid large-scale fluctuating charging and discharging in the power grid environment and make the capacitor components showa continuous and stable charging and discharging state,a hierarchical time-sharing configuration algorithm of distributed energy storage cloud group end region on the power grid side based on multi-scale and multi feature convolution neural network is proposed.Firstly,a voltage stability analysis model based onmulti-scale and multi feature convolution neural network is constructed,and the multi-scale and multi feature convolution neural network is optimized based on Self-OrganizingMaps(SOM)algorithm to analyze the voltage stability of the cloud group end region of distributed energy storage on the grid side under the framework of credibility.According to the optimal scheduling objectives and network size,the distributed robust optimal configuration control model is solved under the framework of coordinated optimal scheduling at multiple time scales;Finally,the time series characteristics of regional power grid load and distributed generation are analyzed.According to the regional hierarchical time-sharing configuration model of“cloud”,“group”and“end”layer,the grid side distributed energy storage cloud group end regional hierarchical time-sharing configuration algorithm is realized.The experimental results show that after applying this algorithm,the best grid side distributed energy storage configuration scheme can be determined,and the stability of grid side distributed energy storage cloud group end region layered timesharing configuration can be improved. 展开更多
关键词 Multiscale and multi feature convolution neural network distributed energy storage at grid side cloud group end region layered time-sharing configuration algorithm
下载PDF
Ozone Depletion Identification in Stratosphere Through Faster Region-Based Convolutional Neural Network
4
作者 Bakhtawar Aslam Ziyad Awadh Alrowaili +3 位作者 Bushra Khaliq Jaweria Manzoor Saira Raqeeb Fahad Ahmad 《Computers, Materials & Continua》 SCIE EI 2021年第8期2159-2178,共20页
The concept of classification through deep learning is to build a model that skillfully separates closely-related images dataset into different classes because of diminutive but continuous variations that took place i... The concept of classification through deep learning is to build a model that skillfully separates closely-related images dataset into different classes because of diminutive but continuous variations that took place in physical systems over time and effect substantially.This study has made ozone depletion identification through classification using Faster Region-Based Convolutional Neural Network(F-RCNN).The main advantage of F-RCNN is to accumulate the bounding boxes on images to differentiate the depleted and non-depleted regions.Furthermore,image classification’s primary goal is to accurately predict each minutely varied case’s targeted classes in the dataset based on ozone saturation.The permanent changes in climate are of serious concern.The leading causes beyond these destructive variations are ozone layer depletion,greenhouse gas release,deforestation,pollution,water resources contamination,and UV radiation.This research focuses on the prediction by identifying the ozone layer depletion because it causes many health issues,e.g.,skin cancer,damage to marine life,crops damage,and impacts on living being’s immune systems.We have tried to classify the ozone images dataset into two major classes,depleted and non-depleted regions,to extract the required persuading features through F-RCNN.Furthermore,CNN has been used for feature extraction in the existing literature,and those extricated diverse RoIs are passed on to the CNN for grouping purposes.It is difficult to manage and differentiate those RoIs after grouping that negatively affects the gathered results.The classification outcomes through F-RCNN approach are proficient and demonstrate that general accuracy lies between 91%to 93%in identifying climate variation through ozone concentration classification,whether the region in the image under consideration is depleted or non-depleted.Our proposed model presented 93%accuracy,and it outperforms the prevailing techniques. 展开更多
关键词 Deep learning image processing CLASSIFICATION climate variation ozone layer depleted region non-depleted region UV radiation faster region-based convolutional neural network
下载PDF
复杂背景下基于改进Mask R-CNN的路面裂缝检测算法
5
作者 张晓华 李小龙 +1 位作者 艾金泉 舒兆翰 《北京测绘》 2024年第3期431-436,共6页
裂缝检测对路面养护具有重要意义,深度学习在该领域取得一定成效。然而,在实际应用中,图像中的噪声纹理背景、复杂的裂缝拓扑结构和图像采集设备给裂缝检测带来了一定的挑战。为了提升在复杂场景下的路面裂缝检测精度,提出了一种改进掩... 裂缝检测对路面养护具有重要意义,深度学习在该领域取得一定成效。然而,在实际应用中,图像中的噪声纹理背景、复杂的裂缝拓扑结构和图像采集设备给裂缝检测带来了一定的挑战。为了提升在复杂场景下的路面裂缝检测精度,提出了一种改进掩码区域卷积神经网络(Mask R-CNN)模型的实例分割算法。使用ConvNeXt-T替代Mask R-CNN的ResNet50框架作为特征生成网络,在自下而上捕获长期依赖的同时保持裂缝特征多样性;设计高维特征提取模块(HFEM)获取高级语义信息,消除背景噪声;引入感受野模块(RFB),扩大感受野,增强多尺度特征信息交互能力。在多结构裂缝图像(MSCI)数据集上进行对比实验,结果表明,提出的改进方法能显著提升Mask R-CNN模型的分割精度,优于经典的Cascade Mask RCNN,最佳模型F1得分84.15%,相较原算法提高了6.29%。在DeepCrack数据集上进行泛化性实验,表现优异。 展开更多
关键词 路面裂缝检测 复杂场景 掩码区域卷积神经网络(mask r-cnn) 实例分割
下载PDF
改进Mask R-CNN的无人机影像建筑物提取
6
作者 方超 廖运茂 +2 位作者 刘飞 王坚 赵小平 《北京测绘》 2024年第1期97-101,共5页
从无人机影像中自动提取建筑物对城乡规划和管理至关重要,然而,在复杂背景干扰和建筑物外观变化很大的情况下给实例提取带来挑战。因此,提出一种改进的Mask区域卷积神经网络(R-CNN)方法用于无人机影像的建筑物自动实例提取。改进方法以R... 从无人机影像中自动提取建筑物对城乡规划和管理至关重要,然而,在复杂背景干扰和建筑物外观变化很大的情况下给实例提取带来挑战。因此,提出一种改进的Mask区域卷积神经网络(R-CNN)方法用于无人机影像的建筑物自动实例提取。改进方法以ResNet-101作为特征提取网络,在特征融合网络方面,通过添加自底向上的路径增强整个特征层次的定位能力,同时在特征融合中加入空洞空间金字塔池化模块(ASPP)来提高多尺度能力与改善模型性能。在自制建筑物数据集上的综合实验结果表明,与原始的Mask R-CNN方法相比,改进方法的mAP值提高了2.6%,能够很好地实现无人机影像建筑物实例提取。 展开更多
关键词 建筑物提取 mask r-cnn 路径融合 空洞空间金字塔池化模块
下载PDF
基于Mask R-CNN的柑橘主叶脉显微图像实例分割模型 被引量:1
7
作者 翁海勇 李效彬 +3 位作者 肖康松 丁若晗 贾良权 叶大鹏 《农业机械学报》 EI CAS CSCD 北大核心 2023年第7期252-258,271,共8页
针对目前植物解剖表型的测量与分析过程自动化低,难以应对复杂解剖表型的提取和识别的问题,以柑橘主叶脉为研究对象,提出了一种基于掩膜区域卷积神经网络(Mask region convolutional neural network,Mask R-CNN)的主叶脉显微图像实例分... 针对目前植物解剖表型的测量与分析过程自动化低,难以应对复杂解剖表型的提取和识别的问题,以柑橘主叶脉为研究对象,提出了一种基于掩膜区域卷积神经网络(Mask region convolutional neural network,Mask R-CNN)的主叶脉显微图像实例分割模型,以残差网络ResNet50和特征金字塔(Feature pyramid network,FPN)为主干特征提取网络,在掩膜(Mask)分支上添加一个新的感兴趣区域对齐层(Region of interest Align,RoI-Align),提升Mask分支的分割精度。结果表明,该网络架构能够精准地对柑橘主叶脉横切面中的髓部、木质部、韧皮部和皮层细胞进行识别分割。Mask R-CNN模型对髓部、木质部、韧皮部和皮层细胞的分割平均精确率(交并比(IoU)为0.50)分别为98.9%、89.8%、95.7%和97.2%,对4个组织区域的分割平均精确率均值(IoU为0.50)为95.4%。与未在Mask分支添加RoI-Align的Mask R-CNN相比,精度提升1.6个百分点。研究结果表明,Mask R-CNN模型对柑橘主叶脉各类组织区域具有良好的识别分割效果,可为柑橘微观表型研究提供技术支持与研究基础。 展开更多
关键词 柑橘主叶脉 显微图像 掩膜区域卷积神经网络 实例分割 微观表型
下载PDF
基于改进Faster R-CNN与U-Net算法的桥梁病害识别与量化方法
8
作者 乔朋 梁志强 +3 位作者 段长江 马晨 王思龙 狄谨 《东南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2024年第3期627-638,共12页
为实现桥梁病害检测的自动化,对基于图像处理技术的混凝土桥梁表观病害的智能识别和尺寸确定方法展开研究.提出基于改进Faster R-CNN算法的病害识别方法,利用K均值聚类和遗传算法对区域候选网络锚框进行优化设计;以裂缝预测区域为基础,... 为实现桥梁病害检测的自动化,对基于图像处理技术的混凝土桥梁表观病害的智能识别和尺寸确定方法展开研究.提出基于改进Faster R-CNN算法的病害识别方法,利用K均值聚类和遗传算法对区域候选网络锚框进行优化设计;以裂缝预测区域为基础,提出ResNet34结合U-Net的裂缝形态提取方法,并结合裂缝形态学研究了裂缝像素宽度和长度的确定方法.结果表明:锚框优化设计可改进Faster R-CNN算法的表观病害识别效果,5类常见病害的预测准确率、召回率、平均精确率分别由68.40%、69.87%、74.64%提升到85.40%、83.59%、83.72%;利用病害预测框,结合改进U-Net算法的裂缝像素尺寸计算,可实现裂缝病害尺寸的自动测量;基于改进Faster R-CNN和改进U-Net的方法可实现混凝土桥梁常见病害的智能识别和尺寸量化,从而提高桥梁病害检测效率并促进桥梁技术状况评定的智能化. 展开更多
关键词 桥梁工程 表观病害识别 裂缝尺寸确定 改进Faster r-cnn 改进U-Net
下载PDF
Facial Expression Recognition Using Enhanced Convolution Neural Network with Attention Mechanism 被引量:2
9
作者 K.Prabhu S.SathishKumar +2 位作者 M.Sivachitra S.Dineshkumar P.Sathiyabama 《Computer Systems Science & Engineering》 SCIE EI 2022年第4期415-426,共12页
Facial Expression Recognition(FER)has been an interesting area of research in places where there is human-computer interaction.Human psychol-ogy,emotions and behaviors can be analyzed in FER.Classifiers used in FER hav... Facial Expression Recognition(FER)has been an interesting area of research in places where there is human-computer interaction.Human psychol-ogy,emotions and behaviors can be analyzed in FER.Classifiers used in FER have been perfect on normal faces but have been found to be constrained in occluded faces.Recently,Deep Learning Techniques(DLT)have gained popular-ity in applications of real-world problems including recognition of human emo-tions.The human face reflects emotional states and human intentions.An expression is the most natural and powerful way of communicating non-verbally.Systems which form communications between the two are termed Human Machine Interaction(HMI)systems.FER can improve HMI systems as human expressions convey useful information to an observer.This paper proposes a FER scheme called EECNN(Enhanced Convolution Neural Network with Atten-tion mechanism)to recognize seven types of human emotions with satisfying results in its experiments.Proposed EECNN achieved 89.8%accuracy in classi-fying the images. 展开更多
关键词 Facial expression recognition linear discriminant analysis animal migration optimization regions of interest enhanced convolution neural network with attention mechanism
下载PDF
人体关键点检测的Mask R-CNN网络模型改进研究 被引量:7
10
作者 宋玲 夏智敏 《计算机工程与应用》 CSCD 北大核心 2021年第1期150-160,共11页
由于在现有的人体关键点检测问题中,深度学习解决方案采用的掩膜区域卷积神经网络Mask R-CNN存在参数量大导致计算成本过高、迭代次数多导致训练时间过长等问题,提出了一种基于重组通道网络ShuffleNet改进Mask R-CNN网络模型。通过引入S... 由于在现有的人体关键点检测问题中,深度学习解决方案采用的掩膜区域卷积神经网络Mask R-CNN存在参数量大导致计算成本过高、迭代次数多导致训练时间过长等问题,提出了一种基于重组通道网络ShuffleNet改进Mask R-CNN网络模型。通过引入ShuffleNet的网络结构,使用分组逐点卷积与通道重排的操作与联合边框回归和掩膜分割的计算结果对Mask R-CNN进行轻量化改进。使用该方法改进网络模型在进行单人或多人情况下的人体关键点检测中,在保留精度的前提下,可以加快运行速度,减少检测时间。 展开更多
关键词 深度学习 卷积神经网络(CNN) 掩膜区域卷积神经网络(mask r-cnn) 重组通道网络 人体关键点检测
下载PDF
Leguminous seeds detection based on convolutional neural networks:Comparison of Faster R-CNN and YOLOv4 on a small custom dataset
11
作者 Noran S.Ouf 《Artificial Intelligence in Agriculture》 2023年第2期30-45,共16页
This paper help with leguminous seeds detection and smart farming. There are hundreds of kinds of seeds and itcan be very difficult to distinguish between them. Botanists and those who study plants, however, can ident... This paper help with leguminous seeds detection and smart farming. There are hundreds of kinds of seeds and itcan be very difficult to distinguish between them. Botanists and those who study plants, however, can identifythe type of seed at a glance. As far as we know, this is the first work to consider leguminous seeds images withdifferent backgrounds and different sizes and crowding. Machine learning is used to automatically classify andlocate 11 different seed types. We chose Leguminous seeds from 11 types to be the objects of this study. Thosetypes are of different colors, sizes, and shapes to add variety and complexity to our research. The images datasetof the leguminous seeds was manually collected, annotated, and then split randomly into three sub-datasetstrain, validation, and test (predictions), with a ratio of 80%, 10%, and 10% respectively. The images consideredthe variability between different leguminous seed types. The images were captured on five different backgrounds: white A4 paper, black pad, dark blue pad, dark green pad, and green pad. Different heights and shootingangles were considered. The crowdedness of the seeds also varied randomly between 1 and 50 seeds per image.Different combinations and arrangements between the 11 types were considered. Two different image-capturingdevices were used: a SAMSUNG smartphone camera and a Canon digital camera. A total of 828 images wereobtained, including 9801 seed objects (labels). The dataset contained images of different backgrounds, heights,angles, crowdedness, arrangements, and combinations. The TensorFlow framework was used to construct theFaster Region-based Convolutional Neural Network (R-CNN) model and CSPDarknet53 is used as the backbonefor YOLOv4 based on DenseNet designed to connect layers in convolutional neural. Using the transfer learningmethod, we optimized the seed detection models. The currently dominant object detection methods, Faster RCNN, and YOLOv4 performances were compared experimentally. The mAP (mean average precision) of the FasterR-CNN and YOLOv4 models were 84.56% and 98.52% respectively. YOLOv4 had a significant advantage in detection speed over Faster R-CNN which makes it suitable for real-time identification as well where high accuracy andlow false positives are needed. The results showed that YOLOv4 had better accuracy, and detection ability, as wellas faster detection speed beating Faster R-CNN by a large margin. The model can be effectively applied under avariety of backgrounds, image sizes, seed sizes, shooting angles, and shooting heights, as well as different levelsof seed crowding. It constitutes an effective and efficient method for detecting different leguminous seeds incomplex scenarios. This study provides a reference for further seed testing and enumeration applications. 展开更多
关键词 Machine learning Object detection Leguminous seeds Deep learning convolutional neural networks Faster r-cnn YOLOv4
原文传递
基于Faster R-CNN的密集人群检测算法 被引量:4
12
作者 邹斌 张聪 《计算机应用》 CSCD 北大核心 2023年第1期61-66,共6页
为提高拥挤场景下的人群检测准确率,提出一种基于改进Faster R-CNN的密集人群检测算法。首先,在特征提取阶段添加空间与通道注意力机制,使用加强的双向特征金字塔网络(S-BiFPN)替代原网络中的多尺度特征金字塔(FPN),使网络对重要特征进... 为提高拥挤场景下的人群检测准确率,提出一种基于改进Faster R-CNN的密集人群检测算法。首先,在特征提取阶段添加空间与通道注意力机制,使用加强的双向特征金字塔网络(S-BiFPN)替代原网络中的多尺度特征金字塔(FPN),使网络对重要特征进行自主学习并加强对图像深层特征的提取;其次,引入多实例预测(MIP)算法对实例进行预测,以避免模型对拥挤场景下的目标造成漏检;最后,对模型中的非极大值抑制(NMS)进行优化,并额外增设一个交并比(IoU)阈值,以对检测结果的干扰项进行精确抑制。在开源的密集人群检测数据集上进行测试的结果显示,相较于原Faster R-CNN算法,所提算法的平均精度(AP)提升5.6%,Jaccard指数值提升3.2%。所提算法具有较高检测精度和稳定性,可以满足密集场景人群检测的需求。 展开更多
关键词 密集人群检测 Faster r-cnn 注意力机制 多实例预测 加强的双向特征金字塔网络
下载PDF
基于改进的Mask R-CNN的染色体图像分割框架 被引量:7
13
作者 冯涛 陈斌 张跃飞 《计算机应用》 CSCD 北大核心 2020年第11期3332-3339,共8页
针对染色体图像的人工分割耗时费力且当前自动分割方法精度不佳的问题,基于改进的Mask R-CNN提出了一种染色体图像分割框架——Mask Oriented R-CNN,引入方向信息对染色体图像进行实例分割。首先,新增有向包围框回归分支,以预测紧实包... 针对染色体图像的人工分割耗时费力且当前自动分割方法精度不佳的问题,基于改进的Mask R-CNN提出了一种染色体图像分割框架——Mask Oriented R-CNN,引入方向信息对染色体图像进行实例分割。首先,新增有向包围框回归分支,以预测紧实包围框并获取方向信息;然后,提出新的交并比(IoU)度量——角度加权交并比(AwIoU),从而结合方向信息与边的关系以改进冗余包围框的判据;最后,实现有向卷积通路结构,通过拷贝掩模分支通路并依据实例的方向信息选择训练路径来减少掩模预测中的干扰。实验结果表明,相较于基准模型Mask R-CNN,Mask Oriented R-CNN在IoU阈值为0.5时的平均精度均值指标提升了10.22个百分点,IoU阈值为0.5~0.95时的平均指标提升了4.91个百分点。研究结果显示,Mask Oriented R-CNN框架相较于基准模型取得了更好的染色体图像分割结果,有助于实现染色体图像自动分割。 展开更多
关键词 卷积神经网络 实例分割 mask r-cnn 染色体图像分割 图像分割 非极大值抑制 交并比
下载PDF
基于Mask-RCNN与SFM的单目视觉长方体三维测量方法
14
作者 宋乐 侯宇鹏 +3 位作者 张俊鹏 吴桐 齐昊鸣 商恩浩 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2023年第2期127-136,共10页
为解决基于运动结构恢复(Structure from motion,SFM)多视角拍摄的局限性,以实现自动化三维测量效果,本文提出了一种可用于长方体三维测量的基于Mask-区域卷积神经网络(Mask-region convolutional neural networks,Mask-RCNN)和SFM的单... 为解决基于运动结构恢复(Structure from motion,SFM)多视角拍摄的局限性,以实现自动化三维测量效果,本文提出了一种可用于长方体三维测量的基于Mask-区域卷积神经网络(Mask-region convolutional neural networks,Mask-RCNN)和SFM的单目视觉测量方法。以箱体三维测量为例,该方法包括测量点提取、转换矩阵计算和三维映射测量三个部分,仅需一次标定获取内部参数,利用深度学习技术实现了单视角自动化三维测量,避免复杂重建的同时降低了视觉测量方法的应用要求。实验结果表明,该方法在棋盘格标志物下获得测量结果的相对标准不确定度在6%以内,在箱体自带标志物下获得测量结果的相对标准不确定度在8%以内。 展开更多
关键词 深度学习 mask-区域卷积神经网络 单目视觉 运动结构恢复 三维测量
下载PDF
基于改进Mask R-CNN的轨道扣件状态检测方法 被引量:6
15
作者 许贵阳 李金洋 +1 位作者 白堂博 杨建伟 《中国铁道科学》 EI CAS CSCD 北大核心 2022年第1期44-51,共8页
为提高轨道扣件状态检测的准确率,基于K均值聚类算法改进掩膜区域卷积神经网络(Mask R-CNN)实例分割算法中的区域建议网络。进行基于改进Mask R-CNN的轨道扣件状态检测方法研究,并将该方法分别应用于普速铁路有砟轨道2个扣件数据集和高... 为提高轨道扣件状态检测的准确率,基于K均值聚类算法改进掩膜区域卷积神经网络(Mask R-CNN)实例分割算法中的区域建议网络。进行基于改进Mask R-CNN的轨道扣件状态检测方法研究,并将该方法分别应用于普速铁路有砟轨道2个扣件数据集和高速铁路无砟轨道1个扣件数据集上进行轨道扣件状态检测。结果表明:该方法能对普速铁路有砟轨道和高速铁路无砟轨道图像中的扣件状态进行准确检测,扣件的定位准确率和分类准确率平均分别达到97.05%和98.36%,均优于YOLO V3,Faster R-CNN和Mask R-CNN算法;相较于前2种算法,本方法对普速铁路有砟轨道扣件状态检测的优势更为明显。 展开更多
关键词 轨道 扣件 状态检测 掩膜区域卷积神经网络 K均值聚类算法 定位准确率 分类准确率
下载PDF
基于改进Faster R-CNN算法的行人识别系统设计与研究
16
作者 蔡劲松 李伟 《信息与电脑》 2023年第20期163-167,共5页
文章基于改进更快的区域卷积神经网络(Faster Region Convolutional Neural Networks,Faster R-CNN)模型,提出了一种行人识别系统设计。介绍了计算机视觉常用技术手段与方法、通行检测步骤,分析了主流的算法优缺点,利用深度学习方法提... 文章基于改进更快的区域卷积神经网络(Faster Region Convolutional Neural Networks,Faster R-CNN)模型,提出了一种行人识别系统设计。介绍了计算机视觉常用技术手段与方法、通行检测步骤,分析了主流的算法优缺点,利用深度学习方法提取图像特征,然后使用改进Faster R-CNN模型进行目标检测。在改进Faster R-CNN模型中,采用了自适应尺度池化和增强的感兴趣区域(Region of Interest,RoI)池化技术,可以提高模型检测精度和速度。 展开更多
关键词 行人检测 机器学习 更快的区域卷积神经网络(Faster r-cnn) 深度学习
下载PDF
基于改进的Mask R-CNN的行人细粒度检测算法 被引量:10
17
作者 朱繁 王洪元 张继 《计算机应用》 CSCD 北大核心 2019年第11期3210-3215,共6页
针对复杂场景下行人检测效果差的问题,采用基于深度学习的目标检测中领先的研究成果,提出了一种基于改进Mask R-CNN框架的行人检测算法。首先,采用K-means算法对行人数据集的目标框进行聚类得到合适的长宽比,通过增加一组长宽比(2∶5)... 针对复杂场景下行人检测效果差的问题,采用基于深度学习的目标检测中领先的研究成果,提出了一种基于改进Mask R-CNN框架的行人检测算法。首先,采用K-means算法对行人数据集的目标框进行聚类得到合适的长宽比,通过增加一组长宽比(2∶5)使12种anchors适应图像中行人的尺寸;然后,结合细粒度图像识别技术,实现行人的高定位精度;其次,采用全卷积网络(FCN)分割前景对象,并进行像素预测获得行人的局部掩码(上半身、下半身),实现对行人的细粒度检测;最后,通过学习行人的局部特征获得行人的整体掩码。为了验证改进算法的有效性,将其与当前具有代表性的目标检测方法(如更快速的区域卷积神经网络(Faster R-CNN)、YOLOv2、R-FCN)在同数据集上进行对比。实验结果表明,改进的算法提高了行人检测的速度和精度,并且降低了误检率。 展开更多
关键词 mask r-cnn 行人检测 K-MEANS算法 细粒度 全卷积网络
下载PDF
基于改进的Mask R-CNN的公路裂缝检测算法 被引量:13
18
作者 张跃飞 王敬飞 +2 位作者 陈斌 冯涛 陈志毅 《计算机应用》 CSCD 北大核心 2020年第S02期162-165,共4页
针对复杂场景下,Mask R-CNN检测公路裂缝掩码拟合质量不高的问题,提出一种基于改进的Mask RCNN的路面裂缝检测算法。首先,采用自适应带权重的损失函数,从而以权重的方式让神经网路更加注重细微裂缝的特征;然后,在Mask R-CNN的掩码支路中... 针对复杂场景下,Mask R-CNN检测公路裂缝掩码拟合质量不高的问题,提出一种基于改进的Mask RCNN的路面裂缝检测算法。首先,采用自适应带权重的损失函数,从而以权重的方式让神经网路更加注重细微裂缝的特征;然后,在Mask R-CNN的掩码支路中,添加一个新的比例预测分支来指导损失函数,让神经网路在学习过程中更加注重裂缝的细节信息,进而提升掩码预测的质量。为了验证改进算法的有效性,将其与当前具有代表性的实例分割检测方法(如Mask R-CNN、PANet)在相同数据集上进行对比。实验结果表明,改进的算法提升了掩码拟合的质量,增加了检测精度。 展开更多
关键词 公路裂缝检测 深度学习 目标检测 mask r-cnn 实例分割 语义分割
下载PDF
基于Mask R-CNN的磁瓦表面缺陷检测算法 被引量:11
19
作者 郭龙源 段厚裕 +4 位作者 周武威 童光红 吴健辉 欧先锋 李武劲 《计算机集成制造系统》 EI CSCD 北大核心 2022年第5期1393-1400,共8页
磁瓦图像具有光照不均、表面纹理复杂、对比度低等特点,针对传统的缺陷检测算法难以准确分割其中缺陷的问题,提出基于掩膜区域卷积网络(Mask R-CNN)的缺陷检测算法。该算法首先通过限制对比度的自适应直方图均衡化方法对图像进行预处理... 磁瓦图像具有光照不均、表面纹理复杂、对比度低等特点,针对传统的缺陷检测算法难以准确分割其中缺陷的问题,提出基于掩膜区域卷积网络(Mask R-CNN)的缺陷检测算法。该算法首先通过限制对比度的自适应直方图均衡化方法对图像进行预处理;然后,采用残差网络50(ResNet50)构建特征金字塔网络(FPN)获取图像信息并提取特征,再采用区域建议网络(RPN)提取缺陷区域的感兴趣区域,得到相应的锚框,并通过全卷积神经网络(FCN)对感兴趣区域内部的像素类别进行预测,以实现缺陷分割;最后通过网络的全连接层实现每个感兴趣区域所属类别和相应锚框坐标的预测。实验结果表明,该算法具有较强的泛化能力,可以对表面存在大量纹理复杂、光照不均和对比度低的磁瓦图像进行精确的缺陷分割,具有较强的鲁棒性。 展开更多
关键词 表面缺陷检测 掩膜区域卷积网络 特征金字塔网络 磁瓦
下载PDF
改进Mask R-CNN的遥感图像多目标检测与分割 被引量:13
20
作者 李森森 吴清 《计算机工程与应用》 CSCD 北大核心 2020年第14期183-190,共8页
针对高分辨率遥感图像在目标检测与分割中特征提取困难、准确率低、虚假率高等问题,提出了一种改进的Mask R-CNN卷积神经网络。该网络以ResNet50为特征提取网络,在此基础上利用自下而上和自上而下两种分层跳连融合方式来进行更好的图像... 针对高分辨率遥感图像在目标检测与分割中特征提取困难、准确率低、虚假率高等问题,提出了一种改进的Mask R-CNN卷积神经网络。该网络以ResNet50为特征提取网络,在此基础上利用自下而上和自上而下两种分层跳连融合方式来进行更好的图像特征提取。针对遥感图像不同目标间尺寸差异过大、目标易丢失的问题,设计了自适应感兴趣区域来进行感兴趣区域提取。在目标分割中,使用局部融合全连接的卷积神经网络替换原全卷积神经网络,并使用上采样操作替换反卷积操作。在NWPU VHR-10数据集上进行验证,结果表明该方法与现有常用方法相比,显著地提高了遥感图像中多目标检测与分割的准确率。 展开更多
关键词 卷积神经网络 分层跳连融合 自适应感兴趣区域提取 多目标检测分割 局部融合全连接卷积网络
下载PDF
上一页 1 2 20 下一页 到第
使用帮助 返回顶部