期刊文献+
共找到290篇文章
< 1 2 15 >
每页显示 20 50 100
A Lightweight Convolutional Neural Network with Hierarchical Multi-Scale Feature Fusion for Image Classification
1
作者 Adama Dembele Ronald Waweru Mwangi Ananda Omutokoh Kube 《Journal of Computer and Communications》 2024年第2期173-200,共28页
Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware reso... Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware resources. To address this issue, the MobileNetV1 network was developed, which employs depthwise convolution to reduce network complexity. MobileNetV1 employs a stride of 2 in several convolutional layers to decrease the spatial resolution of feature maps, thereby lowering computational costs. However, this stride setting can lead to a loss of spatial information, particularly affecting the detection and representation of smaller objects or finer details in images. To maintain the trade-off between complexity and model performance, a lightweight convolutional neural network with hierarchical multi-scale feature fusion based on the MobileNetV1 network is proposed. The network consists of two main subnetworks. The first subnetwork uses a depthwise dilated separable convolution (DDSC) layer to learn imaging features with fewer parameters, which results in a lightweight and computationally inexpensive network. Furthermore, depthwise dilated convolution in DDSC layer effectively expands the field of view of filters, allowing them to incorporate a larger context. The second subnetwork is a hierarchical multi-scale feature fusion (HMFF) module that uses parallel multi-resolution branches architecture to process the input feature map in order to extract the multi-scale feature information of the input image. Experimental results on the CIFAR-10, Malaria, and KvasirV1 datasets demonstrate that the proposed method is efficient, reducing the network parameters and computational cost by 65.02% and 39.78%, respectively, while maintaining the network performance compared to the MobileNetV1 baseline. 展开更多
关键词 MobileNet Image Classification Lightweight Convolutional Neural Network Depthwise Dilated Separable Convolution Hierarchical multi-scale feature fusion
下载PDF
Attention Guided Multi Scale Feature Fusion Network for Automatic Prostate Segmentation
2
作者 Yuchun Li Mengxing Huang +1 位作者 Yu Zhang Zhiming Bai 《Computers, Materials & Continua》 SCIE EI 2024年第2期1649-1668,共20页
The precise and automatic segmentation of prostate magnetic resonance imaging(MRI)images is vital for assisting doctors in diagnosing prostate diseases.In recent years,many advanced methods have been applied to prosta... The precise and automatic segmentation of prostate magnetic resonance imaging(MRI)images is vital for assisting doctors in diagnosing prostate diseases.In recent years,many advanced methods have been applied to prostate segmentation,but due to the variability caused by prostate diseases,automatic segmentation of the prostate presents significant challenges.In this paper,we propose an attention-guided multi-scale feature fusion network(AGMSF-Net)to segment prostate MRI images.We propose an attention mechanism for extracting multi-scale features,and introduce a 3D transformer module to enhance global feature representation by adding it during the transition phase from encoder to decoder.In the decoder stage,a feature fusion module is proposed to obtain global context information.We evaluate our model on MRI images of the prostate acquired from a local hospital.The relative volume difference(RVD)and dice similarity coefficient(DSC)between the results of automatic prostate segmentation and ground truth were 1.21%and 93.68%,respectively.To quantitatively evaluate prostate volume on MRI,which is of significant clinical significance,we propose a unique AGMSF-Net.The essential performance evaluation and validation experiments have demonstrated the effectiveness of our method in automatic prostate segmentation. 展开更多
关键词 Prostate segmentation multi-scale attention 3D Transformer feature fusion MRI
下载PDF
FusionNN:A Semantic Feature Fusion Model Based on Multimodal for Web Anomaly Detection
3
作者 Li Wang Mingshan Xia +3 位作者 Hao Hu Jianfang Li Fengyao Hou Gang Chen 《Computers, Materials & Continua》 SCIE EI 2024年第5期2991-3006,共16页
With the rapid development of the mobile communication and the Internet,the previous web anomaly detectionand identificationmodels were built relying on security experts’empirical knowledge and attack features.Althou... With the rapid development of the mobile communication and the Internet,the previous web anomaly detectionand identificationmodels were built relying on security experts’empirical knowledge and attack features.Althoughthis approach can achieve higher detection performance,it requires huge human labor and resources to maintainthe feature library.In contrast,semantic feature engineering can dynamically discover new semantic featuresand optimize feature selection by automatically analyzing the semantic information contained in the data itself,thus reducing dependence on prior knowledge.However,current semantic features still have the problem ofsemantic expression singularity,as they are extracted from a single semantic mode such as word segmentation,character segmentation,or arbitrary semantic feature extraction.This paper extracts features of web requestsfrom dual semantic granularity,and proposes a semantic feature fusion method to solve the above problems.Themethod first preprocesses web requests,and extracts word-level and character-level semantic features of URLs viaconvolutional neural network(CNN),respectively.By constructing three loss functions to reduce losses betweenfeatures,labels and categories.Experiments on the HTTP CSIC 2010,Malicious URLs and HttpParams datasetsverify the proposedmethod.Results show that compared withmachine learning,deep learningmethods and BERTmodel,the proposed method has better detection performance.And it achieved the best detection rate of 99.16%in the dataset HttpParams. 展开更多
关键词 feature fusion web anomaly detection MULTIMODAL convolutional neural network(cnn) semantic feature extraction
下载PDF
Ship recognition based on HRRP via multi-scale sparse preserving method
4
作者 YANG Xueling ZHANG Gong SONG Hu 《Journal of Systems Engineering and Electronics》 SCIE CSCD 2024年第3期599-608,共10页
In order to extract the richer feature information of ship targets from sea clutter, and address the high dimensional data problem, a method termed as multi-scale fusion kernel sparse preserving projection(MSFKSPP) ba... In order to extract the richer feature information of ship targets from sea clutter, and address the high dimensional data problem, a method termed as multi-scale fusion kernel sparse preserving projection(MSFKSPP) based on the maximum margin criterion(MMC) is proposed for recognizing the class of ship targets utilizing the high-resolution range profile(HRRP). Multi-scale fusion is introduced to capture the local and detailed information in small-scale features, and the global and contour information in large-scale features, offering help to extract the edge information from sea clutter and further improving the target recognition accuracy. The proposed method can maximally preserve the multi-scale fusion sparse of data and maximize the class separability in the reduced dimensionality by reproducing kernel Hilbert space. Experimental results on the measured radar data show that the proposed method can effectively extract the features of ship target from sea clutter, further reduce the feature dimensionality, and improve target recognition performance. 展开更多
关键词 ship target recognition high-resolution range profile(HRRP) multi-scale fusion kernel sparse preserving projection(MSFKSPP) feature extraction dimensionality reduction
下载PDF
Feature Fusion-Based Deep Learning Network to Recognize Table Tennis Actions
5
作者 Chih-Ta Yen Tz-Yun Chen +1 位作者 Un-Hung Chen Guo-Chang WangZong-Xian Chen 《Computers, Materials & Continua》 SCIE EI 2023年第1期83-99,共17页
A system for classifying four basic table tennis strokes using wearable devices and deep learning networks is proposed in this study.The wearable device consisted of a six-axis sensor,Raspberry Pi 3,and a power bank.M... A system for classifying four basic table tennis strokes using wearable devices and deep learning networks is proposed in this study.The wearable device consisted of a six-axis sensor,Raspberry Pi 3,and a power bank.Multiple kernel sizes were used in convolutional neural network(CNN)to evaluate their performance for extracting features.Moreover,a multiscale CNN with two kernel sizes was used to perform feature fusion at different scales in a concatenated manner.The CNN achieved recognition of the four table tennis strokes.Experimental data were obtained from20 research participants who wore sensors on the back of their hands while performing the four table tennis strokes in a laboratory environment.The data were collected to verify the performance of the proposed models for wearable devices.Finally,the sensor and multi-scale CNN designed in this study achieved accuracy and F1 scores of 99.58%and 99.16%,respectively,for the four strokes.The accuracy for five-fold cross validation was 99.87%.This result also shows that the multi-scale convolutional neural network has better robustness after fivefold cross validation. 展开更多
关键词 Wearable devices deep learning six-axis sensor feature fusion multi-scale convolutional neural networks action recognit
下载PDF
Grasp Detection with Hierarchical Multi-Scale Feature Fusion and Inverted Shuffle Residual
6
作者 Wenjie Geng Zhiqiang Cao +3 位作者 Peiyu Guan Fengshui Jing Min Tan Junzhi Yu 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2024年第1期244-256,共13页
Grasp detection plays a critical role for robot manipulation.Mainstream pixel-wise grasp detection networks with encoder-decoder structure receive much attention due to good accuracy and efficiency.However,they usuall... Grasp detection plays a critical role for robot manipulation.Mainstream pixel-wise grasp detection networks with encoder-decoder structure receive much attention due to good accuracy and efficiency.However,they usually transmit the high-level feature in the encoder to the decoder,and low-level features are neglected.It is noted that low-level features contain abundant detail information,and how to fully exploit low-level features remains unsolved.Meanwhile,the channel information in high-level feature is also not well mined.Inevitably,the performance of grasp detection is degraded.To solve these problems,we propose a grasp detection network with hierarchical multi-scale feature fusion and inverted shuffle residual.Both low-level and high-level features in the encoder are firstly fused by the designed skip connections with attention module,and the fused information is then propagated to corresponding layers of the decoder for in-depth feature fusion.Such a hierarchical fusion guarantees the quality of grasp prediction.Furthermore,an inverted shuffle residual module is created,where the high-level feature from encoder is split in channel and the resultant split features are processed in their respective branches.By such differentiation processing,more high-dimensional channel information is kept,which enhances the representation ability of the network.Besides,an information enhancement module is added before the encoder to reinforce input information.The proposed method attains 98.9%and 97.8%in image-wise and object-wise accuracy on the Cornell grasping dataset,respectively,and the experimental results verify the effectiveness of the method. 展开更多
关键词 grasp detection hierarchical multi-scale feature fusion skip connections with attention inverted shuffle residual
原文传递
Multi-Scale Feature Fusion Model for Bridge Appearance Defect Detection
7
作者 Rong Pang Yan Yang +3 位作者 Aiguo Huang Yan Liu Peng Zhang Guangwu Tang 《Big Data Mining and Analytics》 EI CSCD 2024年第1期1-11,共11页
Although the Faster Region-based Convolutional Neural Network(Faster R-CNN)model has obvious advantages in defect recognition,it still cannot overcome challenging problems,such as time-consuming,small targets,irregula... Although the Faster Region-based Convolutional Neural Network(Faster R-CNN)model has obvious advantages in defect recognition,it still cannot overcome challenging problems,such as time-consuming,small targets,irregular shapes,and strong noise interference in bridge defect detection.To deal with these issues,this paper proposes a novel Multi-scale Feature Fusion(MFF)model for bridge appearance disease detection.First,the Faster R-CNN model adopts Region Of Interest(ROl)pooling,which omits the edge information of the target area,resulting in some missed detections and inaccuracies in both detecting and localizing bridge defects.Therefore,this paper proposes an MFF based on regional feature Aggregation(MFF-A),which reduces the missed detection rate of bridge defect detection and improves the positioning accuracy of the target area.Second,the Faster R-CNN model is insensitive to small targets,irregular shapes,and strong noises in bridge defect detection,which results in a long training time and low recognition accuracy.Accordingly,a novel Lightweight MFF(namely MFF-L)model for bridge appearance defect detection using a lightweight network EfficientNetV2 and a feature pyramid network is proposed,which fuses multi-scale features to shorten the training speed and improve recognition accuracy.Finally,the effectiveness of the proposed method is evaluated on the bridge disease dataset and public computational fluid dynamic dataset. 展开更多
关键词 defect detection multi-scale feature fusion(MFF) Region Of Interest(ROl)alignment lightweight network
原文传递
基于AM和CNN的多级特征融合的风力发电机轴承故障诊断方法
8
作者 王进花 韩金玉 +1 位作者 曹洁 王亚丽 《太阳能学报》 EI CAS CSCD 北大核心 2024年第5期51-61,共11页
提出一种基于注意力机制的多级特征融合卷积神经网络(A2ML2F-CNN)故障诊断方法。该方法将原始电流和振动信号作为输入,首先使用基于注意力卷积神经网络(AMCNN)模块分别进行数据信号特征提取,并进行一级特征融合连接。在此基础上,再次分... 提出一种基于注意力机制的多级特征融合卷积神经网络(A2ML2F-CNN)故障诊断方法。该方法将原始电流和振动信号作为输入,首先使用基于注意力卷积神经网络(AMCNN)模块分别进行数据信号特征提取,并进行一级特征融合连接。在此基础上,再次分别采用注意力机制一维卷积神经网(AM1DCNN)和二维卷积神经网络(2DCNN)提取相关信息,并进行二级特征融合,以此来解决单传感器数据故障信息不足及互补特征难以提取的问题,最后采用全连接层和Softmax层进行分类,得到诊断结果。为验证所提方法的故障诊断效果,通过帕德伯恩数据集进行实验验证,并将其与CNN、LSTM、SVM等方法的诊断精度进行对比,相较于上述方法,该文方法的诊断准确率分别提高1.8、3.2和4.8个百分点,验证了所提方法的有效性。 展开更多
关键词 风力机 故障诊断 特征融合 注意力机制 卷积神经网络 风力发电机轴承
下载PDF
基于改进Faster R CNN的光伏组件红外热斑检测算法
9
作者 季瑞瑞 梅远 +5 位作者 杨思凡 骆丰凯 储小帅 张龙 王朵 李珂明 《激光与红外》 CAS CSCD 北大核心 2024年第4期584-592,共9页
光伏故障检测对光伏电站智能运维具有重要意义。针对光伏组件红外图像中热斑目标小、难检测的问题,研究了基于改进Faster R CNN的光伏组件红外热斑故障检测模型。将Swin Transformer作为Faster R CNN模型中的特征提取模块,捕获图像的全... 光伏故障检测对光伏电站智能运维具有重要意义。针对光伏组件红外图像中热斑目标小、难检测的问题,研究了基于改进Faster R CNN的光伏组件红外热斑故障检测模型。将Swin Transformer作为Faster R CNN模型中的特征提取模块,捕获图像的全局信息,建立特征之间的依赖关系,提高模型的建模能力;进一步利用BiFPN进行特征融合,改善了热斑故障由于目标小和特征不明显容易被模型忽略掉的问题;同时为了抑制光伏红外图像中背景和噪声的干扰,加入轻量级注意力模块CBAM,使模型更加关注重要通道和关键区域,提高对热斑故障检测精度。在自建光伏组件图像数据集上进行实验,热斑故障检测精度高达915,验证了本文模型对光伏组件热斑故障检测的有效性。 展开更多
关键词 光伏组件 红外图像 故障检测 Faster Rcnn 特征融合
下载PDF
结合视觉Transformer和CNN的道路裂缝检测方法
10
作者 代少升 刘科生 余自安 《半导体光电》 CAS 北大核心 2024年第2期252-260,共9页
提出了一种结合视觉Transformer和CNN的道路裂缝检测方法。利用CNN来捕获局部的细节信息,同时利用视觉Transformer来捕获全局特征。通过设计的Fusion特征融合模块将两者提取的特征有机地结合在一起,从而解决了单独使用CNN或视觉Transfor... 提出了一种结合视觉Transformer和CNN的道路裂缝检测方法。利用CNN来捕获局部的细节信息,同时利用视觉Transformer来捕获全局特征。通过设计的Fusion特征融合模块将两者提取的特征有机地结合在一起,从而解决了单独使用CNN或视觉Transformer方法存在的局限。最终将结果传递至交互式解码器,生成道路裂缝的检测结果。实验结果表明,无论是在公开的数据集上还是在自建的数据集上,相较于单独使用CNN或视觉Transformer的方法,所提出的方法在道路裂缝检测任务中有更好的效果。 展开更多
关键词 道路裂缝检测 视觉Transformer和cnn 动态加权交叉特征融合
下载PDF
基于改进Faster R-CNN的热轧带钢表面缺陷检测
11
作者 邓慧 曾磊 《控制工程》 CSCD 北大核心 2024年第4期752-759,共8页
热轧带钢是钢铁行业的重要产品,其表面缺陷是影响产品质量的重要因素。针对传统缺陷检测算法存在的过程繁琐、精度不足和效率低下等问题,提出一种基于改进更快速区域卷积神经网络(faster region-based convolutional neural network,Fas... 热轧带钢是钢铁行业的重要产品,其表面缺陷是影响产品质量的重要因素。针对传统缺陷检测算法存在的过程繁琐、精度不足和效率低下等问题,提出一种基于改进更快速区域卷积神经网络(faster region-based convolutional neural network,Faster R-CNN)的检测算法,实现对热轧带钢表面缺陷的高效、高精度检测。首先,采用特征相加的方法对底层细节特征和高层语义特征进行融合;然后,采用精准的感兴趣区域池化(precise region of interest pooling,Precise ROI Pooling)获取固定大小的特征向量,避免特征出现位置偏差;最后,利用均值偏移聚类算法对带钢数据集进行聚类,获得适用于热轧带钢表面缺陷检测的先验框尺寸。实验结果表明,所提算法在热轧带钢表面缺陷检测数据集上的平均精度均值达到了85.34%,检测速度为23.5帧/s,且鲁棒性良好,满足实际的工业检测需求。 展开更多
关键词 表面缺陷检测 Faster R-cnn 特征融合 Precise ROI Pooling 均值偏移
下载PDF
基于改进Faster R-CNN的变电站设备外部缺陷检测
12
作者 张铭泉 邢福德 刘冬 《智能系统学报》 CSCD 北大核心 2024年第2期290-298,共9页
针对变电站设备外部缺陷目标检测任务中目标形状多样,周围环境复杂,当前代表性算法识别准确度低,错检漏检严重的问题,对比了众多目标检测算法在变电站设备缺陷数据集上的检测结果,检测精度较高的是添加了特征融合金字塔结构的Faster R-C... 针对变电站设备外部缺陷目标检测任务中目标形状多样,周围环境复杂,当前代表性算法识别准确度低,错检漏检严重的问题,对比了众多目标检测算法在变电站设备缺陷数据集上的检测结果,检测精度较高的是添加了特征融合金字塔结构的Faster R-CNN(faster region-based convolutional network)算法,但其对小目标物体和设备渗漏油的检测精度仍有提升空间,为此设计一种基于Faster R-CNN的改进算法。改进算法通过对输入图像进行数据增强,在网络中添加SPP(spatial pyramid pooling)结构以及改进特征融合方式,对分类以及边界框回归损失函数进行改进的方式来提高缺陷的检测精度。与原Faster R-CNN算法进行对比,改进算法在变电站设备缺陷目标检测数据集的检测结果中AP(average precision)(0.5∶0.95)提高了2.7个百分点,AP(0.5)提高了4.3个百分点,对小目标物体的检测精度也提高了1.8个百分点,试验结果验证了该方法的有效性。 展开更多
关键词 变电站设备外部缺陷 深度学习 目标检测 卷积神经网络 Faster R-cnn 特征提取 特征融合金字塔结构 损失函数
下载PDF
基于改进Faster R-CNN的红外目标检测算法
13
作者 汪西晨 彭富伦 +1 位作者 李业勋 张俊举 《应用光学》 CAS 北大核心 2024年第2期346-353,共8页
为提升红外目标的检测精度,提出了一种引入频域注意力机制的Faster R-CNN红外目标检测算法。首先,针对红外图像边缘模糊和噪声问题,设计了一种并行的图像增强预处理结构;其次,在Faster R-CNN中引入频域注意力机制,设计了一种新型红外目... 为提升红外目标的检测精度,提出了一种引入频域注意力机制的Faster R-CNN红外目标检测算法。首先,针对红外图像边缘模糊和噪声问题,设计了一种并行的图像增强预处理结构;其次,在Faster R-CNN中引入频域注意力机制,设计了一种新型红外目标检测主干网络;最后,引入路径增强金字塔结构,融合多尺度特征进行预测,利用底层网络丰富的位置信息,提升检测精度。在红外飞机的数据集上进行实验,结果表明,改进后的Faster R-CNN目标检测框架比以ResNet50为主干的算法的AP提升了7.6%。此外,与目前主流算法对比,本文算法提高了红外目标的检测精度,验证了算法改进的有效性。 展开更多
关键词 红外目标检测 图像增强 Faster R-cnn 频域注意力机制 多尺度特征融合
下载PDF
融合Multi-scale CNN和Bi-LSTM的人脸表情识别研究 被引量:3
14
作者 李军 李明 《北京联合大学学报》 CAS 2021年第1期35-39,44,共6页
为了有效改善现有人脸表情识别模型中存在信息丢失严重、特征信息之间联系不密切的问题,提出一种融合多尺度卷积神经网络(Multi-scale CNN)和双向长短期记忆(Bi-LSTM)的模型。Bi-LSTM可以增强特征信息间的联系与信息的维持,在Multi-scal... 为了有效改善现有人脸表情识别模型中存在信息丢失严重、特征信息之间联系不密切的问题,提出一种融合多尺度卷积神经网络(Multi-scale CNN)和双向长短期记忆(Bi-LSTM)的模型。Bi-LSTM可以增强特征信息间的联系与信息的维持,在Multi-scale CNN中通过不同尺度的卷积核可以提取到更加丰富的特征信息,并通过加入批标准化(BN)层与特征融合处理,从而加快网络的收敛速度,有利于特征信息的重利用,再将两者提取到的特征信息进行融合,最后将改进的正则化方法应用到目标函数中,减小网络复杂度和过拟合。在JAFFE和FER-2013公开数据集上进行实验,准确率分别达到了95.455%和74.115%,由此证明所提算法的有效性和先进性。 展开更多
关键词 多尺度卷积神经网络 双向长短期记忆 特征融合 批标准化层 正则化
下载PDF
Feature Fusion Multi_XMNet Convolution Neural Network for Clothing Image Classification 被引量:2
15
作者 周洪雷 彭志飞 +1 位作者 陶然 张璐 《Journal of Donghua University(English Edition)》 CAS 2021年第6期519-526,共8页
Faced with the massive amount of online shopping clothing images,how to classify them quickly and accurately is a challenging task in image classification.In this paper,we propose a novel method,named Multi_XMNet,to s... Faced with the massive amount of online shopping clothing images,how to classify them quickly and accurately is a challenging task in image classification.In this paper,we propose a novel method,named Multi_XMNet,to solve the clothing images classification problem.The proposed method mainly consists of two convolution neural network(CNN)branches.One branch extracts multiscale features from the whole expressional image by Multi_X which is designed by improving the Xception network,while the other extracts attention mechanism features from the whole expressional image by MobileNetV3-small network.Both multiscale and attention mechanism features are aggregated before making classification.Additionally,in the training stage,global average pooling(GAP),convolutional layers,and softmax classifiers are used instead of the fully connected layer to classify the final features,which speed up model training and alleviate the problem of overfitting caused by too many parameters.Experimental comparisons are made in the public DeepFashion dataset.The experimental results show that the classification accuracy of this method is 95.38%,which is better than InceptionV3,Xception and InceptionV3_Xception by 5.58%,3.32%,and 2.22%,respectively.The proposed Multi_XMNet image classification model can help enterprises and researchers in the field of clothing e-commerce to automaticly,efficiently and accurately classify massive clothing images. 展开更多
关键词 feature extraction feature fusion multiscale feature convolution neural network(cnn) clothing image classification
下载PDF
RGB and LBP-texture deep nonlinearly fusion features for fabric retrieval 被引量:1
16
作者 沈飞 Wei Mengwan +2 位作者 Liu Jiajun Zeng Huanqiang Zhu Jianqing 《High Technology Letters》 EI CAS 2020年第2期196-203,共8页
Fabric retrieval is very challenging since problems like viewpoint variations,illumination changes,blots,and poor image qualities are usually encountered in fabric images.In this work,a novel deep feature nonlinear fu... Fabric retrieval is very challenging since problems like viewpoint variations,illumination changes,blots,and poor image qualities are usually encountered in fabric images.In this work,a novel deep feature nonlinear fusion network(DFNFN)is proposed to nonlinearly fuse features learned from RGB and texture images for improving fabric retrieval.Texture images are obtained by using local binary pattern texture(LBP-Texture)features to describe RGB fabric images.The DFNFN firstly applies two feature learning branches to deal with RGB images and the corresponding LBP-Texture images simultaneously.Each branch contains the same convolutional neural network(CNN)architecture but independently learning parameters.Then,a nonlinear fusion module(NFM)is designed to concatenate the features produced by the two branches and nonlinearly fuse the concatenated features via a convolutional layer followed with a rectified linear unit(ReLU).The NFM is flexible since it can be embedded in different depths of the DFNFN to find the best fusion position.Consequently,DFNFN can optimally fuse features learned from RGB and LBP-Texture images to boost the retrieval accuracy.Extensive experiments on the Fabric 1.0 dataset show that the proposed method is superior to many state-of-the-art methods. 展开更多
关键词 FABRIC RETRIEVAL feature fusion convolutional neural network(cnn)
下载PDF
结合CNN和Transformer的遥感图像土地覆盖分类方法
17
作者 汤泊川 帕力旦·吐尔逊 +1 位作者 柏洁馨 齐然然 《微电子学与计算机》 2024年第4期64-73,共10页
利用遥感图像进行语义分割是一种有效的土地覆盖分类方法。然而由于主流框架存在边缘分割不准确、缺乏全局信息导致错误分类等问题,阻碍了其在土地覆盖分类中的应用。针对以上问题,提出了一种用于遥感图像土地覆盖分类的卷积神经网络(Co... 利用遥感图像进行语义分割是一种有效的土地覆盖分类方法。然而由于主流框架存在边缘分割不准确、缺乏全局信息导致错误分类等问题,阻碍了其在土地覆盖分类中的应用。针对以上问题,提出了一种用于遥感图像土地覆盖分类的卷积神经网络(Convolutional Neural Networks,CNN)和Transformer混合网络CTHNet,结合了CNN的局部细节提取能力和Transformer的全局信息提取能力。同时设计了自适应融合模块,融合来自对应级别的CNN和Transformer特征,自适应融合模块的输出进入分割头得到最终的预测结果。最后,结合边界检测分支为语义分割提供边缘约束。在两个公开的土地覆盖分类数据集上的实验结果表明,该方法优于当前主流的方法,分别实现了90.53%和64.33%的平均交并比(mIoU),对遥感图像中的大目标和边界也有更好的识别效果。 展开更多
关键词 土地覆盖分类 遥感图像 特征融合 卷积神经网络 TRANSFORMER
下载PDF
基于CNN跨层融合结构的边缘检测算法
18
作者 李金迪 张陶界 +1 位作者 周迪斌 刘文浩 《计算机系统应用》 2024年第2期207-215,共9页
传统边缘检测算法难以处理复杂的图像,而现有基于深度的边缘检测模型,其检测结果往往存在边缘定位错误和信息丢失等现象.针对此类问题,提出一种基于RCF的高精度的边缘检测算法RCF-CLF.首先,引入HDC结构设计用于避免因叠加相同膨胀卷积... 传统边缘检测算法难以处理复杂的图像,而现有基于深度的边缘检测模型,其检测结果往往存在边缘定位错误和信息丢失等现象.针对此类问题,提出一种基于RCF的高精度的边缘检测算法RCF-CLF.首先,引入HDC结构设计用于避免因叠加相同膨胀卷积而引起的网格效应;其次,设计了一种特征增强结构,旨在融合多尺度信息、扩大感受野;然后,设计了跨层融合结构,将高层信息和低层信息融合,用于提取准确的边缘信息;最后,引入注意力机制CBAM,通过聚焦物体边缘区域,抑制非边缘区域,从而提高网络对边缘信息的提取能力.本文在BSDS500和BIPED数据集上评估所提出的方法,与RCF算法相比,在BIPED数据集上,主要指标ODS、OIS和AP分别达到了0.893、0.901和0.945,提高了近5个百分点,在BSDS500数据集上,主要指标也有所提升.此外,与其他同类算法相比,本文算法也具有一定的优势,可以实现更加准确的边缘定位. 展开更多
关键词 边缘检测 卷积神经网络 特征增强 跨层融合 注意力机制
下载PDF
基于改进Cascade R-CNN的安全帽检测算法
19
作者 冯佩云 钱育蓉 +3 位作者 范迎迎 魏宏杨 秦雨刚 莫王昊 《微电子学与计算机》 2024年第1期63-73,共11页
针对安全帽检测中,目标形状、尺度变化大,易出现漏检、误检等问题,提出了一种基于改进级联基于区域的卷积神经网络(Cascade R-CNN)的安全帽检测算法。首先,对ResNet50进行改进形成D-ResNet50,利用可变形卷积仅增加少量参数就可增大感受... 针对安全帽检测中,目标形状、尺度变化大,易出现漏检、误检等问题,提出了一种基于改进级联基于区域的卷积神经网络(Cascade R-CNN)的安全帽检测算法。首先,对ResNet50进行改进形成D-ResNet50,利用可变形卷积仅增加少量参数就可增大感受野的特性,对特征提取网络的C2~C5卷积层进行重塑,提高网络对目标几何变换的适应能力和特征提取能力。其次,将D-ResNet50作为主干网络引入Cascade R-CNN,形成级联目标检测器,在每个阶段对正负样本重采样,抑制误检问题。再次,对递归特征金字塔进行改进,更高效地进行多尺度特征融合,并且基于反馈信息对特征进行二次处理,增强特征表达,提高网络的分类和定位能力。最后,使用Soft-非极大值抑制(Soft-NMS)进行后处理,进一步解决漏检问题。提出的方法在Hard hat workers数据集上的AP值相比检测基线提高了3.5%,与Sparse R-CNN、TridentNet、VFnet等先进算法相比分别提升了4.7%、5.9%、2.3%等。 展开更多
关键词 安全帽检测 多尺度特征融合 反馈连接 可变形卷积 Cascade R-cnn CARAFE
下载PDF
基于OR-CNN的电动车进入电梯危险行为检测系统设计
20
作者 吕樵润 林辉 刘孝炜 《机电工程技术》 2024年第1期253-256,共4页
在楼宇中电动车不论是在电梯中还是在楼层中自燃爆炸都会给人们造成严重的危害,是一个重大的危险源。目前虽然在电梯口贴上了“电动车禁止进入”的提示标志,但效果不佳。因此,设计了基于OR-CNN的电动车进入电梯危险行为检测系统,以满足... 在楼宇中电动车不论是在电梯中还是在楼层中自燃爆炸都会给人们造成严重的危害,是一个重大的危险源。目前虽然在电梯口贴上了“电动车禁止进入”的提示标志,但效果不佳。因此,设计了基于OR-CNN的电动车进入电梯危险行为检测系统,以满足管理部门对禁止电动车进入电梯的需求。基于OR-CNN网络的电动车检测模型将RoI池化层替换为PORoI,PORoI池化单元通过先验知识将目标划分为5个部分,融合各个部分的特征信息,更好地完成在遮挡环境下的目标检测任务。此外,系统在发现违规行为时会使电梯门处于禁关状态,并发出警报提醒,对违规行为的视频段进行抽帧处理并记录存档,以便事后追责,实现智能化管理。测试结果表明,与YOLOv5相比,所设计的检测系统在遮挡情况下的电动车检测准确率明显提高,更适应电梯等狭小环境中目标遮挡的情况。 展开更多
关键词 OR-cnn PORoI 电动车 电梯 危险行为检测 特征信息融合 管理智能化
下载PDF
上一页 1 2 15 下一页 到第
使用帮助 返回顶部