In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process....In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.Traditional fabric defect detection algorithms can only detect specific materials and specific fabric defect types;in addition,their detection efficiency is low,and their detection results are relatively poor.Deep learning-based methods have many advantages in the field of fabric defect detection,however,such methods are less effective in identifying multiscale fabric defects and defects with complex shapes.Therefore,we propose an effective algorithm,namely multilayer feature extraction combined with deformable convolution(MFDC),for fabric defect detection.In MFDC,multi-layer feature extraction is used to fuse the underlying location features with high-level classification features through a horizontally connected top-down architecture to improve the detection of multi-scale fabric defects.On this basis,a deformable convolution is added to solve the problem of the algorithm’s weak detection ability of irregularly shaped fabric defects.In this approach,Roi Align and Cascade-RCNN are integrated to enhance the adaptability of the algorithm in materials with complex patterned backgrounds.The experimental results show that the MFDC algorithm can achieve good detection results for both multi-scale fabric defects and defects with complex shapes,at the expense of a small increase in detection time.展开更多
Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the a...Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the ability to simulate geometric transformations.Therefore,a deformable convolution is introduced to enhance the adaptability of convolutional networks to spatial transformation.Considering that the deep convolutional neural networks cannot adequately segment the local objects at the output layer due to using the pooling layers in neural network architecture.To overcome this shortcoming,the rough prediction segmentation results of the neural network output layer will be processed by fully connected conditional random fields to improve the ability of image segmentation.The proposed method can easily be trained by end-to-end using standard backpropagation algorithms.Finally,the proposed method is tested on the ISPRS dataset.The results show that the proposed method can effectively overcome the influence of the complex structure of the segmentation object and obtain state-of-the-art accuracy on the ISPRS Vaihingen 2D semantic labeling dataset.展开更多
The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segm...The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segmentation networks fail to extract features in fundus image sufficiently,we propose a novel network(DSeU-net)based on deformable convolution and squeeze excitation residual module.The deformable convolution is utilized to dynamically adjust the receptive field for the feature extraction of retinal vessel.And the squeeze excitation residual module is used to scale the weights of the low-level features so that the network learns the complex relationships of the different feature layers efficiently.We validate the DSeU-net on three public retinal vessel segmentation datasets including DRIVE,CHASEDB1,and STARE,and the experimental results demonstrate the satisfactory segmentation performance of the network.展开更多
Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Vari...Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Various strategies have been developed to satisfy the increasing needs of deformable medical image registration.One popular registration method is estimating the displacement field by computing the optical flow between two images.The motion field(flow field)is computed based on either gray-value or handcrafted descriptors such as the scale-invariant feature transform(SIFT).These methods assume that illumination is constant between images.However,medical images may not always satisfy this assumption.In this study,we propose a metric learning-based motion estimation method called Siamese Flow for deformable medical image registration.We train metric learners using a Siamese network,which produces an image patch descriptor that guarantees a smaller feature distance in two similar anatomical structures and a larger feature distance in two dissimilar anatomical structures.In the proposed registration framework,the flow field is computed based on such features and is close to the real deformation field due to the excellent feature representation ability of the Siamese network.Experimental results demonstrate that the proposed method outperforms the Demons,SIFT Flow,Elastix,and VoxelMorph networks regarding registration accuracy and robustness,particularly with large deformations.展开更多
Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust a...Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust and accurate correspondences,we propose DSD-MatchingNet for local feature matching in this study.First,we develop a deformable feature extraction module to obtain multilevel feature maps,which harvest contextual information from dynamic receptive fields.The dynamic receptive fields provided by the deformable convolution network ensure that our method obtains dense and robust correspondence.Second,we utilize sparse-to-dense matching with symmetry of correspondence to implement accurate pixel-level matching,which enables our method to produce more accurate correspondences.Result Experiments show that our proposed DSD-MatchingNet achieves a better performance on the image matching benchmark,as well as on the visual localization benchmark.Specifically,our method achieved 91.3%mean matching accuracy on the HPatches dataset and 99.3%visual localization recalls on the Aachen Day-Night dataset.展开更多
Extracting implicit anomaly information through deformation monitoring data mining is highly significant to determining dam safety status.As an intelligent singular value diagnostic method for concrete dam deformation...Extracting implicit anomaly information through deformation monitoring data mining is highly significant to determining dam safety status.As an intelligent singular value diagnostic method for concrete dam deformation monitoring, shallow neural network models result in local optima and overfitting, and require manual feature extraction.To obtain an intelligent singular value diagnosis model that can be used for dam safety monitoring, a convolutional neural network (CNN) model that has advantages of deep learning (DL), such as automatic feature extraction, good model fitting, and strong generalizability, was trained in this study.An engineering example shows that the predicted result of the intelligent singular value diagnostic method based on CNN is highly compatible with the confusion matrix, with a precision of 92.41%, receiver operating characteristic (ROC) coordinates of (0.03, 0.97), an area-under-curve (AUC) value of 0.99, and an F1-score of 0.91.Moreover, the performance of the CNN model is better than those of models based on decision tree (DT) and k-nearest neighbor (KNN) methods.Therefore, the intelligent singular value diagnostic method based on CNN is simple to operate, highly intelligent, and highly reliable, and it has a high potential for application in engineering.展开更多
Lane detection is a fundamental necessary task for autonomous driving.The conventional methods mainly treat lane detection as a pixel-wise segmentation problem,which suffers from the challenge of uncontrollable drivin...Lane detection is a fundamental necessary task for autonomous driving.The conventional methods mainly treat lane detection as a pixel-wise segmentation problem,which suffers from the challenge of uncontrollable driving road environments and needs post-processing to abstract the lane parameters.In this work,a series of lines are used to represent traffic lanes and a novel line deformation network(LDNet) is proposed to directly predict the coordinates of lane line points.Inspired by the dynamic behavior of classic snake algorithms,LDNet uses a neural network to iteratively deform an initial lane line to match the lane markings.To capture the long and discontinuous structures of lane lines,1 D convolution in LDNet is used for structured feature learning along the lane lines.Based on LDNet,a two-stage pipeline is developed for lane marking detection:(1) initial lane line proposal to predict a list of lane line candidates,and(2) lane line deformation to obtain the coordinates of lane line points.Experiments show that the proposed approach achieves competitive performances on the TuSimple dataset while being efficient for real-time applications on a GTX 1650 GPU.In particular,the accuracy of LDNet with the annotated starting and ending points is up to99.45%,which indicates the improved initial lane line proposal method can further enhance the performance of LDNet.展开更多
为解决交通道路小目标检测难度大、精度低,容易出现错检漏检的问题,提出一种基于YOLO v5(you only look once v5)算法的多尺度特征融合目标检测改进算法。首先,增加小目标检测头用于适应小目标尺寸,缓解漏检情况。然后,引入可变形卷积网...为解决交通道路小目标检测难度大、精度低,容易出现错检漏检的问题,提出一种基于YOLO v5(you only look once v5)算法的多尺度特征融合目标检测改进算法。首先,增加小目标检测头用于适应小目标尺寸,缓解漏检情况。然后,引入可变形卷积网络v2(deformable convolutional networks V2,DCN V2)提高模型对运动中小目标的学习能力;同时,增加上下文增强模块,提升对远距离小目标的识别能力。最后,在替换损失函数、提高边界框定位精度的同时,使用空间金字塔池化和上下文空间金字塔卷积分组模块,提高网络的感受野和特征表达能力。实验结果表明,所提算法在KITTI数据集小目标类别上平均识别精度达到了95.2%,相较于原始YOLO v5,算法总体平均识别精度提升了2.7%,对小目标的检测效果更佳,平均识别精度提升了3.1%,证明所提算法在道路小目标检测方面的有效性。展开更多
基金supported in part by the National Science Foundation of China under Grant 62001236in part by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China under Grant 20KJA520003.
文摘In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.Traditional fabric defect detection algorithms can only detect specific materials and specific fabric defect types;in addition,their detection efficiency is low,and their detection results are relatively poor.Deep learning-based methods have many advantages in the field of fabric defect detection,however,such methods are less effective in identifying multiscale fabric defects and defects with complex shapes.Therefore,we propose an effective algorithm,namely multilayer feature extraction combined with deformable convolution(MFDC),for fabric defect detection.In MFDC,multi-layer feature extraction is used to fuse the underlying location features with high-level classification features through a horizontally connected top-down architecture to improve the detection of multi-scale fabric defects.On this basis,a deformable convolution is added to solve the problem of the algorithm’s weak detection ability of irregularly shaped fabric defects.In this approach,Roi Align and Cascade-RCNN are integrated to enhance the adaptability of the algorithm in materials with complex patterned backgrounds.The experimental results show that the MFDC algorithm can achieve good detection results for both multi-scale fabric defects and defects with complex shapes,at the expense of a small increase in detection time.
基金National Key Research and Development Program of China(No.2017YFC0405806)。
文摘Currently,deep convolutional neural networks have made great progress in the field of semantic segmentation.Because of the fixed convolution kernel geometry,standard convolution neural networks have been limited the ability to simulate geometric transformations.Therefore,a deformable convolution is introduced to enhance the adaptability of convolutional networks to spatial transformation.Considering that the deep convolutional neural networks cannot adequately segment the local objects at the output layer due to using the pooling layers in neural network architecture.To overcome this shortcoming,the rough prediction segmentation results of the neural network output layer will be processed by fully connected conditional random fields to improve the ability of image segmentation.The proposed method can easily be trained by end-to-end using standard backpropagation algorithms.Finally,the proposed method is tested on the ISPRS dataset.The results show that the proposed method can effectively overcome the influence of the complex structure of the segmentation object and obtain state-of-the-art accuracy on the ISPRS Vaihingen 2D semantic labeling dataset.
基金Beijing Natural Science Foundation(No.IS23112)Beijing Institute of Technology Research Fund Program for Young Scholars(No.6120220236)。
文摘The intensive application of deep learning in medical image processing has facilitated the advancement of automatic retinal vessel segmentation research.To overcome the limitation that traditional U-shaped vessel segmentation networks fail to extract features in fundus image sufficiently,we propose a novel network(DSeU-net)based on deformable convolution and squeeze excitation residual module.The deformable convolution is utilized to dynamically adjust the receptive field for the feature extraction of retinal vessel.And the squeeze excitation residual module is used to scale the weights of the low-level features so that the network learns the complex relationships of the different feature layers efficiently.We validate the DSeU-net on three public retinal vessel segmentation datasets including DRIVE,CHASEDB1,and STARE,and the experimental results demonstrate the satisfactory segmentation performance of the network.
基金This study was supported in part by the Sichuan Science and Technology Program(2019YFH0085,2019ZDZX0005,2019YFG0196)in part by the Foundation of Chengdu University of Information Technology(No.KYTZ202008).
文摘Deformable medical image registration plays a vital role in medical image applications,such as placing different temporal images at the same time point or different modality images into the same coordinate system.Various strategies have been developed to satisfy the increasing needs of deformable medical image registration.One popular registration method is estimating the displacement field by computing the optical flow between two images.The motion field(flow field)is computed based on either gray-value or handcrafted descriptors such as the scale-invariant feature transform(SIFT).These methods assume that illumination is constant between images.However,medical images may not always satisfy this assumption.In this study,we propose a metric learning-based motion estimation method called Siamese Flow for deformable medical image registration.We train metric learners using a Siamese network,which produces an image patch descriptor that guarantees a smaller feature distance in two similar anatomical structures and a larger feature distance in two dissimilar anatomical structures.In the proposed registration framework,the flow field is computed based on such features and is close to the real deformation field due to the excellent feature representation ability of the Siamese network.Experimental results demonstrate that the proposed method outperforms the Demons,SIFT Flow,Elastix,and VoxelMorph networks regarding registration accuracy and robustness,particularly with large deformations.
基金Supported by the National Natural Science Foundation of China under Grants 61872241,62077037 and 62272298in part by Shanghai Municipal Science and Technology Major Project under Grant 2021SHZDZX0102。
文摘Background Exploring correspondences across multiview images is the basis of various computer vision tasks.However,most existing methods have limited accuracy under challenging conditions.Method To learn more robust and accurate correspondences,we propose DSD-MatchingNet for local feature matching in this study.First,we develop a deformable feature extraction module to obtain multilevel feature maps,which harvest contextual information from dynamic receptive fields.The dynamic receptive fields provided by the deformable convolution network ensure that our method obtains dense and robust correspondence.Second,we utilize sparse-to-dense matching with symmetry of correspondence to implement accurate pixel-level matching,which enables our method to produce more accurate correspondences.Result Experiments show that our proposed DSD-MatchingNet achieves a better performance on the image matching benchmark,as well as on the visual localization benchmark.Specifically,our method achieved 91.3%mean matching accuracy on the HPatches dataset and 99.3%visual localization recalls on the Aachen Day-Night dataset.
基金supported by the National Natural Science Foundation of China(Grant No.51579207)the Open Foundation of State Key Laboratory Base of Eco-Hydraulic Engineering in Arid Area(Grant No.2016ZZKT-8)the Key Projects of Natural Science Basic Research Program of Shaanxi Province(Grant No.2018JZ5010)
文摘Extracting implicit anomaly information through deformation monitoring data mining is highly significant to determining dam safety status.As an intelligent singular value diagnostic method for concrete dam deformation monitoring, shallow neural network models result in local optima and overfitting, and require manual feature extraction.To obtain an intelligent singular value diagnosis model that can be used for dam safety monitoring, a convolutional neural network (CNN) model that has advantages of deep learning (DL), such as automatic feature extraction, good model fitting, and strong generalizability, was trained in this study.An engineering example shows that the predicted result of the intelligent singular value diagnostic method based on CNN is highly compatible with the confusion matrix, with a precision of 92.41%, receiver operating characteristic (ROC) coordinates of (0.03, 0.97), an area-under-curve (AUC) value of 0.99, and an F1-score of 0.91.Moreover, the performance of the CNN model is better than those of models based on decision tree (DT) and k-nearest neighbor (KNN) methods.Therefore, the intelligent singular value diagnostic method based on CNN is simple to operate, highly intelligent, and highly reliable, and it has a high potential for application in engineering.
基金Supported by the Science and Technology Research Project of Hubei Provincial Department of Education (No.Q20202604)。
文摘Lane detection is a fundamental necessary task for autonomous driving.The conventional methods mainly treat lane detection as a pixel-wise segmentation problem,which suffers from the challenge of uncontrollable driving road environments and needs post-processing to abstract the lane parameters.In this work,a series of lines are used to represent traffic lanes and a novel line deformation network(LDNet) is proposed to directly predict the coordinates of lane line points.Inspired by the dynamic behavior of classic snake algorithms,LDNet uses a neural network to iteratively deform an initial lane line to match the lane markings.To capture the long and discontinuous structures of lane lines,1 D convolution in LDNet is used for structured feature learning along the lane lines.Based on LDNet,a two-stage pipeline is developed for lane marking detection:(1) initial lane line proposal to predict a list of lane line candidates,and(2) lane line deformation to obtain the coordinates of lane line points.Experiments show that the proposed approach achieves competitive performances on the TuSimple dataset while being efficient for real-time applications on a GTX 1650 GPU.In particular,the accuracy of LDNet with the annotated starting and ending points is up to99.45%,which indicates the improved initial lane line proposal method can further enhance the performance of LDNet.