13 articles found
A multivariate grey incidence model for different scale data based on spatial pyramid pooling (cited by: 4)
1
Authors: ZHANG Ke, CUI Le, YIN Yao. Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2020, No. 4, pp. 770-779 (10 pages)
In order to solve the problem that existing multivariate grey incidence models cannot be applied to time series on different scales, a new model is proposed based on spatial pyramid pooling. Firstly, local features of multivariate time series on different scales are pooled and aggregated by spatial pyramid pooling to construct n-level feature pooling matrices on the same scale. Secondly, Deng's multivariate grey incidence model is introduced to measure the degree of incidence between feature pooling matrices at each level. Thirdly, grey incidence degrees at each level are integrated into a global incidence degree. Finally, the performance of the proposed model is verified on two data sets and compared with a variety of algorithms. The results illustrate that the proposed model is more effective and efficient than other similarity measure algorithms.
Keywords: grey system; spatial pyramid pooling; grey incidence; multivariate time series
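The first abstract relies on spatial pyramid pooling to bring series of different lengths onto a common scale. The sketch below illustrates only that general idea for a single 1-D series, assuming max pooling and pyramid levels of 1, 2, and 4 bins. The function name `spp_1d` and the level choice are hypothetical; the paper's multivariate pooling matrices and Deng's incidence measure are not reproduced here.

```python
def spp_1d(series, levels=(1, 2, 4)):
    """Max-pool a 1-D series into sum(levels) values, whatever its length.

    Illustrative assumption: each pyramid level splits the series into
    `bins` roughly equal, possibly overlapping, windows and keeps the
    maximum of each window.
    """
    n = len(series)
    if n < max(levels):
        raise ValueError("series is shorter than the finest pyramid level")
    features = []
    for bins in levels:
        for b in range(bins):
            start = (b * n) // bins                      # floor(b * n / bins)
            end = ((b + 1) * n + bins - 1) // bins       # ceil((b + 1) * n / bins)
            features.append(max(series[start:end]))
    return features


short_series = [1, 3, 2, 5]
long_series = [0, 1, 4, 2, 7, 3, 6, 5, 9, 8]

# Both outputs have 1 + 2 + 4 = 7 entries, so they can be compared directly.
short_features = spp_1d(short_series)
long_features = spp_1d(long_series)
```

Because the output length depends only on the pyramid levels, two series sampled at different scales map to feature vectors of equal size, which is the property a subsequent similarity measure needs.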
An adaptive physics-informed deep learning method for pore pressure prediction using seismic data (cited by: 2)
2
Authors: Xin Zhang, Yun-Hu Lu, Yan Jin, Mian Chen, Bo Zhou. Petroleum Science (SCIE, EI, CAS, CSCD), 2024, No. 2, pp. 885-902 (18 pages)
Accurate prediction of formation pore pressure is essential to predict fluid flow and manage hydrocarbon production in petroleum engineering. Deep learning techniques have recently been receiving more interest due to their great potential for pore pressure prediction. However, most traditional deep learning models generalize poorly. To fill this technical gap, in this work we developed a new adaptive physics-informed deep learning model with high generalization capability to predict pore pressure values directly from seismic data. Specifically, the new model, named CGP-NN, consists of a novel parametric feature extraction approach (1DCPP), a stacked multilayer gated recurrent model (multilayer GRU), and an adaptive physics-informed loss function. Through training, the developed model can automatically select the optimal physical model to constrain the results for each pore pressure prediction. The CGP-NN model has the best generalization when the physics-related metric λ = 0.5. A hybrid approach combining the Eaton and Bowers methods is also proposed to build machine-learnable labels, addressing the problem of scarce labels. To validate the developed model and methodology, a case study on a complex reservoir in the Tarim Basin was performed, demonstrating high accuracy in predicting the pore pressure of new wells along with strong generalization ability. The adaptive physics-informed deep learning approach presented here has potential application in the prediction of pore pressures coupled with multiple genesis mechanisms using seismic data.
Keywords: Pore pressure prediction; Seismic data; 1D convolution pyramid pooling; Adaptive physics-informed loss function; High generalization capability
An improved deep dilated convolutional neural network for seismic facies interpretation
3
Authors: Na-Xia Yang, Guo-Fa Li, Ting-Hui Li, Dong-Feng Zhao, Wei-Wei Gu. Petroleum Science (SCIE, EI, CAS, CSCD), 2024, No. 3, pp. 1569-1583 (15 pages)
With the successful application and breakthrough of deep learning technology in image segmentation, there has been continuous development in the field of seismic facies interpretation using convolutional neural networks. These intelligent and automated methods significantly reduce manual labor, particularly in the laborious task of manually labeling seismic facies. However, the extensive demand for training data limits their wider application. To overcome this challenge, we adopt the UNet architecture as the foundational network structure for seismic facies classification, as it has demonstrated effective segmentation results even with small-sample training data. Additionally, we integrate spatial pyramid pooling and dilated convolution modules into the network architecture to enhance the perception of spatial information across a broader range. The seismic facies classification test on the public data from the F3 block verifies the superior performance of the proposed improved network structure in delineating seismic facies boundaries. Comparative analysis against the traditional UNet model shows that our method achieves more accurate classification results, as evidenced by various evaluation metrics for image segmentation. Notably, the classification accuracy reaches 96%. Furthermore, the results of seismic facies classification in the seismic slice dimension further confirm the superior performance of the proposed method, which accurately defines the range of different seismic facies. This approach holds significant potential for analyzing geological patterns and extracting valuable depositional information.
Keywords: Seismic facies interpretation; Dilated convolution; Spatial pyramid pooling; Internal feature maps; Compound loss function
NFHP-RN: A Method of Few-Shot Network Attack Detection Based on the Network Flow Holographic Picture-ResNet
4
Authors: Tao Yi, Xingshu Chen, Mingdong Yang, Qindong Li, Yi Zhu. Computer Modeling in Engineering & Sciences (SCIE, EI), 2024, No. 7, pp. 929-955 (27 pages)
Due to the rapid evolution of Advanced Persistent Threat (APT) attacks, the emergence of new, rare, and even previously unseen attack samples makes it challenging for traditional rule-based detection methods to extract universal rules for effective detection. With the progress in techniques such as transfer learning and meta-learning, few-shot network attack detection has advanced. However, challenges in few-shot network attack detection arise from the inability of time-sequence flow features to adapt to the fixed-length input requirement of deep learning, difficulties in capturing rich information from the original flow when samples are insufficient, and the challenge of high-level abstract representation. To address these challenges, a few-shot network attack detection method based on NFHP (Network Flow Holographic Picture)-RN (ResNet) is proposed. Specifically, leveraging inherent properties of images such as translation invariance, rotation invariance, scale invariance, and illumination invariance, network attack traffic features and contextual relationships are intuitively represented in NFHP. In addition, an improved RN network model is employed for high-level abstract feature extraction, ensuring that the extracted high-level abstract features maintain the detailed characteristics of the original traffic behavior regardless of changes in background traffic. Finally, a meta-learning model based on the self-attention mechanism is constructed, achieving the detection of novel APT few-shot network attacks through the empirical generalization of high-level abstract feature representations of known-class network attack behaviors. Experimental results demonstrate that the proposed method can learn high-level abstract features of network attacks across different traffic detail granularities. Compared with state-of-the-art methods, it achieves favorable accuracy, precision, recall, and F1 scores for the identification of unknown-class network attacks through cross-validation on multiple datasets.
Keywords: APT attacks; spatial pyramid pooling; NFHP (network flow holographic picture); ResNet; self-attention mechanism; meta-learning
An Improved UNet Lightweight Network for Semantic Segmentation of Weed Images in Corn Fields
5
Authors: Yu Zuo, Wenwen Li. Computers, Materials & Continua (SCIE, EI), 2024, No. 6, pp. 4413-4431 (19 pages)
In cornfields, factors such as the similarity between corn seedlings and weeds and the blurring of plant edge details pose challenges to corn and weed segmentation. In addition, remote areas such as farmland are usually constrained by limited computational resources and limited collected data. Therefore, it becomes necessary to lighten the model so that it adapts better to complex cornfield scenes and makes full use of the limited data. In this paper, we propose an improved image segmentation algorithm based on UNet. Firstly, the inverted residual structure is introduced into the contraction path to reduce the number of parameters in the training process and improve the feature extraction ability; secondly, the pyramid pooling module is introduced to enhance the network's ability to acquire contextual information and to deal with the small-target loss problem; finally, to further enhance the segmentation capability of the model, the squeeze-and-excitation mechanism is introduced in the expansion path. We used images of corn seedlings collected in the field and publicly available corn weed datasets to evaluate the improved model. The improved model has a total of 3.79 M parameters and achieves an mIoU of 87.9%. The frame rate on a single 3050 Ti graphics card is about 58.9 FPS. The experimental results show that the proposed network can quickly segment corn weeds in a cornfield scenario with good segmentation accuracy.
Keywords: Semantic segmentation; deep learning; UNet; pyramid pooling module
A Lightweight UAV Object Detection Algorithm Based on Feature Enhancement
6
Authors: 陈运雷, 刘紫燕, 吴应雨, 郑旭晖, 张倩, 杨模. 《传感技术学报》 (Chinese Journal of Sensors and Actuators) (CAS, CSCD, PKU Core), 2023, No. 6, pp. 901-910 (10 pages)
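To address the sparse features, the large number of small targets, and the strict real-time requirements of detection in UAV aerial images, a lightweight UAV object detection algorithm based on feature enhancement is proposed, built on the YOLOX algorithm. First, a lighter dense residual network structure, ResNet_G, is designed to optimize the backbone network, improving the model's utilization of image features while reducing model complexity. Second, an attention-based Atrous Spatial Pyramid Pooling (ASPP) module is proposed as the feature enhancement module, strengthening the correlation of contextual information to reduce the loss of small-target features. Finally, the Focal Loss and CDIoU Loss functions are used to mitigate the influence of negative samples on the model weights and improve the recognition of dense targets. Experimental results show that, compared with the original network, the improved algorithm raises the mean detection precision on the VisDrone2021 dataset by 5.08%, reduces the parameter count by 0.25 M, and shortens inference time by 2.21 ms.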
Keywords: UAV small object detection; lightweight; Ghost module; Atrous Spatial Pyramid Pooling (ASPP); CDIoU Loss; Focal Loss
PF-YOLOv4-Tiny: Towards Infrared Target Detection on Embedded Platform
7
Authors: Wenbo Li, Qi Wang, Shang Gao. Intelligent Automation & Soft Computing (SCIE), 2023, No. 7, pp. 921-938 (18 pages)
Infrared target detection models are increasingly required to be deployed on embedded platforms, which calls for models with lower memory consumption and better real-time performance while maintaining accuracy. To address these challenges, we propose a modified You Only Look Once (YOLO) algorithm, PF-YOLOv4-Tiny. The algorithm incorporates spatial pyramid pooling (SPP) and squeeze-and-excitation (SE) visual attention modules to enhance the target localization capability. PANet-based feature pyramid networks (P-FPN) are proposed to transfer semantic information and location information simultaneously and improve detection accuracy. To lighten the network, the standard convolutions outside the backbone network are replaced with depthwise separable convolutions. In post-processing, the soft non-maximum suppression (soft-NMS) algorithm is employed to reduce the missed and false detections caused by occlusion between targets. The accuracy of our model reaches 61.75%, while the total parameter count is only 9.3 M and the computational cost is 11 GFLOPs. The inference speed reaches 87 FPS on an NVIDIA GeForce GTX 1650 Ti, which meets the requirements of infrared target detection for embedded deployment.
Keywords: Infrared target detection; visual attention module; spatial pyramid pooling; dual-path feature fusion; depthwise separable convolution; soft-NMS
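The PF-YOLOv4-Tiny abstract uses soft-NMS to reduce missed detections under occlusion. As a rough illustration of the technique in general, the sketch below implements the Gaussian-decay variant: instead of discarding boxes that overlap the current best box, it lowers their scores. The helper names, the `sigma` value, and the score threshold are assumptions chosen for illustration; the abstract does not state which variant or settings the authors used.

```python
import math


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, sigma=0.5, score_threshold=0.001):
    """Gaussian soft-NMS; returns (box_index, final_score) in selection order."""
    candidates = list(enumerate(scores))
    kept = []
    while candidates:
        best = max(candidates, key=lambda c: c[1])
        candidates.remove(best)
        kept.append(best)
        decayed = []
        for idx, score in candidates:
            overlap = iou(boxes[best[0]], boxes[idx])
            # Heavier overlap with the selected box means stronger decay.
            score *= math.exp(-(overlap ** 2) / sigma)
            if score >= score_threshold:
                decayed.append((idx, score))
        candidates = decayed
    return kept


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = soft_nms(boxes, scores)
```

In this toy input the second box overlaps the first heavily, so it survives with a reduced score and is ranked after the distant third box, whereas hard NMS with a typical threshold would have dropped it.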
Multi-Object Detection of Chinese License Plate in Complex Scenes
8
Authors: Dan Liu, Yajuan Wu, Yuxin He, Lu Qin, Bochuan Zheng. Computer Systems Science & Engineering (SCIE, EI), 2021, No. 1, pp. 145-156 (12 pages)
Multi-license plate detection in complex scenes is still a challenging task because images with complex backgrounds contain multiple vehicle license plates of different sizes and classes. The edge features of high-density distribution and the high-curvature features of stroke turns in Chinese characters are important signs that distinguish Chinese license plates from other objects. To accurately detect multiple vehicle license plates of different sizes and classes in complex scenes, a multi-object detection method for Chinese license plates based on an improved YOLOv3 network is proposed in this research. The improvements include replacing the residual block of the YOLOv3 backbone network with the Inception-ResNet-A block, embedding the SPP block into the detection network, cutting redundant Inception-ResNet-A blocks to suit the multi-license plate detection task, and clustering the ground truth boxes of license plates to obtain a new set of anchor boxes. A Chinese vehicle license plate image dataset was built for training and testing the improved network, and the location and class of the license plates in each image were accurately labeled. The dataset has 62,153 images and 4 classes of Chinese vehicle license plates, and almost all images contain multiple license plates of different sizes. Experiments demonstrated that the multi-license plate detection method obtained 83.4% mAP, 98.88% precision, 98.17% recall, an F1 score of 98.52, 89.196 BFLOPS, and 22 FPS on the test dataset, and its overall performance was better than that of the five compared networks: YOLOv3, SSD, Faster-RCNN, EfficientDet, and RetinaNet.
Keywords: Chinese vehicle license plate; multiple license plate; multi-object detection; Inception-ResNet-A; spatial pyramid pooling
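The F1 score quoted in the license plate abstract can be cross-checked against the quoted precision and recall, since F1 is their harmonic mean. The short sketch below does that arithmetic; the function name is a hypothetical helper, and the inputs are the percentages stated in the abstract.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract, in percent.
reported_f1 = round(f1_score(98.88, 98.17), 2)
```

Rounded to two decimals, the result agrees with the 98.52 reported in the abstract.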
MF²ResU-Net: a multi-feature fusion deep learning architecture for retinal blood vessel segmentation
9
Authors: CUI Zhenchao, SONG Shujie, QI Jing. Digital Chinese Medicine, 2022, No. 4, pp. 406-418 (13 pages)
Objective: For computer-aided Chinese medical diagnosis, and aiming at the problem of insufficient segmentation, a novel multi-level method based on the multi-scale fusion residual neural network (MF2ResU-Net) model is proposed. Methods: To obtain refined features of retinal blood vessels, three cascade-connected UNet networks are employed. To deal with the difference between the encoder and decoder parts, shortcut connections are used in MF2ResU-Net to combine the encoder and decoder layers in the blocks. To refine the segmentation features, atrous spatial pyramid pooling (ASPP) is embedded to obtain multi-scale features for the final segmentation networks. Results: MF2ResU-Net was superior to the existing methods on the criteria of sensitivity (Sen), specificity (Spe), accuracy (ACC), and area under curve (AUC), the values of which are 0.8013 and 0.8102, 0.9842 and 0.9809, 0.9700 and 0.9776, and 0.9797 and 0.9837, respectively, for DRIVE and CHASE DB1. The experimental results demonstrated the effectiveness and robustness of the model in segmenting complex-curvature and small blood vessels. Conclusion: Based on residual connections and multi-feature fusion, the proposed method can obtain accurate segmentation of retinal blood vessels by refining the segmentation features, which can provide another diagnostic method for computer-aided Chinese medical diagnosis.
Keywords: Medical image processing; Atrous spatial pyramid pooling (ASPP); Residual neural network; Multi-level model; Retinal vessels segmentation
Scale variant vehicle object recognition by CNN module of multi-pooling-PCA process
10
Authors: Yuxiang Guo, Itsuo Kumazawa, Chuyo Kaku. Journal of Intelligent and Connected Vehicles (EI), 2023, No. 4, pp. 227-236 (10 pages)
Moving vehicles appear at different scales in the image due to the perspective effect of different viewpoint distances. The premise of an advanced driver assistance system (ADAS) for safety surveillance and safe driving is early identification of vehicle targets in front of the ego vehicle. Recognizing the same vehicle at different scales requires feature learning with scale invariance. Unlike existing feature vector methods, the normalized PCA eigenvalues calculated from feature maps are used to extract scale-invariant features. This study proposes a convolutional neural network (CNN) structure embedded with a multi-pooling-PCA module for scale-variant object recognition. The proposed network structure is validated on a scale-variant vehicle image dataset. Compared with the scale-invariant algorithms scale-invariant feature transform (SIFT) and FSAF, as well as miscellaneous networks, the proposed network achieves the best recognition accuracy on the scale-variant vehicle dataset. To test the practicality of the modified network, it was also evaluated on the public ImageNet dataset, and the comparable results demonstrate its effectiveness for general-purpose applications.
Keywords: object detection; scale invariance; spatial pyramid pooling; multi-pooling; convolutional neural network (CNN)
Three-dimensional nanoscale reduced-angle ptycho-tomographic imaging with deep learning (RAPID) (cited by: 1)
11
Authors: Ziling Wu, Iksung Kang, Yudong Yao, Yi Jiang, Junjing Deng, Jeffrey Klug, Stefan Vogt, George Barbastathis. eLight, 2023, No. 1, pp. 198-210 (13 pages)
X-ray ptychographic tomography is a nondestructive method for three-dimensional (3D) imaging with nanometer-sized resolvable features. The size of the volume that can be imaged is almost arbitrary, limited only by the penetration depth and the available scanning time. Here we present a method that rapidly accelerates the imaging operation over a given volume by acquiring a limited set of data via large angular reduction and compensating for the resulting ill-posedness through deeply learned priors. The proposed 3D reconstruction method "RAPID" relies initially on a subset of the object measured with the nominal number of required illumination angles and treats the reconstructions from the conventional two-step approach as ground truth. It is then trained to reproduce equal fidelity from far fewer angles. After training, it performs with similar fidelity on the hitherto unexamined portions of the object, not shown during training, with a limited set of acquisitions. In our experimental demonstration, the nominal number of angles was 349 and the reduced number of angles was 21, resulting in a ×140 aggregate speedup over a volume of 4.48 × 93.18 × 3.92 μm^3 with (14 nm)^3 feature size, i.e., ∼10^8 voxels. RAPID's key distinguishing feature over earlier attempts is the incorporation of atrous spatial pyramid pooling modules into the deep neural network framework in an anisotropic way. We found that adjusting the atrous rate improves reconstruction fidelity because it expands the convolutional kernels' range to match the physics of multi-slice ptychography without significantly increasing the number of parameters.
Keywords: X-ray ptychographic tomography; Deep learning; Reduced-angle; Rapid imaging; Atrous spatial pyramid pooling; Anisotropic
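The RAPID abstract notes that raising the atrous rate widens a kernel's range without significantly increasing the parameter count. The sketch below shows the standard arithmetic behind that statement for one spatial axis: a kernel with k taps and dilation rate r spans k + (k - 1)(r - 1) positions, while the number of weights stays fixed. The function names, the rates 1, 6, 12, and 18, and the channel counts are illustrative assumptions, not values taken from the paper.

```python
def effective_kernel_size(kernel_size, rate):
    """Span, along one axis, of a dilated kernel with the given dilation rate."""
    return kernel_size + (kernel_size - 1) * (rate - 1)


def conv2d_weight_count(kernel_size, in_channels, out_channels):
    """Weights in a 2-D convolution layer; independent of the dilation rate."""
    return kernel_size * kernel_size * in_channels * out_channels


# Illustrative ASPP-style branch rates (assumed, not from the paper).
rates = (1, 6, 12, 18)
spans = [effective_kernel_size(3, r) for r in rates]
weights_per_branch = conv2d_weight_count(3, 64, 64)
```

Every branch here carries the same number of weights, yet the span grows from 3 to 37 positions, which is why parallel branches with different rates can cover several scales at a modest parameter cost.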
3D pulmonary vessel segmentation based on improved residual attention u-net
12
Authors: Jiachen Han, Naixin He, Qiang Zheng, Lin Li, Chaoqing Ma. Medicine in Novel Technology and Devices, 2023, No. 4, pp. 64-75 (12 pages)
Automatic segmentation of pulmonary vessels is a fundamental and essential task for the diagnosis of various pulmonary vessel diseases. Segmentation accuracy suffers from the complex vascular structure. In this paper, an Improved Residual Attention U-Net (IRAU-Net) for segmenting pulmonary vessels in 3D is proposed. To extract more vessel structure information, the Squeeze and Excitation (SE) block is embedded in the down-sampling stage. In the up-sampling stage, the global attention module (GAM) is used to capture target features at both high and low levels. These two stages are connected by Atrous Spatial Pyramid Pooling (ASPP), which can sample in various receptive fields at a low computational cost. The evaluation experiment indicates that IRAU-Net performs better on the segmentation of terminal vessels. It is expected to provide robust support for clinical diagnosis and treatment.
Keywords: Pulmonary vessel segmentation; RAU-Net; Squeeze and excitation; Atrous spatial pyramid pooling; Deep learning
Research on MRI Brain Tumor Image Segmentation Based on Dual-Branch Feature Fusion (cited by: 2)
13
Authors: 熊炜, 周蕾, 乐玲, 张开, 李利荣. 《光电子.激光》 (Journal of Optoelectronics·Laser) (CAS, CSCD, PKU Core), 2022, No. 4, pp. 383-392 (10 pages)
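To address the misidentification of brain tumor regions in magnetic resonance imaging (MRI) and the loss of spatial information in segmentation networks, an MRI brain tumor image segmentation method based on dual-branch feature fusion is proposed. First, the re-parameterization visual geometry group and attention model (RVAM) in the main branch extracts the network's contextual information. Then the deformable convolution and pyramid pooling model (DCPM) in the auxiliary branch captures rich spatial information, after which a feature fusion module fuses the feature information of the two branches. Finally, an attention model is introduced to strengthen the weight of the segmentation target during decoding in the upsampling process. The proposed method was validated on the Kaggle_3m and BraTS2019 datasets. The experimental results show that the method has good brain tumor segmentation performance; on Kaggle_3m, the Dice similarity coefficient and the Jaccard coefficient reach 91.45% and 85.19%, respectively.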
Keywords: magnetic resonance imaging (MRI) brain tumor image segmentation; dual-branch feature fusion; re-parameterization visual geometry group and attention model (RVAM); deformable convolution and pyramid pooling model (DCPM)
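The last abstract reports both a Dice similarity coefficient and a Jaccard coefficient. For readers unfamiliar with the two metrics, the sketch below computes them for a pair of flattened binary masks, using the standard definitions. The function name and the toy masks are hypothetical. For a single pair of masks the two metrics are tied by J = D / (2 - D); figures averaged over a whole dataset, as reported results usually are, need not satisfy that identity exactly.

```python
def dice_and_jaccard(pred, target):
    """Dice and Jaccard coefficients for two equal-length 0/1 sequences."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same length")
    intersection = sum(p * t for p, t in zip(pred, target))
    pred_sum, target_sum = sum(pred), sum(target)
    union = pred_sum + target_sum - intersection
    if union == 0:
        # Both masks empty: treat as a perfect match by convention.
        return 1.0, 1.0
    dice = 2 * intersection / (pred_sum + target_sum)
    jaccard = intersection / union
    return dice, jaccard


# Toy flattened masks: 2 pixels overlap, each mask has 3 foreground pixels.
pred_mask = [1, 1, 1, 0, 0, 0]
target_mask = [0, 1, 1, 1, 0, 0]
dice, jaccard = dice_and_jaccard(pred_mask, target_mask)
```

Dice weights the overlap twice, so it is never lower than Jaccard for the same pair of masks.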