期刊文献+
共找到20篇文章
< 1 >
每页显示 20 50 100
Guided-YNet: Saliency Feature-Guided Interactive Feature Enhancement Lung Tumor Segmentation Network
1
作者 Tao Zhou Yunfeng Pan +3 位作者 Huiling Lu Pei Dang Yujie Guo Yaxing Wang 《Computers, Materials & Continua》 SCIE EI 2024年第9期4813-4832,共20页
Multimodal lung tumor medical images can provide anatomical and functional information for the same lesion.Such as Positron Emission Computed Tomography(PET),Computed Tomography(CT),and PET-CT.How to utilize the lesio... Multimodal lung tumor medical images can provide anatomical and functional information for the same lesion.Such as Positron Emission Computed Tomography(PET),Computed Tomography(CT),and PET-CT.How to utilize the lesion anatomical and functional information effectively and improve the network segmentation performance are key questions.To solve the problem,the Saliency Feature-Guided Interactive Feature Enhancement Lung Tumor Segmentation Network(Guide-YNet)is proposed in this paper.Firstly,a double-encoder single-decoder U-Net is used as the backbone in this model,a single-coder single-decoder U-Net is used to generate the saliency guided feature using PET image and transmit it into the skip connection of the backbone,and the high sensitivity of PET images to tumors is used to guide the network to accurately locate lesions.Secondly,a Cross Scale Feature Enhancement Module(CSFEM)is designed to extract multi-scale fusion features after downsampling.Thirdly,a Cross-Layer Interactive Feature Enhancement Module(CIFEM)is designed in the encoder to enhance the spatial position information and semantic information.Finally,a Cross-Dimension Cross-Layer Feature Enhancement Module(CCFEM)is proposed in the decoder,which effectively extractsmultimodal image features through global attention and multi-dimension local attention.The proposed method is verified on the lung multimodal medical image datasets,and the results showthat theMean Intersection overUnion(MIoU),Accuracy(Acc),Dice Similarity Coefficient(Dice),Volumetric overlap error(Voe),Relative volume difference(Rvd)of the proposed method on lung lesion segmentation are 87.27%,93.08%,97.77%,95.92%,89.28%,and 88.68%,respectively.It is of great significance for computer-aided diagnosis. 展开更多
关键词 Medical image segmentation U-Net saliency feature guidance cross-modal feature enhancement cross-dimension feature enhancement
下载PDF
Traffic Scene Captioning with Multi-Stage Feature Enhancement
2
作者 Dehai Zhang Yu Ma +3 位作者 Qing Liu Haoxing Wang Anquan Ren Jiashu Liang 《Computers, Materials & Continua》 SCIE EI 2023年第9期2901-2920,共20页
Traffic scene captioning technology automatically generates one or more sentences to describe the content of traffic scenes by analyzing the content of the input traffic scene images,ensuring road safety while providi... Traffic scene captioning technology automatically generates one or more sentences to describe the content of traffic scenes by analyzing the content of the input traffic scene images,ensuring road safety while providing an important decision-making function for sustainable transportation.In order to provide a comprehensive and reasonable description of complex traffic scenes,a traffic scene semantic captioningmodel withmulti-stage feature enhancement is proposed in this paper.In general,the model follows an encoder-decoder structure.First,multilevel granularity visual features are used for feature enhancement during the encoding process,which enables the model to learn more detailed content in the traffic scene image.Second,the scene knowledge graph is applied to the decoding process,and the semantic features provided by the scene knowledge graph are used to enhance the features learned by the decoder again,so that themodel can learn the attributes of objects in the traffic scene and the relationships between objects to generate more reasonable captions.This paper reports extensive experiments on the challenging MS-COCO dataset,evaluated by five standard automatic evaluation metrics,and the results show that the proposed model has improved significantly in all metrics compared with the state-of-the-art methods,especially achieving a score of 129.0 on the CIDEr-D evaluation metric,which also indicates that the proposed model can effectively provide a more reasonable and comprehensive description of the traffic scene. 展开更多
关键词 Traffic scene captioning sustainable transportation feature enhancement encoder-decoder structure multi-level granularity scene knowledge graph
下载PDF
Feature deformation network with multi-range feature enhancement for agricultural machinery operation mode identification
3
作者 Weixin Zhai Zhi Xu +5 位作者 Jinming Liu Xiya Xiong Jiawen Pan Sun-Ok Chung Dionysis Bochtis Caicong Wu 《International Journal of Agricultural and Biological Engineering》 SCIE 2024年第4期265-275,共11页
Utilizing the spatiotemporal features contained in extensive trajectory data for identifying operation modes of agricultural machinery is an important basis task for subsequent agricultural machinery trajectory resear... Utilizing the spatiotemporal features contained in extensive trajectory data for identifying operation modes of agricultural machinery is an important basis task for subsequent agricultural machinery trajectory research.In the present study,to effectively identify agricultural machinery operation mode,a feature deformation network with multi-range feature enhancement was proposed.First,a multi-range feature enhancement module was developed to fully explore the feature distribution of agricultural machinery trajectory data.Second,to further enrich the representation of trajectories,a feature deformation module was proposed that can map trajectory points to high-dimensional space to form feature maps.Then,EfficientNet-B0 was used to extract features of different scales and depths from the feature map,select features highly relevant to the results,and finally accurately predict the mode of each trajectory point.To validate the effectiveness of the proposed method,experiments were conducted to compare the results with those of other methods on a dataset of real agricultural trajectories.On the corn and wheat harvester trajectory datasets,the model achieved accuracies of 96.88%and 96.68%,as well as F1 scores of 93.54%and 94.19%,exhibiting improvements of 8.35%and 9.08%in accuracy and 20.99%and 20.04%in F1 score compared with the current state-of-the-art method. 展开更多
关键词 road-field trajectory classification efficientNet feature deformation network multi-range feature enhancement agricultural machinery operation mode recognition
原文传递
BDPartNet: Feature Decoupling and Reconstruction Fusion Network for Infrared and Visible Image 被引量:1
4
作者 Xuejie Wang Jianxun Zhang +2 位作者 Ye Tao Xiaoli Yuan Yifan Guo 《Computers, Materials & Continua》 SCIE EI 2024年第6期4621-4639,共19页
While single-modal visible light images or infrared images provide limited information,infrared light captures significant thermal radiation data,whereas visible light excels in presenting detailed texture information... While single-modal visible light images or infrared images provide limited information,infrared light captures significant thermal radiation data,whereas visible light excels in presenting detailed texture information.Com-bining images obtained from both modalities allows for leveraging their respective strengths and mitigating individual limitations,resulting in high-quality images with enhanced contrast and rich texture details.Such capabilities hold promising applications in advanced visual tasks including target detection,instance segmentation,military surveillance,pedestrian detection,among others.This paper introduces a novel approach,a dual-branch decomposition fusion network based on AutoEncoder(AE),which decomposes multi-modal features into intensity and texture information for enhanced fusion.Local contrast enhancement module(CEM)and texture detail enhancement module(DEM)are devised to process the decomposed images,followed by image fusion through the decoder.The proposed loss function ensures effective retention of key information from the source images of both modalities.Extensive comparisons and generalization experiments demonstrate the superior performance of our network in preserving pixel intensity distribution and retaining texture details.From the qualitative results,we can see the advantages of fusion details and local contrast.In the quantitative experiments,entropy(EN),mutual information(MI),structural similarity(SSIM)and other results have improved and exceeded the SOTA(State of the Art)model as a whole. 展开更多
关键词 Deep learning feature enhancement computer vision
下载PDF
Detecting XSS with Random Forest and Multi-Channel Feature Extraction
5
作者 Qiurong Qin Yueqin Li +3 位作者 Yajie Mi Jinhui Shen Kexin Wu Zhenzhao Wang 《Computers, Materials & Continua》 SCIE EI 2024年第7期843-874,共32页
In the era of the Internet,widely used web applications have become the target of hacker attacks because they contain a large amount of personal information.Among these vulnerabilities,stealing private data through cr... In the era of the Internet,widely used web applications have become the target of hacker attacks because they contain a large amount of personal information.Among these vulnerabilities,stealing private data through crosssite scripting(XSS)attacks is one of the most commonly used attacks by hackers.Currently,deep learning-based XSS attack detection methods have good application prospects;however,they suffer from problems such as being prone to overfitting,a high false alarm rate,and low accuracy.To address these issues,we propose a multi-stage feature extraction and fusion model for XSS detection based on Random Forest feature enhancement.The model utilizes RandomForests to capture the intrinsic structure and patterns of the data by extracting leaf node indices as features,which are subsequentlymergedwith the original data features to forma feature setwith richer information content.Further feature extraction is conducted through three parallel channels.Channel I utilizes parallel onedimensional convolutional layers(1Dconvolutional layers)with different convolutional kernel sizes to extract local features at different scales and performmulti-scale feature fusion;Channel II employsmaximum one-dimensional pooling layers(max 1D pooling layers)of various sizes to extract key features from the data;and Channel III extracts global information bi-directionally using a Bi-Directional Long-Short TermMemory Network(Bi-LSTM)and incorporates a multi-head attention mechanism to enhance global features.Finally,effective classification and prediction of XSS are performed by fusing the features of the three channels.To test the effectiveness of the model,we conduct experiments on six datasets.We achieve an accuracy of 100%on the UNSW-NB15 dataset and 99.99%on the CICIDS2017 dataset,which is higher than that of the existing models. 展开更多
关键词 Random forest feature enhancement three-channel parallelism XSS detection
下载PDF
A Progressive Feature Fusion-Based Manhole Cover Defect Recognition Method
6
作者 Tingting Hu Xiangyu Ren +2 位作者 Wanfa Sun Shengying Yang Boyang Feng 《Journal of Computer and Communications》 2024年第8期307-316,共10页
Manhole cover defect recognition is of significant practical importance as it can accurately identify damaged or missing covers, enabling timely replacement and maintenance. Traditional manhole cover detection techniq... Manhole cover defect recognition is of significant practical importance as it can accurately identify damaged or missing covers, enabling timely replacement and maintenance. Traditional manhole cover detection techniques primarily focus on detecting the presence of covers rather than classifying the types of defects. However, manhole cover defects exhibit small inter-class feature differences and large intra-class feature variations, which makes their recognition challenging. To improve the classification of manhole cover defect types, we propose a Progressive Dual-Branch Feature Fusion Network (PDBFFN). The baseline backbone network adopts a multi-stage hierarchical architecture design using Res-Net50 as the visual feature extractor, from which both local and global information is obtained. Additionally, a Feature Enhancement Module (FEM) and a Fusion Module (FM) are introduced to enhance the network’s ability to learn critical features. Experimental results demonstrate that our model achieves a classification accuracy of 82.6% on a manhole cover defect dataset, outperforming several state-of-the-art fine-grained image classification models. 展开更多
关键词 feature enhancement PROGRESSIVE Dual-Branch feature Fusion
下载PDF
Application of graph neural network and feature information enhancement in relation inference of sparse knowledge graph
7
作者 Hai-Tao Jia Bo-Yang Zhang +4 位作者 Chao Huang Wen-Han Li Wen-Bo Xu Yu-Feng Bi Li Ren 《Journal of Electronic Science and Technology》 EI CAS CSCD 2023年第2期44-54,共11页
At present,knowledge embedding methods are widely used in the field of knowledge graph(KG)reasoning,and have been successfully applied to those with large entities and relationships.However,in research and production ... At present,knowledge embedding methods are widely used in the field of knowledge graph(KG)reasoning,and have been successfully applied to those with large entities and relationships.However,in research and production environments,there are a large number of KGs with a small number of entities and relations,which are called sparse KGs.Limited by the performance of knowledge extraction methods or some other reasons(some common-sense information does not appear in the natural corpus),the relation between entities is often incomplete.To solve this problem,a method of the graph neural network and information enhancement is proposed.The improved method increases the mean reciprocal rank(MRR)and Hit@3 by 1.6%and 1.7%,respectively,when the sparsity of the FB15K-237 dataset is 10%.When the sparsity is 50%,the evaluation indexes MRR and Hit@10 are increased by 0.8%and 1.8%,respectively. 展开更多
关键词 feature information enhancement Graph neural network Natural language processing Sparse knowledge graph(KG)inference
下载PDF
Fault feature enhancement of gearbox in combined machining center by using adaptive cascade stochastic resonance 被引量:6
8
作者 LI Bing LI JiMeng HE ZhengJia 《Science China(Technological Sciences)》 SCIE EI CAS 2011年第12期3203-3210,共8页
The difficulty to select the best system parameters restricts the engineering application of stochastic resonance (SR). An adaptive cascade stochastic resonance (ACSR) is proposed in the present study. The propose... The difficulty to select the best system parameters restricts the engineering application of stochastic resonance (SR). An adaptive cascade stochastic resonance (ACSR) is proposed in the present study. The proposed method introduces correlation theory into SR, and uses correlation coefficient of the input signals and noise as a weight to construct the weighted signal-to-noise ratio (WSNR) index. The influence of high frequency noise is alleviated and the signal-to-noise ratio index used in traditional SR is improved accordingly. The ACSR with WSNR can obtain optimal parameters adaptively. And it is not necessary to predict the exact frequency of the target signal. In addition, through the secondary utilization of noise, ACSR makes the signal output waveforrn smoother and the fluctuation period more obvious. Simulation example and engineering application of gearbox fault diagnosis demonstrate the effectiveness and feasibility of the proposed method. 展开更多
关键词 stochastic resonance ADAPTIVE weighted signal-to-noise ratio feature enhancement combined machining center
原文传递
DFE-GCN: Dual Feature Enhanced Graph Convolutional Network for Controversy Detection
9
作者 Chengfei Hua Wenzhong Yang +3 位作者 Liejun Wang Fuyuan Wei KeZiErBieKe HaiLaTi Yuanyuan Liao 《Computers, Materials & Continua》 SCIE EI 2023年第10期893-909,共17页
With the development of social media and the prevalence of mobile devices,an increasing number of people tend to use social media platforms to express their opinions and attitudes,leading to many online controversies.... With the development of social media and the prevalence of mobile devices,an increasing number of people tend to use social media platforms to express their opinions and attitudes,leading to many online controversies.These online controversies can severely threaten social stability,making automatic detection of controversies particularly necessary.Most controversy detection methods currently focus on mining features from text semantics and propagation structures.However,these methods have two drawbacks:1)limited ability to capture structural features and failure to learn deeper structural features,and 2)neglecting the influence of topic information and ineffective utilization of topic features.In light of these phenomena,this paper proposes a social media controversy detection method called Dual Feature Enhanced Graph Convolutional Network(DFE-GCN).This method explores structural information at different scales from global and local perspectives to capture deeper structural features,enhancing the expressive power of structural features.Furthermore,to strengthen the influence of topic information,this paper utilizes attention mechanisms to enhance topic features after each graph convolutional layer,effectively using topic information.We validated our method on two different public datasets,and the experimental results demonstrate that our method achieves state-of-the-art performance compared to baseline methods.On the Weibo and Reddit datasets,the accuracy is improved by 5.92%and 3.32%,respectively,and the F1 score is improved by 1.99%and 2.17%,demonstrating the positive impact of enhanced structural features and topic features on controversy detection. 展开更多
关键词 Controversy detection graph convolutional network feature enhancement social media
下载PDF
Coherent Features of Resonance-Mediated Two-Photon Absorption Enhancement by Varying the Energy Level Structure,Laser Spectrum Bandwidth and Central Frequency
10
作者 程文静 梁果 +3 位作者 吴萍 贾天卿 孙真荣 张诗按 《Chinese Physics Letters》 SCIE CAS CSCD 2017年第8期41-45,共5页
The femtosecond pulse shaping technique has been shown to be an effective method to control the multi-photon absorption by the light–matter interaction. Previous studies mainly focused on the quantum coherent control... The femtosecond pulse shaping technique has been shown to be an effective method to control the multi-photon absorption by the light–matter interaction. Previous studies mainly focused on the quantum coherent control of the multi-photon absorption by the phase, amplitude and polarization modulation, but the coherent features of the multi-photon absorption depending on the energy level structure, the laser spectrum bandwidth and laser central frequency still lack in-depth systematic research. In this work, we further explore the coherent features of the resonance-mediated two-photon absorption in a rubidium atom by varying the energy level structure, spectrum bandwidth and central frequency of the femtosecond laser field. The theoretical results show that the change of the intermediate state detuning can effectively influence the enhancement of the near-resonant part, which further affects the transform-limited (TL)-normalized final state population maximum. Moreover, as the laser spectrum bandwidth increases, the TL-normalized final state population maximum can be effectively enhanced due to the increase of the enhancement in the near-resonant part, but the TL-normalized final state population maximum is constant by varying the laser central frequency. These studies can provide a clear physical picture for understanding the coherent features of the resonance-mediated two-photon absorption, and can also provide a theoretical guidance for the future applications. 展开更多
关键词 TL Coherent features of Resonance-Mediated Two-Photon Absorption enhancement by Varying the Energy Level Structure Laser Spectrum Bandwidth and Central Frequency
下载PDF
Feature Enhanced Stacked Auto Encoder for Diseases Detection in Brain MRI
11
作者 Umair Muneer Butt Rimsha Arif +2 位作者 Sukumar Letchmunan Babur Hayat Malik Muhammad Adil Butt 《Computers, Materials & Continua》 SCIE EI 2023年第8期2551-2570,共20页
The detection of brain disease is an essential issue in medical and research areas.Deep learning techniques have shown promising results in detecting and diagnosing brain diseases using magnetic resonance imaging(MRI)... The detection of brain disease is an essential issue in medical and research areas.Deep learning techniques have shown promising results in detecting and diagnosing brain diseases using magnetic resonance imaging(MRI)images.These techniques involve training neural networks on large datasets of MRI images,allowing the networks to learn patterns and features indicative of different brain diseases.However,several challenges and limitations still need to be addressed further to improve the accuracy and effectiveness of these techniques.This paper implements a Feature Enhanced Stacked Auto Encoder(FESAE)model to detect brain diseases.The standard stack auto encoder’s results are trivial and not robust enough to boost the system’s accuracy.Therefore,the standard Stack Auto Encoder(SAE)is replaced with a Stacked Feature Enhanced Auto Encoder with a feature enhancement function to efficiently and effectively get non-trivial features with less activation energy froman image.The proposed model consists of four stages.First,pre-processing is performed to remove noise,and the greyscale image is converted to Red,Green,and Blue(RGB)to enhance feature details for discriminative feature extraction.Second,feature Extraction is performed to extract significant features for classification using DiscreteWavelet Transform(DWT)and Channelization.Third,classification is performed to classify MRI images into four major classes:Normal,Tumor,Brain Stroke,and Alzheimer’s.Finally,the FESAE model outperforms the state-of-theart,machine learning,and deep learning methods such as Artificial Neural Network(ANN),SAE,Random Forest(RF),and Logistic Regression(LR)by achieving a high accuracy of 98.61% on a dataset of 2000 MRI images.The proposed model has significant potential for assisting radiologists in diagnosing brain diseases more accurately and improving patient outcomes. 展开更多
关键词 Brain diseases deep learning feature enhanced stacked auto encoder stack auto encoder
下载PDF
Enhanced Feature Fusion Segmentation for Tumor Detection Using Intelligent Techniques
12
作者 R.Radha R.Gopalakrishnan 《Intelligent Automation & Soft Computing》 SCIE 2023年第3期3113-3127,共15页
In thefield of diagnosis of medical images the challenge lies in tracking and identifying the defective cells and the extent of the defective region within the complex structure of a brain cavity.Locating the defective... In thefield of diagnosis of medical images the challenge lies in tracking and identifying the defective cells and the extent of the defective region within the complex structure of a brain cavity.Locating the defective cells precisely during the diagnosis phase helps tofight the greatest exterminator of mankind.Early detec-tion of these defective cells requires an accurate computer-aided diagnostic system(CAD)that supports early treatment and promotes survival rates of patients.An ear-lier version of CAD systems relies greatly on the expertise of radiologist and it con-sumed more time to identify the defective region.The manuscript takes the efficacy of coalescing features like intensity,shape,and texture of the magnetic resonance image(MRI).In the Enhanced Feature Fusion Segmentation based classification method(EEFS)the image is enhanced and segmented to extract the prominent fea-tures.To bring out the desired effect the EEFS method uses Enhanced Local Binary Pattern(EnLBP),Partisan Gray Level Co-occurrence Matrix Histogram of Oriented Gradients(PGLCMHOG),and iGrab cut method to segment image.These prominent features along with deep features are coalesced to provide a single-dimensional fea-ture vector that is effectively used for prediction.The coalesced vector is used with the existing classifiers to compare the results of these classifiers with that of the gen-erated vector.The generated vector provides promising results with commendably less computatio nal time for pre-processing and classification of MR medical images. 展开更多
关键词 Enhanced local binary pattern LEVEL iGrab cut method magnetic resonance image computer aided diagnostic system enhanced feature fusion segmentation enhanced local binary pattern
下载PDF
Automated Extraction for Water Bodies Using New Water Index from Landsat 8 OLI Images 被引量:2
13
作者 Pu YAN Yue FANG +2 位作者 Jie CHEN Gang WANG Qingwei TANG 《Journal of Geodesy and Geoinformation Science》 CSCD 2023年第1期59-75,共17页
The extraction of water bodies is essential for monitoring water resources,ecosystem services and the hydrological cycle,so analyzing water bodies from remote sensing images is necessary.The water index is designed to... The extraction of water bodies is essential for monitoring water resources,ecosystem services and the hydrological cycle,so analyzing water bodies from remote sensing images is necessary.The water index is designed to highlight water bodies in remote sensing images.We employ a new water index and digital image processing technology to extract water bodies automatically and accurately from Landsat 8 OLI images.Firstly,we preprocess Landsat 8 OLI images with radiometric calibration and atmospheric correction.Subsequently,we apply KT transformation,LBV transformation,AWEI nsh,and HIS transformation to the preprocessed image to calculate a new water index.Then,we perform linear feature enhancement and improve the local adaptive threshold segmentation method to extract small water bodies accurately.Meanwhile,we employ morphological enhancement and improve the local adaptive threshold segmentation method to extract large water bodies.Finally,we combine small and large water bodies to get complete water bodies.Compared with other traditional methods,our method has apparent advantages in water extraction,particularly in the extraction of small water bodies. 展开更多
关键词 water bodies extraction Landsat 8 OLI images water index improved local adaptive threshold segmentation linear feature enhancement
下载PDF
EFECL:Feature encoding enhancement with contrastive learning for indoor 3D object detection
14
作者 Yao Duan Renjiao Yi +2 位作者 Yuanming Gao Kai Xu Chenyang Zhu 《Computational Visual Media》 SCIE EI CSCD 2023年第4期875-892,共18页
Good proposal initials are critical for 3D object detection applications.However,due to the significant geometry variation of indoor scenes,incomplete and noisy proposals are inevitable in most cases.Mining feature in... Good proposal initials are critical for 3D object detection applications.However,due to the significant geometry variation of indoor scenes,incomplete and noisy proposals are inevitable in most cases.Mining feature information among these“bad”proposals may mislead the detection.Contrastive learning provides a feasible way for representing proposals,which can align complete and incomplete/noisy proposals in feature space.The aligned feature space can help us build robust 3D representation even if bad proposals are given.Therefore,we devise a new contrast learning framework for indoor 3D object detection,called EFECL,that learns robust 3D representations by contrastive learning of proposals on two different levels.Specifically,we optimize both instance-level and category-level contrasts to align features by capturing instance-specific characteristics and semantic-aware common patterns.Furthermore,we propose an enhanced feature aggregation module to extract more general and informative features for contrastive learning.Evaluations on ScanNet V2 and SUN RGB-D benchmarks demonstrate the generalizability and effectiveness of our method,and our method can achieve 12.3%and 7.3%improvements on both datasets over the benchmark alternatives.The code and models are publicly available at https://github.com/YaraDuan/EFECL. 展开更多
关键词 indoor scene object detection contrastive learning feature enhancement
原文传递
Infrared Small Target Detection Algorithm Based on ISTD-CenterNet
15
作者 Ning Li Shucai Huang Daozhi Wei 《Computers, Materials & Continua》 SCIE EI 2023年第12期3511-3531,共21页
This paper proposes a real-time detection method to improve the Infrared small target detection CenterNet(ISTD-CenterNet)network for detecting small infrared targets in complex environments.The method eliminates the n... This paper proposes a real-time detection method to improve the Infrared small target detection CenterNet(ISTD-CenterNet)network for detecting small infrared targets in complex environments.The method eliminates the need for an anchor frame,addressing the issues of low accuracy and slow speed.HRNet is used as the framework for feature extraction,and an ECBAM attention module is added to each stage branch for intelligent identification of the positions of small targets and significant objects.A scale enhancement module is also added to obtain a high-level semantic representation and fine-resolution prediction map for the entire infrared image.Besides,an improved sensory field enhancement module is designed to leverage semantic information in low-resolution feature maps,and a convolutional attention mechanism module is used to increase network stability and convergence speed.Comparison experiments conducted on the infrared small target data set ESIRST.The experiments show that compared to the benchmark network CenterNet-HRNet,the proposed ISTD-CenterNet improves the recall by 22.85%and the detection accuracy by 13.36%.Compared to the state-of-the-art YOLOv5small,the ISTD-CenterNet recall is improved by 5.88%,the detection precision is improved by 2.33%,and the detection frame rate is 48.94 frames/sec,which realizes the accurate real-time detection of small infrared targets. 展开更多
关键词 Infrared small target detection CenterNet data enhancement feature enhancement attention mechanism
下载PDF
Underwater Sea Cucumber Target Detection Based on Edge-Enhanced Scaling YOLOv4
16
作者 Ziting Zhang Hang Zhang +3 位作者 Yue Wang Tonghai Liu Yuxiang He Yunchen Tian 《Journal of Beijing Institute of Technology》 EI CAS 2023年第3期328-340,共13页
Sea cucumber detection is widely recognized as the key to automatic culture.The underwater light environment is complex and easily obscured by mud,sand,reefs,and other underwater organisms.To date,research on sea cucu... Sea cucumber detection is widely recognized as the key to automatic culture.The underwater light environment is complex and easily obscured by mud,sand,reefs,and other underwater organisms.To date,research on sea cucumber detection has mostly concentrated on the distinction between prospective objects and the background.However,the key to proper distinction is the effective extraction of sea cucumber feature information.In this study,the edge-enhanced scaling You Only Look Once-v4(YOLOv4)(ESYv4)was proposed for sea cucumber detection.By emphasizing the target features in a way that reduced the impact of different hues and brightness values underwater on the misjudgment of sea cucumbers,a bidirectional cascade network(BDCN)was used to extract the overall edge greyscale image in the image and add up the original RGB image as the detected input.Meanwhile,the YOLOv4 model for backbone detection is scaled,and the number of parameters is reduced to 48%of the original number of parameters.Validation results of 783images indicated that the detection precision of positive sea cucumber samples reached 0.941.This improvement reflects that the algorithm is more effective to improve the edge feature information of the target.It thus contributes to the automatic multi-objective detection of underwater sea cucumbers. 展开更多
关键词 sea cucumber edge extraction feature enhancement edge-enhanced scaling You Only Look Once-v4(YOLOv4)(ESYv4) model scaling
下载PDF
Video expression recognition based on frame-level attention mechanism
17
作者 陈瑞 TONG Ying +1 位作者 ZHANG Yiye XU Bo 《High Technology Letters》 EI CAS 2023年第2期130-139,共10页
Facial expression recognition(FER) in video has attracted the increasing interest and many approaches have been made.The crucial problem of classifying a given video sequence into several basic emotions is how to fuse... Facial expression recognition(FER) in video has attracted the increasing interest and many approaches have been made.The crucial problem of classifying a given video sequence into several basic emotions is how to fuse facial features of individual frames.In this paper, a frame-level attention module is integrated into an improved VGG-based frame work and a lightweight facial expression recognition method is proposed.The proposed network takes a sub video cut from an experimental video sequence as its input and generates a fixed-dimension representation.The VGG-based network with an enhanced branch embeds face images into feature vectors.The frame-level attention module learns weights which are used to adaptively aggregate the feature vectors to form a single discriminative video representation.Finally, a regression module outputs the classification results.The experimental results on CK+and AFEW databases show that the recognition rates of the proposed method can achieve the state-of-the-art performance. 展开更多
关键词 facial expression recognition(FER) video sequence attention mechanism feature extraction enhanced feature VGG network image classification neural network
下载PDF
多尺度特征图分类再提取的目标检测算法 被引量:9
18
作者 尹震宇 樊超 +2 位作者 赵志浩 黄哲 张飞青 《小型微型计算机系统》 CSCD 北大核心 2021年第3期536-541,共6页
为提高SSD算法检测目标的能力,提出了一种对多尺度特征图进行分类再提取的目标检测算法.该算法将SSD特征金字塔中多个不同尺度的特征图分为低层和高层两类特征图.针对低层特征图所处位置网络深度不够导致的特征表示能力不足,设计了SFE(S... 为提高SSD算法检测目标的能力,提出了一种对多尺度特征图进行分类再提取的目标检测算法.该算法将SSD特征金字塔中多个不同尺度的特征图分为低层和高层两类特征图.针对低层特征图所处位置网络深度不够导致的特征表示能力不足,设计了SFE(Shallow Feature Enhancement)模块提取特征从而增加网络深度,最终提高对小目标的检测能力;高层特征图采用两段反卷积的方式,将更深层提取的语义信息融入特征图,从而提高对中等目标和大目标的检测能力.同时提出了减弱反卷积棋盘格效应的回流式反卷积的方法.在Pascal VOC 2007测试集上进行验证后的结果表明,本文算法的mAP值达到了79.6%,相比SSD算法提高了2.4%. 展开更多
关键词 目标检测 SSD shallow feature enhancement(SFE) 反卷积
下载PDF
Multi-Sensor Image Fusion: A Survey of the State of the Art
19
作者 Bing Li Yong Xian +3 位作者 Daqiao Zhang Juan Su Xiaoxiang Hu Weilin Guo 《Journal of Computer and Communications》 2021年第6期73-108,共36页
Image fusion has been developing into an important area of research. In remote sensing, the use of the same image sensor in different working modes, or different image sensors, can provide reinforcing or complementary... Image fusion has been developing into an important area of research. In remote sensing, the use of the same image sensor in different working modes, or different image sensors, can provide reinforcing or complementary information. Therefore, it is highly valuable to fuse outputs from multiple sensors (or the same sensor in different working modes) to improve the overall performance of the remote images, which are very useful for human visual perception and image processing task. Accordingly, in this paper, we first provide a comprehensive survey of the state of the art of multi-sensor image fusion methods in terms of three aspects: pixel-level fusion, feature-level fusion and decision-level fusion. An overview of existing fusion strategies is then introduced, after which the existing fusion quality measures are summarized. Finally, this review analyzes the development trends in fusion algorithms that may attract researchers to further explore the research in this field. 展开更多
关键词 Multi-Sensor Image Fusion Fusion Strategy feature enhancement Fusion Performance Assessment
下载PDF
森林优化特征选择算法的增强与扩展 被引量:8
20
作者 刘兆赓 李占山 +2 位作者 王丽 王涛 于海鸿 《软件学报》 EI CSCD 北大核心 2020年第5期1511-1524,共14页
特征选择作为一种重要的数据预处理方法,不但能解决维数灾难问题,还能提高算法的泛化能力.各种各样的方法已被应用于解决特征选择问题,其中,基于演化计算的特征选择算法近年来获得了更多的关注并取得了一些成功.近期研究结果表明,森林... 特征选择作为一种重要的数据预处理方法,不但能解决维数灾难问题,还能提高算法的泛化能力.各种各样的方法已被应用于解决特征选择问题,其中,基于演化计算的特征选择算法近年来获得了更多的关注并取得了一些成功.近期研究结果表明,森林优化特征选择算法具有更好的分类性能及维度缩减能力.然而,初始化阶段的随机性、全局播种阶段的人为参数设定,影响了该算法的准确率和维度缩减能力;同时,算法本身存在着高维数据处理能力不足的本质缺陷.从信息增益率的角度给出了一种初始化策略,在全局播种阶段,借用模拟退火控温函数的思想自动生成参数,并结合维度缩减率给出了适应度函数;同时,针对形成的优质森林采取贪心算法,形成一种特征选择算法EFSFOA(enhanced feature selection using forest optimization algorithm).此外,在面对高维数据的处理时,采用集成特征选择的方案形成了一个适用于EFSFOA的集成特征选择框架,使其能够有效处理高维数据特征选择问题.通过设计对比实验,验证了EFSFOA与FSFOA相比在分类准确率和维度缩减率上均有明显的提高,高维数据处理能力更是提高到了100 000维.将EFSFOA与近年来提出的比较高效的基于演化计算的特征选择方法进行对比,EFSFOA仍具有很强的竞争力. 展开更多
关键词 enhanced feature selection using forest optimization algorithm(EFSFOA) 高维 特征选择 演化计算
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部