490 articles found
Grid Side Distributed Energy Storage Cloud Group End Region Hierarchical Time-Sharing Configuration Algorithm Based on Multi-Scale and Multi Feature Convolution Neural Network
1
Authors: Wen Long, Bin Zhu, Huaizheng Li, Yan Zhu, Zhiqiang Chen, Gang Cheng. 《Energy Engineering》 (EI), 2023, No. 5, pp. 1253-1269 (17 pages)
There is instability in the distributed energy storage cloud group end region on the power grid side. In order to avoid large-scale fluctuating charging and discharging in the power grid environment and make the capacitor components show a continuous and stable charging and discharging state, a hierarchical time-sharing configuration algorithm for the distributed energy storage cloud group end region on the power grid side, based on a multi-scale and multi feature convolution neural network, is proposed. Firstly, a voltage stability analysis model based on the multi-scale and multi feature convolution neural network is constructed, and the network is optimized with the Self-Organizing Maps (SOM) algorithm to analyze the voltage stability of the cloud group end region of distributed energy storage on the grid side under the framework of credibility. According to the optimal scheduling objectives and network size, the distributed robust optimal configuration control model is solved under the framework of coordinated optimal scheduling at multiple time scales. Finally, the time series characteristics of regional power grid load and distributed generation are analyzed. According to the regional hierarchical time-sharing configuration model of the “cloud”, “group” and “end” layers, the grid side distributed energy storage cloud group end regional hierarchical time-sharing configuration algorithm is realized. The experimental results show that after applying this algorithm, the best grid side distributed energy storage configuration scheme can be determined, and the stability of the grid side distributed energy storage cloud group end region layered time-sharing configuration can be improved.
Keywords: Multiscale and multi feature convolution neural network; distributed energy storage at grid side; cloud group end region; layered time-sharing configuration algorithm
Ozone Depletion Identification in Stratosphere Through Faster Region-Based Convolutional Neural Network
2
Authors: Bakhtawar Aslam, Ziyad Awadh Alrowaili, Bushra Khaliq, Jaweria Manzoor, Saira Raqeeb, Fahad Ahmad. 《Computers, Materials & Continua》 (SCIE, EI), 2021, No. 8, pp. 2159-2178 (20 pages)
The concept of classification through deep learning is to build a model that skillfully separates a dataset of closely related images into different classes on the basis of small but continuous variations that take place in physical systems over time and have substantial effects. This study identifies ozone depletion through classification using the Faster Region-Based Convolutional Neural Network (F-RCNN). The main advantage of F-RCNN is that it accumulates bounding boxes on images to differentiate the depleted and non-depleted regions. Furthermore, the primary goal of image classification is to accurately predict the target class of each minutely varied case in the dataset based on ozone saturation. The permanent changes in climate are of serious concern. The leading causes behind these destructive variations are ozone layer depletion, greenhouse gas release, deforestation, pollution, water resources contamination, and UV radiation. This research focuses on prediction by identifying ozone layer depletion because it causes many health issues, e.g., skin cancer, damage to marine life, crop damage, and impacts on the immune systems of living beings. We have tried to classify the ozone image dataset into two major classes, depleted and non-depleted regions, to extract the required persuading features through F-RCNN. Furthermore, CNN has been used for feature extraction in the existing literature, and the extracted RoIs are passed on to the CNN for grouping purposes. It is difficult to manage and differentiate those RoIs after grouping, which negatively affects the gathered results. The classification outcomes of the F-RCNN approach are proficient and demonstrate that general accuracy lies between 91% and 93% in identifying climate variation through ozone concentration classification, that is, whether the region in the image under consideration is depleted or non-depleted. Our proposed model achieved 93% accuracy, and it outperforms the prevailing techniques.
Keywords: Deep learning; image processing; classification; climate variation; ozone layer; depleted region; non-depleted region; UV radiation; faster region-based convolutional neural network
Hybrid Convolutional Neural Network for Plant Diseases Prediction
3
Authors: S. Poornima, N. Sripriya, Adel Fahad Alrasheedi, S. S. Askar, Mohamed Abouhawwash. 《Intelligent Automation & Soft Computing》 (SCIE), 2023, No. 5, pp. 2393-2409 (17 pages)
Plant disease prediction is an essential technique for preventing yield loss and achieving high production of agricultural products. Continuous monitoring of plant health and detection of diseases is significant for sustainable agriculture. Manual systems for monitoring plant diseases are time consuming and report many errors. There is high demand for technology that detects plant diseases automatically. Recently, image processing and deep learning approaches have been widely applied to the detection of plant diseases. Diseases such as late blight, bacterial spot, Septoria leaf spot and yellow leaf curl are widely found in plants. These are the main factors affecting plant life and yield. To identify diseases as early as possible, our research presents a hybrid method that combines the region based convolutional neural network (RCNN) and region based fully convolutional networks (RFCN) for classifying the diseases. First, leaf images of plants are collected and preprocessed to remove noisy data in the image. Further, data normalization, augmentation and removal of background noise are performed. The images are divided into testing and training sets, and the training images are fed as input to the deep learning architecture. First, we identify the region of interest (RoI) by using selective search. In every region, convolutional neural network (CNN) features are extracted independently for further classification. Plants such as tomato, potato and bell pepper are used for this experiment. The input plant image is analyzed and classified as a healthy or unhealthy plant. If the image is detected as unhealthy, the type of disease affecting the plant is displayed. Our proposed technique achieves 98.5% accuracy in predicting plant diseases.
Keywords: Disease detection; people detection; image classification; deep learning; region based convolutional neural network
Enhanced Detection of Glaucoma on Ensemble Convolutional Neural Network for Clinical Informatics (Cited by: 1)
4
Authors: D. Stalin David, S. Arun Mozhi Selvi, S. Sivaprakash, P. Vishnu Raja, Dilip Kumar Sharma, Pankaj Dadheech, Sudhakar Sengan. 《Computers, Materials & Continua》 (SCIE, EI), 2022, No. 2, pp. 2563-2579 (17 pages)
Irretrievable loss of vision is the predominant result of glaucoma in the retina. Recently, multiple approaches have paid attention to the automatic detection of glaucoma on fundus images. Due to the interlacing of blood vessels and the herculean task involved in glaucoma detection, locating the exact affected site of the optic disc, whether the cup is small or large, is deemed challenging. Spatially Based Ellipse Fitting Curve Model (SBEFCM) classification is suggested, based on the Ensemble, for a reliable diagnosis of glaucoma at the Optic Cup (OC) and Optic Disc (OD) boundary. This research deploys Ensemble Convolutional Neural Network (CNN) classification for classifying glaucoma or Diabetic Retinopathy (DR). The detection of the boundary between the OC and the OD is performed by the SBEFCM, which is the latest weighted ellipse fitting model. The SBEFCM, which enhances and widens the multi-ellipse fitting technique, is proposed here. The input fundus image is preprocessed, and blood vessels are segmented, to avoid interlacing of surrounding tissues and blood vessels. The determination of the OC and OD boundary, which characterizes many output factors for glaucoma detection, has been developed with Ensemble CNN classification, which includes detecting sensitivity, specificity, precision, and Area Under the receiver operating characteristic Curve (AUC) values accurately by an innovative SBEFCM. In comparison, the proposed Ensemble CNN significantly outperformed the current methods.
Keywords: Glaucoma and diabetic retinopathy detection; ensemble convolutional neural network; spatially based ellipse fitting curve; optic disk; optic cup
Age Invariant Face Recognition Using Convolutional Neural Networks and Set Distances (Cited by: 4)
5
Authors: Hachim El Khiyari, Harry Wechsler. 《Journal of Information Security》, 2017, No. 3, pp. 174-185 (12 pages)
Biometric security systems based on facial characteristics face a challenging task due to variability in the intrapersonal facial appearance of subjects traced to factors such as pose, illumination, expression and aging. This paper proposes a deep learning and set-based approach to face recognition subject to aging. The images for each subject taken at various times are treated as a single set, which is then compared to sets of images belonging to other subjects. Facial features are extracted using a convolutional neural network characteristic of deep learning. Our experimental results show that set-based recognition performs better than the singleton-based approach for both face identification and face verification. We also find that by using set-based recognition, it is easier to recognize older subjects from younger ones than younger subjects from older ones.
Keywords: Aging; biometrics; convolutional neural networks (CNN); deep learning; image set-based face recognition (ISFR); transfer learning
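As a rough illustration of the set-based comparison described in the abstract above, the sketch below computes two common distances between sets of feature vectors: the minimum pairwise distance and the Hausdorff distance. The abstract does not state which set distances the authors used, so both choices, the Euclidean base metric, the function names, and the toy 2-D "feature vectors" are assumptions made for illustration.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def min_set_distance(set_a, set_b):
    """Smallest distance between any vector in set_a and any vector in set_b."""
    return min(euclidean(a, b) for a in set_a for b in set_b)


def hausdorff_distance(set_a, set_b):
    """Largest distance from a vector in one set to its nearest vector in the other."""
    def directed(xs, ys):
        return max(min(euclidean(x, y) for y in ys) for x in xs)
    return max(directed(set_a, set_b), directed(set_b, set_a))


# Toy example: each "subject" is a set of 2-D vectors (hypothetical data).
subject_a = [(0.0, 0.0), (1.0, 0.0)]
subject_b = [(0.0, 3.0), (6.0, 0.0)]
print(min_set_distance(subject_a, subject_b))    # 3.0
print(hausdorff_distance(subject_a, subject_b))  # 5.0
```

In a real pipeline the vectors would be CNN features extracted from each subject's images, and a query set would be assigned to the enrolled subject with the smallest set distance.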
Ensemble Based Learning with Accurate Motion Contrast Detection
6
Authors: M. Indirani, S. Shankar. 《Intelligent Automation & Soft Computing》 (SCIE), 2023, No. 2, pp. 1657-1674 (18 pages)
Recent developments in computer vision applications have enabled detection of significant visual objects in video streams. Studies in the literature have detected objects from video streams using Spatiotemporal Particle Swarm Optimization (SPSOM) and Incremental Deep Convolution Neural Networks (IDCNN) for detecting multiple objects. However, those studies considered optical flows, which results in assessing motion contrasts. Existing methods have issues with accuracy and error rates in motion contrast detection. Hence, the overall object detection performance is reduced significantly. Thus, efficient consideration of object motions in videos is a critical issue to be solved. To overcome the above-mentioned problems, this research work proposes a method involving ensemble approaches to detect objects efficiently from video streams. This work uses a system modeled on swarm optimization and ensemble learning called the Spatiotemporal Glowworm Swarm Optimization Model (SGSOM) for detecting multiple significant objects. A steady quality in motion contrasts is maintained by using a Chebyshev distance matrix. The proposed system achieves global optimization in its multiple object detection by exploiting spatial/temporal cues and local constraints. Experimental results show that the proposed system scores 4.8% in Mean Absolute Error (MAE) while achieving 86% accuracy, 81.5% precision, 85% recall and 81.6% F-measure, thus proving its utility in detecting multiple objects.
Keywords: Multiple significant objects; ensemble based learning; modified pooling layer based convolutional neural network; spatiotemporal glowworm swarm optimization model
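The abstract above mentions a Chebyshev distance matrix without giving details. As a minimal sketch only, the code below shows what a pairwise Chebyshev (maximum-coordinate-difference) distance matrix looks like for a few vectors. How the paper builds its vectors from motion contrasts is not stated, so the sample vectors and function names here are hypothetical.

```python
def chebyshev_distance(u, v):
    """Chebyshev distance: the largest absolute coordinate difference."""
    return max(abs(a - b) for a, b in zip(u, v))


def chebyshev_distance_matrix(vectors):
    """Symmetric matrix of pairwise Chebyshev distances."""
    return [[chebyshev_distance(u, v) for v in vectors] for u in vectors]


# Hypothetical 2-D vectors standing in for per-region motion descriptors.
vectors = [(0, 0), (3, 1), (-2, 5)]
for row in chebyshev_distance_matrix(vectors):
    print(row)
# [0, 3, 5]
# [3, 0, 5]
# [5, 5, 0]
```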
Facial Expression Recognition Using Enhanced Convolution Neural Network with Attention Mechanism (Cited by: 2)
7
Authors: K. Prabhu, S. SathishKumar, M. Sivachitra, S. Dineshkumar, P. Sathiyabama. 《Computer Systems Science & Engineering》 (SCIE, EI), 2022, No. 4, pp. 415-426 (12 pages)
Facial Expression Recognition (FER) has been an interesting area of research wherever there is human-computer interaction. Human psychology, emotions and behaviors can be analyzed in FER. Classifiers used in FER have performed very well on normal faces but have been found to be constrained on occluded faces. Recently, Deep Learning Techniques (DLT) have gained popularity in applications to real-world problems, including recognition of human emotions. The human face reflects emotional states and human intentions. An expression is the most natural and powerful way of communicating non-verbally. Systems that enable communication between humans and machines are termed Human Machine Interaction (HMI) systems. FER can improve HMI systems, as human expressions convey useful information to an observer. This paper proposes a FER scheme called EECNN (Enhanced Convolution Neural Network with Attention mechanism) to recognize seven types of human emotions, with satisfying results in its experiments. The proposed EECNN achieved 89.8% accuracy in classifying the images.
Keywords: Facial expression recognition; linear discriminant analysis; animal migration optimization; regions of interest; enhanced convolution neural network with attention mechanism
Desertification Detection in Makkah Region based on Aerial Images Classification
8
Authors: Yahia Said, Mohammad Barr, Taoufik Saidani, Mohamed Atri. 《Computer Systems Science & Engineering》 (SCIE, EI), 2022, No. 2, pp. 607-618 (12 pages)
Desertification has become a global threat and caused a crisis, especially in Middle Eastern countries such as Saudi Arabia. Makkah is one of the most important cities in Saudi Arabia that needs to be protected from desertification. The vegetation area in Makkah has been damaged by desertification through wind, floods, overgrazing, and global climate change. The damage caused by desertification can be recovered provided urgent action is taken to prevent further degradation of the vegetation area. In this paper, we propose an automatic desertification detection system based on Deep Learning techniques. Aerial images are classified using Convolutional Neural Networks (CNN) to detect land state variation in real time. CNNs have been widely used for computer vision applications, such as image classification, image segmentation, and quality enhancement. The proposed CNN model was trained and evaluated on the Aerial Image Dataset (AID). Compared to state-of-the-art methods, the proposed model has better performance while being suitable for embedded implementation. It has achieved high efficiency with 96.47% accuracy. In light of the current research, we assert the appropriateness of the proposed CNN model for detecting desertification from aerial images.
Keywords: Desertification detection; deep learning; convolutional neural networks (CNN); aerial images classification; Makkah region
Novel region-based image compression method based on spiking cortical model
9
Authors: Rongchang Zhao, Yide Ma. 《Journal of Systems Engineering and Electronics》 (SCIE, EI, CSCD), 2015, No. 1, pp. 161-171 (11 pages)
To obtain a high compression ratio as well as a high-quality reconstructed image, an effective image compression scheme named irregular segmentation region coding based on spiking cortical model (ISRCS) is presented. This scheme is region-based and mainly focuses on two issues. Firstly, an appropriate segmentation algorithm is developed to partition an image into some irregular regions and tidy contours, where the crucial regions corresponding to objects are retained and many tiny parts are eliminated. The irregular regions and contours are then coded using different methods in the next step. The other issue is the coding method for contours, where an efficient and novel chain code is employed. This scheme tries to find a compromise between the quality of reconstructed images and the compression ratio. Experiments are conducted, and the results show higher performance compared with other compression technologies in terms of higher quality of reconstructed images, higher compression ratio and lower time consumption.
Keywords: data compaction and compression; image processing and computer vision; region-based image coding; neural network
Detection of ocean internal waves based on Faster R-CNN in SAR images (Cited by: 4)
10
Authors: BAO Sude, MENG Junmin, SUN Lina, LIU Yongxin. 《Journal of Oceanology and Limnology》 (SCIE, CAS, CSCD), 2020, No. 1, pp. 55-63 (9 pages)
Ocean internal waves appear as irregular bright and dark stripes on synthetic aperture radar (SAR) remote sensing images. Detecting ocean internal waves in SAR images has consequently become a difficult and popular research topic. In this paper, ocean internal waves are detected in SAR images by employing the faster regions with convolutional neural network features (Faster R-CNN) framework; for this purpose, 888 internal wave samples are utilized to train the convolutional network and identify internal waves. The experimental results demonstrate a 94.78% recognition rate for internal waves, and the average detection speed is 0.22 s/image. In addition, the detection results for internal wave samples under different conditions are analyzed. This paper lays a foundation for detecting ocean internal waves using convolutional neural networks.
Keywords: ocean internal waves; faster regions with convolutional neural network features (Faster R-CNN); convolutional neural network; synthetic aperture radar (SAR) image; region proposal network (RPN)
An Efficient Deep Learning-based Content-based Image Retrieval Framework (Cited by: 1)
11
Authors: M. Sivakumar, N. M. Saravana Kumar, N. Karthikeyan. 《Computer Systems Science & Engineering》 (SCIE, EI), 2022, No. 11, pp. 683-700 (18 pages)
The use of massive image databases has increased drastically over the past few years due to the evolution of multimedia technology. Image retrieval has become one of the vital tools in image processing applications. Content-Based Image Retrieval (CBIR) has been widely used in varied applications. However, the results produced using a single image feature are not satisfactory, so multiple image features are often used to attain better results. Fast and effective searching for relevant images in a database then becomes a challenging task. In the existing system, CBIR used a combined feature extraction technique based on color auto-correlogram, Rotation-Invariant Uniform Local Binary Patterns (RULBP) and local energy. However, the existing system does not provide significant results in terms of recall and precision. Also, the computational complexity of existing CBIR systems is higher. To handle the above-mentioned issues, the Gray Level Co-occurrence Matrix (GLCM) with Deep Learning based Enhanced Convolution Neural Network (DLECNN) is proposed in this work. The proposed framework includes noise reduction using histogram equalization, feature extraction using GLCM, similarity matching using the Hierarchal and Fuzzy c-Means (HFCM) algorithm, and image retrieval using the DLECNN algorithm. Histogram equalization is used for image enhancement; the enhanced image has a uniform histogram. Then, the GLCM method is used to extract features such as shape, texture, colour, annotations and keywords. The HFCM similarity measure is used to compute the similarity index between the query image vector and every database image. To enhance the performance of this image retrieval approach, the DLECNN algorithm is proposed to retrieve more accurate features of the image. The proposed GLCM+DLECNN algorithm provides better results, with high accuracy, precision, recall and F-measure and lower complexity. The experimental results clearly show that the proposed system provides efficient image retrieval for the given query image.
Keywords: Content based image retrieval (CBIR); improved gray level cooccurrence matrix (GLCM); hierarchal and fuzzy C-means (HFCM) algorithm; deep learning based enhanced convolution neural network (DLECNN)
Web3D Learning Framework for 3D Shape Retrieval Based on Hybrid Convolutional Neural Networks (Cited by: 1)
12
Authors: Wen Zhou, Jinyuan Jia, Chengxi Huang, Yongqing Cheng. 《Tsinghua Science and Technology》 (SCIE, EI, CAS, CSCD), 2020, No. 1, pp. 93-102 (10 pages)
With the rapid development of Web3D technologies, sketch-based model retrieval has become an increasingly important challenge, while the application of Virtual Reality and 3D technologies has made shape retrieval of furniture over a web browser feasible. In this paper, we propose a learning framework for shape retrieval based on two Siamese VGG-16 Convolutional Neural Networks (CNNs), and a CNN-based hybrid learning algorithm to select the best view for a shape. In this algorithm, the AlexNet and VGG-16 CNN architectures are used to perform classification tasks and to extract features, respectively. In addition, a feature fusion method is used to measure the similarity relation of the output features from the two Siamese networks. The proposed framework can provide new alternatives for furniture retrieval in the Web3D environment. The primary innovation is in the employment of deep learning methods to solve the challenge of obtaining the best view of 3D furniture, and to address cross-domain feature learning problems. We conduct an experiment to verify the feasibility of the framework, and the results show our approach to be superior in comparison to many mainstream state-of-the-art approaches.
Keywords: Web3D; sketch-based model retrieval; convolutional neural networks (CNNs); best view; cross-domain
Deep Learning Framework for Classification of Emoji Based Sentiments
13
Authors: Nighat Parveen Shaikh, Mumtaz Hussain Mahar. 《Computers, Materials & Continua》 (SCIE, EI), 2022, No. 8, pp. 3145-3158 (14 pages)
Recent patterns of human sentiment are highly influenced by emoji based sentiments (EBS). Social media users widely use EBS in text messages, tweets and posts. Although tiny emoji pictures contain sufficient information to be considered for constructing a classification model, the wide range of dissimilar, heterogeneous and complex emoji patterns with similar meanings (SM) has made this one of the significant research areas of machine vision. This paper proposes an approach to provide meticulous assistance to social media application (SMA) users in classifying EBS. The proposed methodology consists of three layers. The first layer deals with data cleaning and feature selection techniques to detect dissimilar emoji patterns (DEP) with similar meanings (SM). In the first sub-step, a set of emoji is input; in the second sub-step, every emoji has to meet a user-defined threshold; in the third sub-step, the algorithm detects every emoji by treating it as an object; and in the fourth sub-step, the emoji images are cropped. After data cleaning, these tiny images are saved as emoji images. In the second step, we build a classification model using convolutional neural networks (CNN) to explore the hidden knowledge of emoji datasets. In the third step, we present results visualization using a confusion matrix and other estimations. This paper contributes (1) a data cleaning method to detect EBS and (2) the highest classification accuracy for emoji classification, measured as 97.63%.
Keywords: Deep learning; machine vision; convolutional neural networks; social media; emoji based sentiments
Research on Pedestrian Detection Technology Based on MSR and Faster R-CNN
14
Authors: Xueyun Zhao, Chaoju Hu. 《Journal of Computer and Communications》, 2018, No. 7, pp. 54-63 (10 pages)
To address poor illumination characteristics and inaccurate positioning, this paper proposes a pedestrian detection algorithm suitable for low-light environments. The algorithm first applies the multi-scale Retinex image enhancement algorithm to the sample pre-processing stage of deep learning to improve the image resolution. It then uses the faster regional convolutional neural network to train the pedestrian detection model, extract pedestrian characteristics, and obtain bounding boxes through classification and position regression. Finally, the Soft-NMS algorithm is introduced into the pedestrian detection process, and redundant bounding boxes are eliminated to obtain the best pedestrian detection position. The experimental results show that the proposed detection algorithm achieves an average accuracy of 89.74% on the low-light dataset, and the pedestrian detection effect is more significant.
Keywords: Deep learning; pedestrian detection; region-based convolutional neural network; image enhancement; non-maximum suppression
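The last step of the abstract above relies on Soft-NMS. As a rough illustration only, the sketch below implements the Gaussian variant of Soft-NMS in plain Python: instead of discarding boxes that overlap the current best box, it decays their scores by exp(-IoU² / sigma). The abstract does not say which variant or parameters the authors used, so `sigma`, `score_threshold`, the function names, and the sample boxes are all assumptions.

```python
import math


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, sigma=0.5, score_threshold=0.001):
    """Gaussian Soft-NMS. Returns (box_index, final_score) in selection order."""
    candidates = [(float(s), i) for i, s in enumerate(scores)]
    kept = []
    while candidates:
        best = max(candidates)  # highest remaining score
        candidates.remove(best)
        kept.append((best[1], best[0]))
        decayed = []
        for score, i in candidates:
            overlap = iou(boxes[best[1]], boxes[i])
            # Heavier overlap with the selected box means a stronger decay.
            new_score = score * math.exp(-(overlap ** 2) / sigma)
            if new_score > score_threshold:
                decayed.append((new_score, i))
        candidates = decayed
    return kept


# Hypothetical detections: boxes 0 and 1 overlap heavily, box 2 is separate.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]
for index, score in soft_nms(boxes, scores):
    print(index, round(score, 3))
```

With these sample inputs, box 1 is kept but drops below box 2 in the ranking, whereas hard NMS with a typical IoU threshold of 0.5 would have removed it outright.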
Unconstrained Gender Recognition from Periocular Region Using Multiscale Deep Features
15
Authors: Raqinah Alrabiah, Muhammad Hussain, Hatim A. AboAlSamh. 《Intelligent Automation & Soft Computing》 (SCIE), 2023, No. 3, pp. 2941-2962 (22 pages)
The gender recognition problem has attracted the attention of the computer vision community due to its importance in many applications (e.g., surveillance and human-computer interaction [HCI]). Images with varying levels of illumination, occlusion, and other factors are captured in uncontrolled environments. Iris and facial recognition technology cannot be used on these images because iris texture is unclear in these instances, and faces may be covered by a scarf, hijab, or mask due to the COVID-19 pandemic. The periocular region is a reliable source of information because it features rich discriminative biometric features. However, most existing gender classification approaches have been designed based on hand-engineered features or validated in controlled environments. Motivated by the superior performance of deep learning, we proposed a new method, PeriGender, inspired by the design principles of the ResNet and DenseNet models, that can classify gender using features from the periocular region. The proposed system utilizes a dense concept in a residual model. Through skip connections, it reuses features on different scales to strengthen discriminative features. Evaluations of the proposed system on challenging datasets indicated that it outperformed state-of-the-art methods. It achieved 87.37%, 94.90%, 94.14%, 99.14%, and 95.17% accuracy on the GROUPS, UFPR-Periocular, Ethnic-Ocular, IMP, and UBIPr datasets, respectively, in the open-world (OW) protocol. It further achieved 97.57% and 93.20% accuracy for adult periocular images from the GROUPS dataset in the closed-world (CW) and OW protocols, respectively. The results showed that the middle region between the eyes plays a crucial role in the recognition of masculine features, and feminine features can be identified through the eyebrow, upper eyelids, and corners of the eyes. Furthermore, using a whole region without cropping enhances PeriGender's learning capability, improving its understanding of the global structure of both eyes without discontinuity.
Keywords: Gender recognition; periocular region; deep learning; convolutional neural network; unconstrained environment
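Fine-Grained Image Classification Network with Multi-Region Attention (Cited by: 3)
16
Authors: 白尚旺, 王梦瑶, 胡静, 陈志泊. 《计算机工程》 (CSCD, PKU Core), 2024, No. 1, pp. 271-278 (8 pages)
The current difficulty in fine-grained image classification lies in accurately locating highly discriminative local regions in an image, along with other auxiliary discriminative features. A fine-grained image classification network with multi-region attention is proposed to address this problem. First, Inception-V3 is used to extract image features, and attention erasing is applied repeatedly so that the model attends to secondary features. Then, background removal and upsampling are used to obtain more precise local images; position statistics are computed over the extracted local features, and the whole object is obtained as a rectangular box, reducing the loss of detail information. Finally, the local and whole images are learned in finer detail. In addition, a joint loss function is designed that improves recognition by dynamically balancing hard and easy samples and reducing intra-class distance. Experimental results show that the method reaches accuracies of 89.2%, 94.8%, and 94.0% on the public fine-grained image datasets CUB-200-2011, Stanford-Cars, and FGVC-Aircraft, respectively, outperforming the compared methods.
Keywords: multi-region attention; fine-grained image classification; erasing strategy; joint loss; deep learning; convolutional neural network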
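Action Recognition with Region-Level Temporal Variation Learning
17
Authors: 杨兴明, 徐浩, 汪智文, 高旭杰, 吴克伟, 谢昭. 《计算机应用研究》 (CSCD, PKU Core), 2024, No. 11, pp. 3495-3501 (7 pages)
Existing action recognition methods do not learn region-level features within video frames, which causes similar action categories to be confused during recognition. To address this, a region-level temporal variation network is proposed. The network comprises a local-global temporal feature learning module, a region semantic learning module, and a region semantic fusion module. The local-global temporal feature learning module learns local temporal attention to enhance the motion features of local video frames and aggregates them into global temporal region features. The region semantic learning module builds variable region semantic convolution kernels by computing the similarity between pixels within a region, thereby learning action semantic features that vary over time. The region semantic fusion module treats the variable region features and the global temporal region features as two independent branches and learns channel attention for the features of each branch, which is then used for feature fusion. Experiments on the Something-Something V1 & V2 and Kinetics-400 datasets show that the region-level temporal variation network outperforms most action recognition methods, demonstrating that it effectively improves action recognition performance.
Keywords: action recognition; region-level features; convolutional neural network; deep learning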
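Cast Pipe Character Detection Algorithm Based on Improved Cascade RCNN
18
Authors: 王宇, 徐福丽, 王怀震, 崔勇, 姜岩, 陶晔, 王译笙, 张琦. 《计算机集成制造系统》 (EI, CSCD, PKU Core), 2024, No. 11, pp. 3954-3966 (13 pages)
Cast pipe character images collected at industrial sites suffer from blurred backgrounds, small character regions, non-fixed engraving positions, and paint occlusion, so the detection accuracy of existing models cannot meet the requirements of industrial sites. To address these problems, an improved Cascade RCNN algorithm for cast pipe character detection is proposed. First, the feature pyramid is improved: a small-target-enhanced feature pyramid (STE-FPN) is proposed, which uses the feature enhancement capability of multi-scale feature fusion to enrich the feature information of small cast pipe characters. Second, a self-calibrated/pooling ResNeSt (SCP-ResNeSt) is introduced as the feature extraction network, using self-calibrated convolution and pooling operations to improve the efficiency of feature extraction for cast pipe characters against complex backgrounds. Finally, the cascade structure is improved by introducing a Mask branch, which adaptively detects character regions and removes interfering regions, optimizing the detection results. Tested on a cast pipe dataset, the improved algorithm achieves a mean average precision (mAP) of 99.1%, which is 2.3% higher than the original Cascade RCNN algorithm, indicating that it outperforms the original algorithm.
Keywords: cast pipe character detection; blurred background; Cascade RCNN; ResNeSt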
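Water Body Extraction and Spatiotemporal Changes in the Source Region of the Yellow River Based on CNN-OBIA
19
Authors: 陈伟, 张秀霞, 党星海, 樊新成, 李旺平, 徐俊伟. 《人民长江》 (PKU Core), 2024, No. 4, pp. 133-141 (9 pages)
Accurate identification of water bodies is an important technical means of analyzing the spatiotemporal dynamics of surface water. To address the low accuracy of current long-time-series water body extraction methods, 5484 Landsat remote sensing images of the source region of the Yellow River from 1986 to 2022 were selected, and surface water bodies were extracted with two methods: a convolutional neural network combined with object-based image analysis (CNN-OBIA) and the multi-index water detection rule (MIWDR). The extraction accuracy of the two methods was then compared. On this basis, the spatiotemporal variation of water bodies in the source region from 1986 to 2022 was examined, and a correlation analysis with the main climatic factors was carried out. The results show that: (1) the overall accuracy and Kappa coefficient were 96.78% and 0.93 for CNN-OBIA and 94.28% and 0.88 for MIWDR, so CNN-OBIA was more accurate overall. The CNN-OBIA results preserve the integrity of water body boundaries, effectively remove mountain shadows, and extract small rivers well. (2) The total water body area in the study area first decreased (1986-2001) and then increased (2001-2022). (3) Correlation analysis shows that both precipitation and temperature are significantly positively correlated with changes in water body area.
Keywords: water body area extraction; convolutional neural network; object-based image analysis; driving force analysis; source region of the Yellow River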
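The abstract above compares methods by overall accuracy and the Kappa coefficient. As a hedged illustration of how these two figures relate, the sketch below computes both from a confusion matrix using the standard Cohen's kappa formula, (p_o - p_e) / (1 - p_e). The 2x2 matrix is made-up example data (water vs. non-water), not taken from the paper, and the function name is hypothetical.

```python
def overall_accuracy_and_kappa(confusion):
    """Overall accuracy and Cohen's kappa from a square confusion matrix.

    confusion[i][j] counts samples of true class i predicted as class j.
    """
    n_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    # Observed agreement: fraction of samples on the diagonal.
    observed = sum(confusion[i][i] for i in range(n_classes)) / total
    # Chance agreement: sum over classes of (row total * column total) / total^2.
    expected = sum(
        sum(confusion[i]) * sum(row[i] for row in confusion)
        for i in range(n_classes)
    ) / (total * total)
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical validation counts: rows are reference labels, columns are predictions.
confusion = [
    [45, 5],   # water: 45 correct, 5 missed
    [3, 47],   # non-water: 3 false alarms, 47 correct
]
accuracy, kappa = overall_accuracy_and_kappa(confusion)
print(round(accuracy, 2), round(kappa, 2))  # 0.92 0.84
```

Kappa is lower than overall accuracy here because it discounts the agreement that would be expected by chance given the class totals.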
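Research on EEG Emotion Recognition Based on Deep Graph Convolution
20
Authors: 李奇, 常立娜, 武岩, 闫旭荣. 《电子测量技术》 (PKU Core), 2024, No. 4, pp. 18-22 (5 pages)
The spatial correlation information of local brain regions extracted by shallow graph convolution does not represent emotional EEG sufficiently. To address this, a deep graph convolutional network model is proposed. The model uses deep graph convolution to learn the intrinsic relationships among emotional EEG channels at the global level. During convolutional propagation, residual connections and weight self-mapping are applied to solve the problem that node features in deep graph convolutional networks converge to a fixed space, so that effective features cannot be learned. PN regularization is added after the convolution layers to enlarge the distance between the features of different emotions and improve emotion recognition performance. In experiments on the SEED dataset, accuracy increased by 0.7% and the standard deviation decreased by 3.15 compared with a shallow graph convolutional network. The results demonstrate that the global brain-region spatial correlation information extracted by the model is effective for emotion recognition.
Keywords: EEG signals; emotion recognition; deep graph convolutional neural network; global brain regions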