期刊文献+
共找到62篇文章
< 1 2 4 >
每页显示 20 50 100
Fireworks Optimization with Deep Learning-Based Arabic Handwritten Characters Recognition Model
1
作者 Abdelwahed Motwakel Badriyya B.Al-onazi +5 位作者 Jaber S.Alzahrani Ayman Yafoz Mahmoud Othman Abu Sarwar Zamani Ishfaq Yaseen Amgad Atta Abdelmageed 《Computer Systems Science & Engineering》 2024年第5期1387-1403,共17页
Handwritten character recognition becomes one of the challenging research matters.More studies were presented for recognizing letters of various languages.The availability of Arabic handwritten characters databases wa... Handwritten character recognition becomes one of the challenging research matters.More studies were presented for recognizing letters of various languages.The availability of Arabic handwritten characters databases was confined.Almost a quarter of a billion people worldwide write and speak Arabic.More historical books and files indicate a vital data set for many Arab nationswritten in Arabic.Recently,Arabic handwritten character recognition(AHCR)has grabbed the attention and has become a difficult topic for pattern recognition and computer vision(CV).Therefore,this study develops fireworks optimizationwith the deep learning-based AHCR(FWODL-AHCR)technique.Themajor intention of the FWODL-AHCR technique is to recognize the distinct handwritten characters in the Arabic language.It initially pre-processes the handwritten images to improve their quality of them.Then,the RetinaNet-based deep convolutional neural network is applied as a feature extractor to produce feature vectors.Next,the deep echo state network(DESN)model is utilized to classify handwritten characters.Finally,the FWO algorithm is exploited as a hyperparameter tuning strategy to boost recognition performance.Various simulations in series were performed to exhibit the enhanced performance of the FWODL-AHCR technique.The comparison study portrayed the supremacy of the FWODL-AHCR technique over other approaches,with 99.91%and 98.94%on Hijja and AHCD datasets,respectively. 展开更多
关键词 Arabic language handwritten character recognition deep learning CLASSIFICATION parameter tuning
下载PDF
Offline Handwritten Characters Recognition Using Moments Features and Neural Networks
2
作者 Mohamed Abaynarh Lahbib Zenkouar 《Computer Technology and Application》 2015年第1期19-29,共11页
In this paper we revise the moment theory for pattern recognition designed, to extract patterns from the noisy character datas, and develop unconstrained handwritten. Amazigh character recognition method based upon or... In this paper we revise the moment theory for pattern recognition designed, to extract patterns from the noisy character datas, and develop unconstrained handwritten. Amazigh character recognition method based upon orthogonal moments and neural networks classifier. We argue that, given the natural flexibility of neural network models and the extent of parallel processing that they allow, our algorithm is a step forward in character recognition. More importantly, following the approach proposed, we apply our system to two different databases, to examine the ability to recognize patterns under noise. We discover overwhelming support for different style of writing. Moreover, this basic conclusion appears to remain valid across different levels of smoothing and insensitive to the nuances of character patterns. Experiments tested the effect of set size on recognition accuracy which can reach 97.46%. The novelty of the proposed method is independence of size, slant, orientation, and translation. The performance of the proposed method is experimentally evaluated and the promising results and findings are presented. Our method is compared to K-NN (k-nearest neighbors) classifier algorithm; results show performances of our method. 展开更多
关键词 Neural network character recognition orthogonal moments pattern recognition.
下载PDF
Instance Segmentation of Characters Recognized in Palmyrene Aramaic Inscriptions
3
作者 Adéla Hamplová Alexey Lyavdansky +3 位作者 TomášNovák Ondrej Svojše David Franc Arnošt Veselý 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第9期2869-2889,共21页
This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The go... This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The goal is to contribute to the preservation and understanding of historical texts,showcasing the potential of modern deep learning methods in archaeological research.Our research culminates in several key findings and scientific contributions.We comprehensively compare the performance of YOLOv8 and Roboflow 3.0 in the context of Palmyrene character segmentation—this comparative analysis mainly focuses on the strengths and weaknesses of each algorithm in this context.We also created and annotated an extensive dataset of Palmyrene inscriptions,a crucial resource for further research in the field.The dataset serves for training and evaluating the segmentation models.We employ comparative evaluation metrics to quantitatively assess the segmentation results,ensuring the reliability and reproducibility of our findings and we present custom visualization tools for predicted segmentation masks.Our study advances the state of the art in semi-automatic reading of Palmyrene inscriptions and establishes a benchmark for future research.The availability of the Palmyrene dataset and the insights into algorithm performance contribute to the broader understanding of historical text analysis. 展开更多
关键词 Optical character recognition instance segmentation Palmyrene ancient languages computer vision
下载PDF
Optimised CNN Architectures for Handwritten Arabic Character Recognition
4
作者 Salah Alghyaline 《Computers, Materials & Continua》 SCIE EI 2024年第6期4905-4924,共20页
Handwritten character recognition is considered challenging compared with machine-printed characters due to the different human writing styles.Arabic is morphologically rich,and its characters have a high similarity.T... Handwritten character recognition is considered challenging compared with machine-printed characters due to the different human writing styles.Arabic is morphologically rich,and its characters have a high similarity.The Arabic language includes 28 characters.Each character has up to four shapes according to its location in the word(at the beginning,middle,end,and isolated).This paper proposed 12 CNN architectures for recognizing handwritten Arabic characters.The proposed architectures were derived from the popular CNN architectures,such as VGG,ResNet,and Inception,to make them applicable to recognizing character-size images.The experimental results on three well-known datasets showed that the proposed architectures significantly enhanced the recognition rate compared to the baseline models.The experiments showed that data augmentation improved the models’accuracies on all tested datasets.The proposed model outperformed most of the existing approaches.The best achieved results were 93.05%,98.30%,and 96.88%on the HIJJA,AHCD,and AIA9K datasets. 展开更多
关键词 Optical character recognition(OCR) handwritten arabic characters deep learning
下载PDF
Recognition of Characters by Adaptive Combination of Classifiers
5
作者 王飞 李在铭 《Journal of Electronic Science and Technology of China》 2004年第2期7-9,共3页
In this paper, the visual feature space based on the long Horizontals, the long Verticals, and the radicals are given. An adaptive combination of classifiers, whose coefficients vary with the input pattern, is also pr... In this paper, the visual feature space based on the long Horizontals, the long Verticals, and the radicals are given. An adaptive combination of classifiers, whose coefficients vary with the input pattern, is also proposed. Experiments show that the approach is promising for character recognition in video sequences. 展开更多
关键词 character recognition adaptive combination multiple classifiers
下载PDF
Neural Network-Powered License Plate Recognition System Design
6
作者 Sakib Hasan Md Nagib Mahfuz Sunny +1 位作者 Abdullah Al Nahian Mohammad Yasin 《Engineering(科研)》 2024年第9期284-300,共17页
The development of scientific inquiry and research has yielded numerous benefits in the realm of intelligent traffic control systems, particularly in the realm of automatic license plate recognition for vehicles. The ... The development of scientific inquiry and research has yielded numerous benefits in the realm of intelligent traffic control systems, particularly in the realm of automatic license plate recognition for vehicles. The design of license plate recognition algorithms has undergone digitalization through the utilization of neural networks. In contemporary times, there is a growing demand for vehicle surveillance due to the need for efficient vehicle processing and traffic management. The design, development, and implementation of a license plate recognition system hold significant social, economic, and academic importance. The study aims to present contemporary methodologies and empirical findings pertaining to automated license plate recognition. The primary focus of the automatic license plate recognition algorithm was on image extraction, character segmentation, and recognition. The task of character segmentation has been identified as the most challenging function based on my observations. The license plate recognition project that we designed demonstrated the effectiveness of this method across various observed conditions. Particularly in low-light environments, such as during periods of limited illumination or inclement weather characterized by precipitation. The method has been subjected to testing using a sample size of fifty images, resulting in a 100% accuracy rate. The findings of this study demonstrate the project’s ability to effectively determine the optimal outcomes of simulations. 展开更多
关键词 Intelligent Traffic Control Systems Automatic License Plate recognition (ALPR) Neural Networks Vehicle Surveillance Traffic Management License Plate recognition Algorithms Image Extraction Character Segmentation Character recognition Low-Light Environments Inclement Weather Empirical Findings Algorithm Accuracy Simulation Outcomes DIGITALIZATION
下载PDF
Chip Surface Character Recognition Based on OpenCV
7
作者 Lihang Yin 《Journal of Electronic Research and Application》 2024年第4期161-167,共7页
Chip surface character recognition is an important part of quality inspection in the field of microelectronics manufacturing.By recognizing the character information on the chip,automated production,quality control,an... Chip surface character recognition is an important part of quality inspection in the field of microelectronics manufacturing.By recognizing the character information on the chip,automated production,quality control,and data collection and analysis can be achieved.This article studies a chip surface character recognition method based on the OpenCV vision library.Firstly,the obtained chip images are preprocessed.Secondly,the template matching method is used to locate the chip position.In addition,the surface characters on the chip are individually segmented,and each character image is extracted separately.Finally,a Support Vector Machine(SVM)is used to classify and recognize characters.The results show that this method can accurately recognize the surface characters of chips and meet the requirements of chip quality inspection. 展开更多
关键词 Template matching Character recognition SVM OPENCV
下载PDF
A Review of Research on Handwritten Chinese Character Recognition with Multi-Feature Fusion
8
作者 Peng Deng Guiying Yang 《Journal of Electronic Research and Application》 2024年第5期109-117,共9页
This paper analyzes the progress of handwritten Chinese character recognition technology,from two perspectives:traditional recognition methods and deep learning-based recognition methods.Firstly,the complexity of Chin... This paper analyzes the progress of handwritten Chinese character recognition technology,from two perspectives:traditional recognition methods and deep learning-based recognition methods.Firstly,the complexity of Chinese character recognition is pointed out,including its numerous categories,complex structure,and the problem of similar characters,especially the variability of handwritten Chinese characters.Subsequently,recognition methods based on feature optimization,model optimization,and fusion techniques are highlighted.The fusion studies between feature optimization and model improvement are further explored,and these studies further enhance the recognition effect through complementary advantages.Finally,the article summarizes the current challenges of Chinese character recognition technology,including accuracy improvement,model complexity,and real-time problems,and looks forward to future research directions. 展开更多
关键词 Chinese character recognition Multi-feature fusion Machine learning
下载PDF
Traffic light detection and recognition in intersections based on intelligent vehicle
9
作者 张宁 何铁军 +1 位作者 高朝晖 黄卫 《Journal of Southeast University(English Edition)》 EI CAS 2008年第4期517-521,共5页
To ensure revulsive driving of intelligent vehicles at intersections, a method is presented to detect and recognize the traffic lights. First, the stabling siding at intersections is detected by applying Hough transfo... To ensure revulsive driving of intelligent vehicles at intersections, a method is presented to detect and recognize the traffic lights. First, the stabling siding at intersections is detected by applying Hough transformation. Then, the colors of traffic lights are detected with color space transformation. Finally, self-associative memory is used to recognize the countdown characters of the traffic lights. Test results at 20 real intersections show that the ratio of correct stabling siding recognition reaches up to 90%;and the ratios of recognition of traffic lights and divided characters are 85% and 97%, respectively. The research proves that the method is efficient for the detection of stabling siding and is robust enough to recognize the characters from images with noise and broken edges. 展开更多
关键词 intelligent vehicle stabling siding detection traffic lights detection self-associative memory light-emitting diode (LED) characters recognition
下载PDF
SVM Model Selection Using PSO for Learning Handwritten Arabic Characters 被引量:2
10
作者 Mamouni El Mamoun Zennaki Mahmoud Sadouni Kaddour 《Computers, Materials & Continua》 SCIE EI 2019年第9期995-1008,共14页
Using Support Vector Machine(SVM)requires the selection of several parameters such as multi-class strategy type(one-against-all or one-against-one),the regularization parameter C,kernel function and their parameters.T... Using Support Vector Machine(SVM)requires the selection of several parameters such as multi-class strategy type(one-against-all or one-against-one),the regularization parameter C,kernel function and their parameters.The choice of these parameters has a great influence on the performance of the final classifier.This paper considers the grid search method and the particle swarm optimization(PSO)technique that have allowed to quickly select and scan a large space of SVM parameters.A comparative study of the SVM models is also presented to examine the convergence speed and the results of each model.SVM is applied to handwritten Arabic characters learning,with a database containing 4840 Arabic characters in their different positions(isolated,beginning,middle and end).Some very promising results have been achieved. 展开更多
关键词 SVM PSO handwritten Arabic grid search character recognition
下载PDF
ERCS: An Efficient and Robust Card Recognition System for Camera-Based Image 被引量:1
11
作者 Zhonghong Ou Baiqiao Xiong +1 位作者 Fenrui Xiao Meina Song 《China Communications》 SCIE CSCD 2020年第12期247-264,共18页
Cards Recognition Systems,(CRSs)are representative computer vision-based applications.They have a broad range of usage scenarios.For example,they can be used to recognize images containing business cards,personal iden... Cards Recognition Systems,(CRSs)are representative computer vision-based applications.They have a broad range of usage scenarios.For example,they can be used to recognize images containing business cards,personal identification cards,and bank cards etc.Even though CRSs have been studied for many years,it is still difficult to recognize cards in camera-based images taken by ordinary devices,e.g.,mobile phones.Diversity of viewpoints and complex backgrounds in the images make the recognition task challenging.Existing systems employing traditional image processing schemes are not robust to varied environment,and are inefficient in dealing with natural images,e.g.,taken by mobile phones.To tackle the problem,we propose a novel framework for card recognition by employing a Convolutional Neutral Network(CNN)based approach.The system localizes the foreground of the image by utilizing a Fully Convolutional Network(FCN).With the help of the foreground map,the system localizes the corners of the card region and employs perspective transformation to alleviate the effects from distortion.Text lines in the card region are detected and recognized by utilizing CNN and Long Short Term Memory,(LSTM).To evaluate the proposed scheme,we collect a large dataset which contains 4,065 images in a variety of shooting scenarios.Experimental results demonstrate the efficacy of the proposed scheme.Specifically,it is able to achieve an accuracy of 90.62%in the end-toend test,outperforming the state-of-the-art. 展开更多
关键词 card localization card recognition optical character recognition CNN
下载PDF
Arabic Optical Character Recognition:A Review 被引量:1
12
作者 Salah Alghyaline 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第6期1825-1861,共37页
This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingl... This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingly.The study describes the characteristics of the Arabic language,different types of OCR systems,different stages of the Arabic OCR system,the researcher’s contributions in each step,and the evaluationmetrics for OCR.The study reviews the existing datasets for the Arabic OCR and their characteristics.Additionally,this study implemented some preprocessing and segmentation stages of Arabic OCR.The study compares the performance of the existing methods in terms of recognition accuracy.In addition to researchers’OCRmethods,commercial and open-source systems are used in the comparison.The Arabic language is morphologically rich and written cursive with dots and diacritics above and under the characters.Most of the existing approaches in the literature were evaluated on isolated characters or isolated words under a controlled environment,and few approaches were tested on pagelevel scripts.Some comparative studies show that the accuracy of the existing Arabic OCR commercial systems is low,under 75%for printed text,and further improvement is needed.Moreover,most of the current approaches are offline OCR systems,and there is no remarkable contribution to online OCR systems. 展开更多
关键词 Arabic Optical Character recognition(OCR) Arabic OCR software Arabic OCR datasets Arabic OCR evaluation
下载PDF
A novel license plate recognition method using HTD and VTD features 被引量:2
13
作者 Zhang Xiangdong Shen Peiyi Li Liangchao Wang Wei Bai Jianhua Zhang Wenbo 《Engineering Sciences》 EI 2010年第1期71-76,共6页
In this paper, a novel method of licence plate recognition (LPR) using the vertical traverse density (VTD) and horizontal traverse density (HTD) is presented. The neutral network algorithm using VTD and HTD features i... In this paper, a novel method of licence plate recognition (LPR) using the vertical traverse density (VTD) and horizontal traverse density (HTD) is presented. The neutral network algorithm using VTD and HTD features is also an innovation. In addition, a so called secondary recognition method which splits characters into different parts is developed. Experimental results show that it is a simple and fast algorithm, which meets the request of real time and nicety performances of LPR and thus has applied value in intelligence transportation system (ITS). 展开更多
关键词 license plate recognition character segment character recognition VTD and HTD features
下载PDF
An on-line free handwritten Chinese character recognition method based on component cascaded HMMs 被引量:1
14
作者 Zhao Wei(赵巍) Liu Jiafeng Tang Xianglong 《High Technology Letters》 EI CAS 2005年第3期301-305,共5页
This paper presents a cascaded Hidden Markov Model (HMM), which allows state's transition, skip and duration. The cascaded HMM extends the way of HMM pattern description of Handwritten Chinese Character (HCC) and... This paper presents a cascaded Hidden Markov Model (HMM), which allows state's transition, skip and duration. The cascaded HMM extends the way of HMM pattern description of Handwritten Chinese Character (HCC) and depicts the behavior of handwritten curve more reliably in terms of the statistic probability. Hence character segmentation and labeling are unnecessary. Viterbi algorithm is integrated in the cascaded HMM after the whole sample sequence of a HCC is input. More than 26,000 component samples are used tor training 407 handwritten component HMMs. At the improved training stage 94 models of 94 Chinese characters are gained by 32,000 samples, Compared with the Segment HMMs approach, the recognition rate of this model tier the tirst candidate is 87.89% and the error rate could be reduced by 12.4%. 展开更多
关键词 chinese character recognition handwritten component HMM cascaded model
下载PDF
CHARACTER DETECTION AND RECOGNITION SYSTEM OF BEER BOTTLES 被引量:1
15
作者 Shen Bangxing Wu Wenjun +2 位作者 Zhang Yepeng Shen Gang Yang Liangen 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2005年第3期467-469,共3页
An optical imaging system and a configuration characteristic algorithm are presented to reduce the difficulties in extracting intact characters image with weak contrast, in recognizing characters on fast moving beer b... An optical imaging system and a configuration characteristic algorithm are presented to reduce the difficulties in extracting intact characters image with weak contrast, in recognizing characters on fast moving beer bottles. The system consists of a hardware subsystem, including a rotating device, CCD, 16 mm focus lens, a frame grabber card, a penetrating lighting and a computer, and a software subsystem. The software subsystem performs pretreatment, character segmentation and character recognition. In the pretreatment, the original image is filtered with preset threshold to remove isolated spots. Then the horizontal projection and the vertical projection are used respectively to retrieve the character segmentation. Subsequently, the configuration characteristic algorithm is applied to recognize the characters. The experimental results demonstrate that this system can recognize the characters on beer bottles accurately and effectively; the algorithm is proven fast, stable and robust, making it suitable in the industrial environment. 展开更多
关键词 Optical imaging system Raised character recognition Configuration characteristic algorithm
下载PDF
A Vision-Based Fingertip-Writing Character Recognition System 被引量:1
16
作者 Ching-Long Shih Wen-Yo Lee Yu-Te Ku 《Journal of Computer and Communications》 2016年第4期160-168,共9页
This paper presents a vision-based fingertip-writing character recognition system. The overall system is implemented through a CMOS image camera on a FPGA chip. A blue cover is mounted on the top of a finger to simpli... This paper presents a vision-based fingertip-writing character recognition system. The overall system is implemented through a CMOS image camera on a FPGA chip. A blue cover is mounted on the top of a finger to simplify fingertip detection and to enhance recognition accuracy. For each character stroke, 8 sample points (including start and end points) are recorded. 7 tangent angles between consecutive sampled points are also recorded as features. In addition, 3 features angles are extracted: angles of the triangle consisting of the start point, end point and average point of all (8 total) sampled points. According to these key feature angles, a simple template matching K-nearest-neighbor classifier is applied to distinguish each character stroke. Experimental result showed that the system can successfully recognize fingertip-writing character strokes of digits and small lower case letter alphabets with an accuracy of almost 100%. Overall, the proposed finger-tip-writing recognition system provides an easy-to-use and accurate visual character input method. 展开更多
关键词 Visual Character recognition Fingertip Detection Template Matching K-Nearest-Neighbor Classifier FPGA
下载PDF
Libyan Licenses Plate Recognition Using Template Matching Method 被引量:1
17
作者 Alla A. El. Senoussi Abdella 《Journal of Computer and Communications》 2016年第7期62-71,共10页
License plate recognition (LPR) applies image processing and character recognition technology to identify vehicles by automatically reading their license plates. The work presented in this paper aims to create a compu... License plate recognition (LPR) applies image processing and character recognition technology to identify vehicles by automatically reading their license plates. The work presented in this paper aims to create a computer vision system capable of taking real-time input image from a static camera and identifying the license plate from extracted image. This problem is examined in two stages: First the license plate region detection and extraction from background and plate segmentation to sub-images, and second the character recognition stage. The method used for the license plate region detection is based on the assumption that the license plate area is a high concentration of smaller details, making it a region of high intensity of edges. The Sobel filter and their vertical and horizontal projections are used to identify the plate region. The result of testing this stage was an accuracy of 67.5%. The final stage of the LPR system is optical character recognition (OCR). The method adopted for this stage is based on template matching using correlation. Testing the performance of OCR resulted in an overall recognition rate of 87.76%. 展开更多
关键词 License Plate recognition Optical Character recognition Computer Vision System
下载PDF
Research on Handwritten Chinese Character Recognition Based on BP Neural Network 被引量:1
18
作者 Zihao Ning 《Modern Electronic Technology》 2022年第1期12-32,共21页
The application of pattern recognition technology enables us to solve various human-computer interaction problems that were difficult to solve before.Handwritten Chinese character recognition,as a hot research object ... The application of pattern recognition technology enables us to solve various human-computer interaction problems that were difficult to solve before.Handwritten Chinese character recognition,as a hot research object in image pattern recognition,has many applications in people’s daily life,and more and more scholars are beginning to study off-line handwritten Chinese character recognition.This paper mainly studies the recognition of handwritten Chinese characters by BP(Back Propagation)neural network.Establish a handwritten Chinese character recognition model based on BP neural network,and then verify the accuracy and feasibility of the neural network through GUI(Graphical User Interface)model established by Matlab.This paper mainly includes the following aspects:Firstly,the preprocessing process of handwritten Chinese character recognition in this paper is analyzed.Among them,image preprocessing mainly includes six processes:graying,binarization,smoothing and denoising,character segmentation,histogram equalization and normalization.Secondly,through the comparative selection of feature extraction methods for handwritten Chinese characters,and through the comparative analysis of the results of three different feature extraction methods,the most suitable feature extraction method for this paper is found.Finally,it is the application of BP neural network in handwritten Chinese character recognition.The establishment,training process and parameter selection of BP neural network are described in detail.The simulation software platform chosen in this paper is Matlab,and the sample images are used to train BP neural network to verify the feasibility of Chinese character recognition.Design the GUI interface of human-computer interaction based on Matlab,show the process and results of handwritten Chinese character recognition,and analyze the experimental results. 展开更多
关键词 Pattern recognition Handwritten Chinese character recognition BP neural network
下载PDF
Improved Approach Based on SVM for License Plate Character Recognition
19
作者 王晓华 王晓光 《Journal of Beijing Institute of Technology》 EI CAS 2005年第4期378-381,共4页
An improved approach based on support vector machine (SVM) called the center distance ratio method is presented for license plate character recognition. First the support vectors are pre-extraeted. A minimal set cal... An improved approach based on support vector machine (SVM) called the center distance ratio method is presented for license plate character recognition. First the support vectors are pre-extraeted. A minimal set called the margin vector set, which contains all support vectors, is extracted. These margin vectors compose new training data and construct the classifier by using the general SVM optimized. The experimental resuhs show that the improved SVM method does well at correct rate and training speed. 展开更多
关键词 support vector machine(SVM) center distance ratio method margin vector support vector character recognition
下载PDF
A Novel Siamese Network for Few/Zero-Shot Handwritten Character Recognition Tasks
20
作者 Nagwa Elaraby Sherif Barakat Amira Rezk 《Computers, Materials & Continua》 SCIE EI 2023年第1期1837-1854,共18页
Deep metric learning is one of the recommended methods for the challenge of supporting few/zero-shot learning by deep networks.It depends on building a Siamese architecture of two homogeneous Convolutional Neural Netw... Deep metric learning is one of the recommended methods for the challenge of supporting few/zero-shot learning by deep networks.It depends on building a Siamese architecture of two homogeneous Convolutional Neural Networks(CNNs)for learning a distance function that can map input data from the input space to the feature space.Instead of determining the class of each sample,the Siamese architecture deals with the existence of a few training samples by deciding if the samples share the same class identity or not.The traditional structure for the Siamese architecture was built by forming two CNNs from scratch with randomly initialized weights and trained by binary cross-entropy loss.Building two CNNs from scratch is a trial and error and time-consuming phase.In addition,training with binary crossentropy loss sometimes leads to poor margins.In this paper,a novel Siamese network is proposed and applied to few/zero-shot Handwritten Character Recognition(HCR)tasks.The novelties of the proposed network are in.1)Utilizing transfer learning and using the pre-trained AlexNet as a feature extractor in the Siamese architecture.Fine-tuning a pre-trained network is typically faster and easier than building from scratch.2)Training the Siamese architecture with contrastive loss instead of the binary cross-entropy.Contrastive loss helps the network to learn a nonlinear mapping function that enables it to map the extracted features in the vector space with an optimal way.The proposed network is evaluated on the challenging Chars74K datasets by conducting two experiments.One is for testing the proposed network in few-shot learning while the other is for testing it in zero-shot learning.The recognition accuracy of the proposed network reaches to 85.6%and 82%in few-and zero-shot learning respectively.In addition,a comparison between the performance of the proposed Siamese network and the traditional Siamese CNNs is conducted.The comparison results show that the proposed network achieves higher recognition results in less time.The proposed network reduces the training time from days to hours in both experiments. 展开更多
关键词 Handwritten character recognition(HCR) few-shot learning zero-shot learning deep metric learning transfer learning contrastive loss Chars74K datasets
下载PDF
上一页 1 2 4 下一页 到第
使用帮助 返回顶部