This paper focuses on a state sharing method for an artificial neural network (ANN) and hidden Markov model (HMM) hybrid on line handwriting recognition system. A modeling precision based distance measure is proposed ...This paper focuses on a state sharing method for an artificial neural network (ANN) and hidden Markov model (HMM) hybrid on line handwriting recognition system. A modeling precision based distance measure is proposed to describe similarity between two ANNs, which are used as HMM state models. Limiting maximum system performance loss, a minimum quantification error aimed hierarchical clustering algorithm is designed to choose the most representative models. The system performance is improved by about 1.5% while saving 40% of the system expense. About 92% of the performance may also be maintained while reducing 70% of system parameters. The suggested method is quite useful for designing pen based interface for various handheld devices.展开更多
In recent years,researchers in handwriting recognition analysis relating to indigenous languages have gained significant internet among research communities.The recent developments of artificial intelligence(AI),natur...In recent years,researchers in handwriting recognition analysis relating to indigenous languages have gained significant internet among research communities.The recent developments of artificial intelligence(AI),natural language processing(NLP),and computational linguistics(CL)find useful in the analysis of regional low resource languages.Automatic lexical task participation might be elaborated to various applications in the NLP.It is apparent from the availability of effective machine recognition models and open access handwritten databases.Arabic language is a commonly spoken Semitic language,and it is written with the cursive Arabic alphabet from right to left.Arabic handwritten Character Recognition(HCR)is a crucial process in optical character recognition.In this view,this paper presents effective Computational linguistics with Deep Learning based Handwriting Recognition and Speech Synthesizer(CLDL-THRSS)for Indigenous Language.The presented CLDL-THRSS model involves two stages of operations namely automated handwriting recognition and speech recognition.Firstly,the automated handwriting recognition procedure involves preprocessing,segmentation,feature extraction,and classification.Also,the Capsule Network(CapsNet)based feature extractor is employed for the recognition of handwritten Arabic characters.For optimal hyperparameter tuning,the cuckoo search(CS)optimization technique was included to tune the parameters of the CapsNet method.Besides,deep neural network with hidden Markov model(DNN-HMM)model is employed for the automatic speech synthesizer.To validate the effective performance of the proposed CLDL-THRSS model,a detailed experimental validation process takes place and investigates the outcomes interms of different measures.The experimental outcomes denoted that the CLDL-THRSS technique has demonstrated the compared methods.展开更多
Unconstrained offiine handwriting recognition is a challenging task in the areas of document analysis and pattern recognition. In recent years, to sufficiently exploit the supervisory information hidden in document im...Unconstrained offiine handwriting recognition is a challenging task in the areas of document analysis and pattern recognition. In recent years, to sufficiently exploit the supervisory information hidden in document images, much effort has been made to integrate multi-layer perceptrons (MLPs) in either a hybrid or a tandem fashion into hidden Markov models (HMMs). However, due to the weak learnability of MLPs, the learnt features are not necessarily optimal for subsequent recognition tasks. In this paper, we propose a deep architecture-based tandem approach for unconstrained offiine handwriting recognition. In the proposed model, deep belief networks arc adopted to learn the compact representations of sequential data, while HMMs are applied for (sub-)word recognition. We evaluate the proposed model on two publicly available datasets, i.e., RIMES and IFN/ENIT, which are based on Latin and Arabic languages respectively, and one dataset collected by ourselves called Devanagari (all Indian script). Extensive experiments show the advantage of the proposed model, especially over the MLP-HMMs taudem approaches.展开更多
Even though several advances have been made in recent years,handwritten script recognition is still a challenging task in the pattern recognition domain.This field has gained much interest lately due to its diverse ap...Even though several advances have been made in recent years,handwritten script recognition is still a challenging task in the pattern recognition domain.This field has gained much interest lately due to its diverse application potentials.Nowadays,different methods are available for automatic script recognition.Among most of the reported script recognition techniques,deep neural networks have achieved impressive results and outperformed the classical machine learning algorithms.However,the process of designing such networks right from scratch intuitively appears to incur a significant amount of trial and error,which renders them unfeasible.This approach often requires manual intervention with domain expertise which consumes substantial time and computational resources.To alleviate this shortcoming,this paper proposes a new neural architecture search approach based on meta-heuristic quantum particle swarm optimization(QPSO),which is capable of automatically evolving the meaningful convolutional neural network(CNN)topologies.The computational experiments have been conducted on eight different datasets belonging to three popular Indic scripts,namely Bangla,Devanagari,and Dogri,consisting of handwritten characters and digits.Empirically,the results imply that the proposed QPSO-CNN algorithm outperforms the classical and state-of-the-art methods with faster prediction and higher accuracy.展开更多
Handwriting recognition is one of the most significant problems in pattern recognition,many studies have been proposed to improve this recognition of handwritten text for different languages.Yet,Fewer studies have bee...Handwriting recognition is one of the most significant problems in pattern recognition,many studies have been proposed to improve this recognition of handwritten text for different languages.Yet,Fewer studies have been done for the Arabic language and the processing of its texts remains a particularly distinctive problem due to the variability of writing styles and the nature of Arabic scripts compared to other scripts.The present paper suggests a feature extraction technique for offlineArabic handwriting recognition.A handwriting recognition system for Arabic words using a few important structural features and based on a Radial Basis Function(RBF)neural networks is proposed.The methods of feature extraction are central to achieve high recognition performance.The proposed methodology relies on a feature extraction technique based on many structural characteristics extracted from the word skeleton(subwords,diacritics,loops,ascenders,and descenders).In order to reach our purpose,we built our own word database and the proposed system has been successfully tested on a handwriting database of Algerian city names(wilayas).Finally,a simple classifier based on the radial basis function neural network is presented to recognize certain words to verify the reliability of the proposed feature extraction.The experiments on some images of the benchmark IFN/ENIT database show that the proposed system improves recognition and the results obtained are indicative of the efficiency of our technique.展开更多
Individuals with special needs learn more slowly than their peers and they need repetitions to be permanent.However,in crowded classrooms,it is dif-ficult for a teacher to deal with each student individually.This probl...Individuals with special needs learn more slowly than their peers and they need repetitions to be permanent.However,in crowded classrooms,it is dif-ficult for a teacher to deal with each student individually.This problem can be overcome by using supportive education applications.However,the majority of such applications are not designed for special education and therefore they are not efficient as expected.Special education students differ from their peers in terms of their development,characteristics,and educational qualifications.The handwriting skills of individuals with special needs are lower than their peers.This makes the task of Handwriting Recognition(HWR)more difficult.To over-come this problem,we propose a new personalized handwriting verification sys-tem that validates digits from the handwriting of special education students.The system uses a Convolutional Neural Network(CNN)created and trained from scratch.The data set used is obtained by collecting the handwriting of the students with the help of a tablet.A special education center is visited and the handwrittenfigures of the students are collected under the supervision of special education tea-chers.The system is designed as a person-dependent system as every student has their writing style.Overall,the system achieves promising results,reaching a recognition accuracy of about 94%.Overall,the system can verify special educa-tion students’handwriting digits with high accuracy and is ready to integrate with a mobile application that is designed to teach digits to special education students.展开更多
Accurate handwriting recognition has been a challenging computer vision problem,because static feature analysis of the text pictures is often inade-quate to account for high variance in handwriting styles across peopl...Accurate handwriting recognition has been a challenging computer vision problem,because static feature analysis of the text pictures is often inade-quate to account for high variance in handwriting styles across people and poor image quality of the handwritten text.Recently,by introducing machine learning,especially convolutional neural networks(CNNs),the recognition accuracy of various handwriting patterns is steadily improved.In this paper,a deep CNN model is developed to further improve the recognition rate of the MNIST hand-written digit dataset with a fast-converging rate in training.The proposed model comes with a multi-layer deep arrange structure,including 3 convolution and acti-vation layers for feature extraction and 2 fully connected layers(i.e.,dense layers)for classification.The model’s hyperparameters,such as the batch sizes,kernel sizes,batch normalization,activation function,and learning rate are optimized to enhance the recognition performance.The average classification accuracy of the proposed methodology is found to reach 99.82%on the training dataset and 99.40%on the testing dataset,making it a nearly error-free system for MNIST recognition.展开更多
In the field of educational examination,automatic marking technology plays an essential role in improving the efficiency of marking and liberating the labor force.At present,the implementation of the policy of expandi...In the field of educational examination,automatic marking technology plays an essential role in improving the efficiency of marking and liberating the labor force.At present,the implementation of the policy of expanding erolments has caused a serious decline in the teacher-student ratio in colleges and universities.The traditional marking system based on Optical Mark Reader technology can no longer meet the requirements of liberating the labor force of teachers in small and medium-sized examinations.With the development of image processing and artificial neural network technology,the recognition of handwritten character in the field of pattern recognition has attracted the attention of many researchers.In this paper,filtering and de-noise processing and binary processing are used as preprocessing methods for handwriting recognition.Extract the pixel feature of handwritten characters through digital image processing of handwritten character pictures,and then,get the feature vector from these feature fragments and use it as the description of the character.The extracted feature values are used to train the neural network to realize the recognition of handwritten English letters and numerical characters.Experimental results on Chars74K and MNIST data sets show that the recognition accuracy of some handwritten English letters and numerical characters can reach 90%and 99%,respectively.展开更多
Given the number and variety of methods used for handwriting recognition, ithas been shown that there is no single method that can be called the ''best''. In recent years, thecombination of different c...Given the number and variety of methods used for handwriting recognition, ithas been shown that there is no single method that can be called the ''best''. In recent years, thecombination of different classifiers and the use of contextual information have become major areasof interest in improving recognition results. This paper addresses a case study on the combinationof multiple classifiers and the integration of syntactic level information for the recognition ofhandwritten Arabic literal amounts. To the best of our knowledge, this is the first time either ofthese methods has been applied to Arabic word recognition. Using three individual classifiers withhigh level global features, we performed word recognition experiments. A parallel combination methodwas tested for all possible configuration cases of the three chosen classifiers. A syntacticanalyzer makes a final decision on the candidate words generated by the best configuration scheme.The effectiveness of contextual knowledge integration in our application is Arabicconfirmed by theobtained results.展开更多
Based on the analysis of the unique shapes and writing styles of Uyghur characters,we design a framework for prototype character recognition system and carry out a systematic theoretical and experimental research on i...Based on the analysis of the unique shapes and writing styles of Uyghur characters,we design a framework for prototype character recognition system and carry out a systematic theoretical and experimental research on its modules.In the preprocessing procedure,we use the linear and nonlinear normalization based on dot density method.Both structural and statistical features are extracted due to the fact that there are some very similar characters in Uyghur literature.In clustering analysis,we adopt the dynamic clustering algorithm based on the minimum spanning tree(MST),and use the k-nearest neighbor matching classification as classifier.The testing results of prototype system show that the recognition rates for characters of the four different types(independent,suffix,intermediate,and initial type) are 74.67%,70.42%,63.33%,and 72.02%,respectively;the recognition rates for the case of five candidates for those characters are 94.34%,94.19%,93.15%,and 95.86%,respectively.The ideas and methods used in this paper have some commonality and usefulness for the recognition of other characters that belong to Altaic languages family.展开更多
文摘This paper focuses on a state sharing method for an artificial neural network (ANN) and hidden Markov model (HMM) hybrid on line handwriting recognition system. A modeling precision based distance measure is proposed to describe similarity between two ANNs, which are used as HMM state models. Limiting maximum system performance loss, a minimum quantification error aimed hierarchical clustering algorithm is designed to choose the most representative models. The system performance is improved by about 1.5% while saving 40% of the system expense. About 92% of the performance may also be maintained while reducing 70% of system parameters. The suggested method is quite useful for designing pen based interface for various handheld devices.
文摘In recent years,researchers in handwriting recognition analysis relating to indigenous languages have gained significant internet among research communities.The recent developments of artificial intelligence(AI),natural language processing(NLP),and computational linguistics(CL)find useful in the analysis of regional low resource languages.Automatic lexical task participation might be elaborated to various applications in the NLP.It is apparent from the availability of effective machine recognition models and open access handwritten databases.Arabic language is a commonly spoken Semitic language,and it is written with the cursive Arabic alphabet from right to left.Arabic handwritten Character Recognition(HCR)is a crucial process in optical character recognition.In this view,this paper presents effective Computational linguistics with Deep Learning based Handwriting Recognition and Speech Synthesizer(CLDL-THRSS)for Indigenous Language.The presented CLDL-THRSS model involves two stages of operations namely automated handwriting recognition and speech recognition.Firstly,the automated handwriting recognition procedure involves preprocessing,segmentation,feature extraction,and classification.Also,the Capsule Network(CapsNet)based feature extractor is employed for the recognition of handwritten Arabic characters.For optimal hyperparameter tuning,the cuckoo search(CS)optimization technique was included to tune the parameters of the CapsNet method.Besides,deep neural network with hidden Markov model(DNN-HMM)model is employed for the automatic speech synthesizer.To validate the effective performance of the proposed CLDL-THRSS model,a detailed experimental validation process takes place and investigates the outcomes interms of different measures.The experimental outcomes denoted that the CLDL-THRSS technique has demonstrated the compared methods.
基金the National Natural Science Foundation of China (No. 61403353)
文摘Unconstrained offiine handwriting recognition is a challenging task in the areas of document analysis and pattern recognition. In recent years, to sufficiently exploit the supervisory information hidden in document images, much effort has been made to integrate multi-layer perceptrons (MLPs) in either a hybrid or a tandem fashion into hidden Markov models (HMMs). However, due to the weak learnability of MLPs, the learnt features are not necessarily optimal for subsequent recognition tasks. In this paper, we propose a deep architecture-based tandem approach for unconstrained offiine handwriting recognition. In the proposed model, deep belief networks arc adopted to learn the compact representations of sequential data, while HMMs are applied for (sub-)word recognition. We evaluate the proposed model on two publicly available datasets, i.e., RIMES and IFN/ENIT, which are based on Latin and Arabic languages respectively, and one dataset collected by ourselves called Devanagari (all Indian script). Extensive experiments show the advantage of the proposed model, especially over the MLP-HMMs taudem approaches.
文摘Even though several advances have been made in recent years,handwritten script recognition is still a challenging task in the pattern recognition domain.This field has gained much interest lately due to its diverse application potentials.Nowadays,different methods are available for automatic script recognition.Among most of the reported script recognition techniques,deep neural networks have achieved impressive results and outperformed the classical machine learning algorithms.However,the process of designing such networks right from scratch intuitively appears to incur a significant amount of trial and error,which renders them unfeasible.This approach often requires manual intervention with domain expertise which consumes substantial time and computational resources.To alleviate this shortcoming,this paper proposes a new neural architecture search approach based on meta-heuristic quantum particle swarm optimization(QPSO),which is capable of automatically evolving the meaningful convolutional neural network(CNN)topologies.The computational experiments have been conducted on eight different datasets belonging to three popular Indic scripts,namely Bangla,Devanagari,and Dogri,consisting of handwritten characters and digits.Empirically,the results imply that the proposed QPSO-CNN algorithm outperforms the classical and state-of-the-art methods with faster prediction and higher accuracy.
文摘Handwriting recognition is one of the most significant problems in pattern recognition,many studies have been proposed to improve this recognition of handwritten text for different languages.Yet,Fewer studies have been done for the Arabic language and the processing of its texts remains a particularly distinctive problem due to the variability of writing styles and the nature of Arabic scripts compared to other scripts.The present paper suggests a feature extraction technique for offlineArabic handwriting recognition.A handwriting recognition system for Arabic words using a few important structural features and based on a Radial Basis Function(RBF)neural networks is proposed.The methods of feature extraction are central to achieve high recognition performance.The proposed methodology relies on a feature extraction technique based on many structural characteristics extracted from the word skeleton(subwords,diacritics,loops,ascenders,and descenders).In order to reach our purpose,we built our own word database and the proposed system has been successfully tested on a handwriting database of Algerian city names(wilayas).Finally,a simple classifier based on the radial basis function neural network is presented to recognize certain words to verify the reliability of the proposed feature extraction.The experiments on some images of the benchmark IFN/ENIT database show that the proposed system improves recognition and the results obtained are indicative of the efficiency of our technique.
文摘Individuals with special needs learn more slowly than their peers and they need repetitions to be permanent.However,in crowded classrooms,it is dif-ficult for a teacher to deal with each student individually.This problem can be overcome by using supportive education applications.However,the majority of such applications are not designed for special education and therefore they are not efficient as expected.Special education students differ from their peers in terms of their development,characteristics,and educational qualifications.The handwriting skills of individuals with special needs are lower than their peers.This makes the task of Handwriting Recognition(HWR)more difficult.To over-come this problem,we propose a new personalized handwriting verification sys-tem that validates digits from the handwriting of special education students.The system uses a Convolutional Neural Network(CNN)created and trained from scratch.The data set used is obtained by collecting the handwriting of the students with the help of a tablet.A special education center is visited and the handwrittenfigures of the students are collected under the supervision of special education tea-chers.The system is designed as a person-dependent system as every student has their writing style.Overall,the system achieves promising results,reaching a recognition accuracy of about 94%.Overall,the system can verify special educa-tion students’handwriting digits with high accuracy and is ready to integrate with a mobile application that is designed to teach digits to special education students.
文摘Accurate handwriting recognition has been a challenging computer vision problem,because static feature analysis of the text pictures is often inade-quate to account for high variance in handwriting styles across people and poor image quality of the handwritten text.Recently,by introducing machine learning,especially convolutional neural networks(CNNs),the recognition accuracy of various handwriting patterns is steadily improved.In this paper,a deep CNN model is developed to further improve the recognition rate of the MNIST hand-written digit dataset with a fast-converging rate in training.The proposed model comes with a multi-layer deep arrange structure,including 3 convolution and acti-vation layers for feature extraction and 2 fully connected layers(i.e.,dense layers)for classification.The model’s hyperparameters,such as the batch sizes,kernel sizes,batch normalization,activation function,and learning rate are optimized to enhance the recognition performance.The average classification accuracy of the proposed methodology is found to reach 99.82%on the training dataset and 99.40%on the testing dataset,making it a nearly error-free system for MNIST recognition.
基金This work was supported by the National Nature Science Foundation of China(Grant No.61702347).
文摘In the field of educational examination,automatic marking technology plays an essential role in improving the efficiency of marking and liberating the labor force.At present,the implementation of the policy of expanding erolments has caused a serious decline in the teacher-student ratio in colleges and universities.The traditional marking system based on Optical Mark Reader technology can no longer meet the requirements of liberating the labor force of teachers in small and medium-sized examinations.With the development of image processing and artificial neural network technology,the recognition of handwritten character in the field of pattern recognition has attracted the attention of many researchers.In this paper,filtering and de-noise processing and binary processing are used as preprocessing methods for handwriting recognition.Extract the pixel feature of handwritten characters through digital image processing of handwritten character pictures,and then,get the feature vector from these feature fragments and use it as the description of the character.The extracted feature values are used to train the neural network to realize the recognition of handwritten English letters and numerical characters.Experimental results on Chars74K and MNIST data sets show that the recognition accuracy of some handwritten English letters and numerical characters can reach 90%and 99%,respectively.
文摘Given the number and variety of methods used for handwriting recognition, ithas been shown that there is no single method that can be called the ''best''. In recent years, thecombination of different classifiers and the use of contextual information have become major areasof interest in improving recognition results. This paper addresses a case study on the combinationof multiple classifiers and the integration of syntactic level information for the recognition ofhandwritten Arabic literal amounts. To the best of our knowledge, this is the first time either ofthese methods has been applied to Arabic word recognition. Using three individual classifiers withhigh level global features, we performed word recognition experiments. A parallel combination methodwas tested for all possible configuration cases of the three chosen classifiers. A syntacticanalyzer makes a final decision on the candidate words generated by the best configuration scheme.The effectiveness of contextual knowledge integration in our application is Arabicconfirmed by theobtained results.
基金Supported by the National Natural Science Foundation of China (61065001)
文摘Based on the analysis of the unique shapes and writing styles of Uyghur characters,we design a framework for prototype character recognition system and carry out a systematic theoretical and experimental research on its modules.In the preprocessing procedure,we use the linear and nonlinear normalization based on dot density method.Both structural and statistical features are extracted due to the fact that there are some very similar characters in Uyghur literature.In clustering analysis,we adopt the dynamic clustering algorithm based on the minimum spanning tree(MST),and use the k-nearest neighbor matching classification as classifier.The testing results of prototype system show that the recognition rates for characters of the four different types(independent,suffix,intermediate,and initial type) are 74.67%,70.42%,63.33%,and 72.02%,respectively;the recognition rates for the case of five candidates for those characters are 94.34%,94.19%,93.15%,and 95.86%,respectively.The ideas and methods used in this paper have some commonality and usefulness for the recognition of other characters that belong to Altaic languages family.