This paper analyzes the progress of handwritten Chinese character recognition technology,from two perspectives:traditional recognition methods and deep learning-based recognition methods.Firstly,the complexity of Chin...This paper analyzes the progress of handwritten Chinese character recognition technology,from two perspectives:traditional recognition methods and deep learning-based recognition methods.Firstly,the complexity of Chinese character recognition is pointed out,including its numerous categories,complex structure,and the problem of similar characters,especially the variability of handwritten Chinese characters.Subsequently,recognition methods based on feature optimization,model optimization,and fusion techniques are highlighted.The fusion studies between feature optimization and model improvement are further explored,and these studies further enhance the recognition effect through complementary advantages.Finally,the article summarizes the current challenges of Chinese character recognition technology,including accuracy improvement,model complexity,and real-time problems,and looks forward to future research directions.展开更多
This paper presents a cascaded Hidden Markov Model (HMM), which allows state's transition, skip and duration. The cascaded HMM extends the way of HMM pattern description of Handwritten Chinese Character (HCC) and...This paper presents a cascaded Hidden Markov Model (HMM), which allows state's transition, skip and duration. The cascaded HMM extends the way of HMM pattern description of Handwritten Chinese Character (HCC) and depicts the behavior of handwritten curve more reliably in terms of the statistic probability. Hence character segmentation and labeling are unnecessary. Viterbi algorithm is integrated in the cascaded HMM after the whole sample sequence of a HCC is input. More than 26,000 component samples are used tor training 407 handwritten component HMMs. At the improved training stage 94 models of 94 Chinese characters are gained by 32,000 samples, Compared with the Segment HMMs approach, the recognition rate of this model tier the tirst candidate is 87.89% and the error rate could be reduced by 12.4%.展开更多
The application of pattern recognition technology enables us to solve various human-computer interaction problems that were difficult to solve before.Handwritten Chinese character recognition,as a hot research object ...The application of pattern recognition technology enables us to solve various human-computer interaction problems that were difficult to solve before.Handwritten Chinese character recognition,as a hot research object in image pattern recognition,has many applications in people’s daily life,and more and more scholars are beginning to study off-line handwritten Chinese character recognition.This paper mainly studies the recognition of handwritten Chinese characters by BP(Back Propagation)neural network.Establish a handwritten Chinese character recognition model based on BP neural network,and then verify the accuracy and feasibility of the neural network through GUI(Graphical User Interface)model established by Matlab.This paper mainly includes the following aspects:Firstly,the preprocessing process of handwritten Chinese character recognition in this paper is analyzed.Among them,image preprocessing mainly includes six processes:graying,binarization,smoothing and denoising,character segmentation,histogram equalization and normalization.Secondly,through the comparative selection of feature extraction methods for handwritten Chinese characters,and through the comparative analysis of the results of three different feature extraction methods,the most suitable feature extraction method for this paper is found.Finally,it is the application of BP neural network in handwritten Chinese character recognition.The establishment,training process and parameter selection of BP neural network are described in detail.The simulation software platform chosen in this paper is Matlab,and the sample images are used to train BP neural network to verify the feasibility of Chinese character recognition.Design the GUI interface of human-computer interaction based on Matlab,show the process and results of handwritten Chinese character recognition,and analyze the experimental results.展开更多
Moment invariants firstly introduced by M. K Hu in 1962, has some shortcomings. After counting a large number of statistical distribution information of Chinese characters,the authors put forward the concept of inform...Moment invariants firstly introduced by M. K Hu in 1962, has some shortcomings. After counting a large number of statistical distribution information of Chinese characters,the authors put forward the concept of information moments and demonstrate its invariance to translation,rotation and scaling.Also they perform the experiment in which information moments compared with moment invaiants for the effects of similar Chinese characters and font recognition.At last they show the recognition rate of 88% by information moments,with 70% by moment inariants.展开更多
In this paper, a new parallel compact integration scheme based on multi-layer perceptron (MLP) networks is proposed to solve handwritten Chinese character recognition (HCCR) problems. The idea of metasynthesis is appl...In this paper, a new parallel compact integration scheme based on multi-layer perceptron (MLP) networks is proposed to solve handwritten Chinese character recognition (HCCR) problems. The idea of metasynthesis is applied to HCCR, and compact MLP network classifier is defined. Human intelligence and computer capabilities are combined together effectively through a procedure of two-step supervised learning. Compared with previous integration schemes, this scheme is characterized with parallel compact structure and better performance. It provides a promising way for applying MLP to large vocabulary classification.展开更多
This paper presents a new linguistic decoding method for online handwritten Chinese character recognition. The method employs a hybrid language model which combines N-gram and linguistic rules by rule quantification t...This paper presents a new linguistic decoding method for online handwritten Chinese character recognition. The method employs a hybrid language model which combines N-gram and linguistic rules by rule quantification technique. The linguistic decoding algorithm consists of three stages: word lattice construction, the optimal sentence hypothesis search and self-adaptive learning mechanism. The technique has been applied to palmtop computer's online handwritten Chinese character recognition. Samples containing millions of characters were used to test the linguistic decoder. In the open experiment, accuracy rate up to 92% is achieved, and the error rate is reduced by 68%.展开更多
Embodied semantics theory asserts that the meaning of action-related words is neurally represented through networks that overlap with or are identical to networks involved in sensory-motor processing. While some studi...Embodied semantics theory asserts that the meaning of action-related words is neurally represented through networks that overlap with or are identical to networks involved in sensory-motor processing. While some studies supporting this theory have focused on Chinese characters, less attention has been paid to their semantic radicals. Indeed, there is still disagreement about whether these radicals are processed independently. The present study investigated whether radicals are processed separately and, if so, whether this processing occurs in sensory-motor regions. Materials consisted of 72 high-frequency Chinese characters, with 18 in each of four categories: hand-action verbs with and without hand-radicals, and verbs not related to hand actions, with and without hand-radicals. Twenty-eight participants underwent functional MRI scans while reading the characters. Compared to characters without hand-radicals, reading characters with hand-radicals activated the right medial frontal gyrus. Verbs involving hand-action activated the left inferior parietal lobule, possibly reflecting integration of information in the radical with the semantic meaning of the verb. The findings may be consistent with embodied semantics theory and suggest that neural representation of radicals is indispensable in processing Chinese characters.展开更多
A method, called Two-Dimensional Extended Attribute Grammars (2-DEAGs). for the recognition of hand-printed Chinese characters is presented. This method uses directly two dimensional information, and pro- vides a sche...A method, called Two-Dimensional Extended Attribute Grammars (2-DEAGs). for the recognition of hand-printed Chinese characters is presented. This method uses directly two dimensional information, and pro- vides a scheme for dealing with various kinds of specific cases in a uniform way. In this method, components are drawn in guided and redundant way and reductions are made level by level just in accordance with the com- ponent combination relations of Chinese characters. The method provides also polysemous grammars, coexisting grammars and structure inferrings which constrain redundant recognition by comparison among similar characters or components and greatly increase the tolerance ability to distortion.展开更多
A good language model is essential to a postprocessing algorithm for recognition systems. In the past, researchers have presented various language models, such as character based language models, word based language m...A good language model is essential to a postprocessing algorithm for recognition systems. In the past, researchers have presented various language models, such as character based language models, word based language model, syntactical rules language model, hybrid models, etc . The word N gram model is by far an effective and efficient model, but one has to address the problem of data sparseness in establishing the model. Katz and Kneser et al. respectively presented effective remedies to solve this challenging problem. In this study, we proposed an improvement to their methods by incorporating Chinese language specific information or Chinese word class information into the system.展开更多
文摘This paper analyzes the progress of handwritten Chinese character recognition technology,from two perspectives:traditional recognition methods and deep learning-based recognition methods.Firstly,the complexity of Chinese character recognition is pointed out,including its numerous categories,complex structure,and the problem of similar characters,especially the variability of handwritten Chinese characters.Subsequently,recognition methods based on feature optimization,model optimization,and fusion techniques are highlighted.The fusion studies between feature optimization and model improvement are further explored,and these studies further enhance the recognition effect through complementary advantages.Finally,the article summarizes the current challenges of Chinese character recognition technology,including accuracy improvement,model complexity,and real-time problems,and looks forward to future research directions.
文摘This paper presents a cascaded Hidden Markov Model (HMM), which allows state's transition, skip and duration. The cascaded HMM extends the way of HMM pattern description of Handwritten Chinese Character (HCC) and depicts the behavior of handwritten curve more reliably in terms of the statistic probability. Hence character segmentation and labeling are unnecessary. Viterbi algorithm is integrated in the cascaded HMM after the whole sample sequence of a HCC is input. More than 26,000 component samples are used tor training 407 handwritten component HMMs. At the improved training stage 94 models of 94 Chinese characters are gained by 32,000 samples, Compared with the Segment HMMs approach, the recognition rate of this model tier the tirst candidate is 87.89% and the error rate could be reduced by 12.4%.
文摘The application of pattern recognition technology enables us to solve various human-computer interaction problems that were difficult to solve before.Handwritten Chinese character recognition,as a hot research object in image pattern recognition,has many applications in people’s daily life,and more and more scholars are beginning to study off-line handwritten Chinese character recognition.This paper mainly studies the recognition of handwritten Chinese characters by BP(Back Propagation)neural network.Establish a handwritten Chinese character recognition model based on BP neural network,and then verify the accuracy and feasibility of the neural network through GUI(Graphical User Interface)model established by Matlab.This paper mainly includes the following aspects:Firstly,the preprocessing process of handwritten Chinese character recognition in this paper is analyzed.Among them,image preprocessing mainly includes six processes:graying,binarization,smoothing and denoising,character segmentation,histogram equalization and normalization.Secondly,through the comparative selection of feature extraction methods for handwritten Chinese characters,and through the comparative analysis of the results of three different feature extraction methods,the most suitable feature extraction method for this paper is found.Finally,it is the application of BP neural network in handwritten Chinese character recognition.The establishment,training process and parameter selection of BP neural network are described in detail.The simulation software platform chosen in this paper is Matlab,and the sample images are used to train BP neural network to verify the feasibility of Chinese character recognition.Design the GUI interface of human-computer interaction based on Matlab,show the process and results of handwritten Chinese character recognition,and analyze the experimental results.
基金supported by the Specical Fund of Taishan Scholar of Shandong Province
文摘Moment invariants firstly introduced by M. K Hu in 1962, has some shortcomings. After counting a large number of statistical distribution information of Chinese characters,the authors put forward the concept of information moments and demonstrate its invariance to translation,rotation and scaling.Also they perform the experiment in which information moments compared with moment invaiants for the effects of similar Chinese characters and font recognition.At last they show the recognition rate of 88% by information moments,with 70% by moment inariants.
文摘In this paper, a new parallel compact integration scheme based on multi-layer perceptron (MLP) networks is proposed to solve handwritten Chinese character recognition (HCCR) problems. The idea of metasynthesis is applied to HCCR, and compact MLP network classifier is defined. Human intelligence and computer capabilities are combined together effectively through a procedure of two-step supervised learning. Compared with previous integration schemes, this scheme is characterized with parallel compact structure and better performance. It provides a promising way for applying MLP to large vocabulary classification.
文摘This paper presents a new linguistic decoding method for online handwritten Chinese character recognition. The method employs a hybrid language model which combines N-gram and linguistic rules by rule quantification technique. The linguistic decoding algorithm consists of three stages: word lattice construction, the optimal sentence hypothesis search and self-adaptive learning mechanism. The technique has been applied to palmtop computer's online handwritten Chinese character recognition. Samples containing millions of characters were used to test the linguistic decoder. In the open experiment, accuracy rate up to 92% is achieved, and the error rate is reduced by 68%.
基金supported by a grant from Ministry of Education,Taiwan,China under the Aiming for the Top University Plan at Taiwan Normal University,China
文摘Embodied semantics theory asserts that the meaning of action-related words is neurally represented through networks that overlap with or are identical to networks involved in sensory-motor processing. While some studies supporting this theory have focused on Chinese characters, less attention has been paid to their semantic radicals. Indeed, there is still disagreement about whether these radicals are processed independently. The present study investigated whether radicals are processed separately and, if so, whether this processing occurs in sensory-motor regions. Materials consisted of 72 high-frequency Chinese characters, with 18 in each of four categories: hand-action verbs with and without hand-radicals, and verbs not related to hand actions, with and without hand-radicals. Twenty-eight participants underwent functional MRI scans while reading the characters. Compared to characters without hand-radicals, reading characters with hand-radicals activated the right medial frontal gyrus. Verbs involving hand-action activated the left inferior parietal lobule, possibly reflecting integration of information in the radical with the semantic meaning of the verb. The findings may be consistent with embodied semantics theory and suggest that neural representation of radicals is indispensable in processing Chinese characters.
基金Supported by the National Natural Science Foundation of China,No.6883024.
文摘A method, called Two-Dimensional Extended Attribute Grammars (2-DEAGs). for the recognition of hand-printed Chinese characters is presented. This method uses directly two dimensional information, and pro- vides a scheme for dealing with various kinds of specific cases in a uniform way. In this method, components are drawn in guided and redundant way and reductions are made level by level just in accordance with the com- ponent combination relations of Chinese characters. The method provides also polysemous grammars, coexisting grammars and structure inferrings which constrain redundant recognition by comparison among similar characters or components and greatly increase the tolerance ability to distortion.
文摘A good language model is essential to a postprocessing algorithm for recognition systems. In the past, researchers have presented various language models, such as character based language models, word based language model, syntactical rules language model, hybrid models, etc . The word N gram model is by far an effective and efficient model, but one has to address the problem of data sparseness in establishing the model. Katz and Kneser et al. respectively presented effective remedies to solve this challenging problem. In this study, we proposed an improvement to their methods by incorporating Chinese language specific information or Chinese word class information into the system.