This paper analyzes the progress of handwritten Chinese character recognition technology,from two perspectives:traditional recognition methods and deep learning-based recognition methods.Firstly,the complexity of Chin...This paper analyzes the progress of handwritten Chinese character recognition technology,from two perspectives:traditional recognition methods and deep learning-based recognition methods.Firstly,the complexity of Chinese character recognition is pointed out,including its numerous categories,complex structure,and the problem of similar characters,especially the variability of handwritten Chinese characters.Subsequently,recognition methods based on feature optimization,model optimization,and fusion techniques are highlighted.The fusion studies between feature optimization and model improvement are further explored,and these studies further enhance the recognition effect through complementary advantages.Finally,the article summarizes the current challenges of Chinese character recognition technology,including accuracy improvement,model complexity,and real-time problems,and looks forward to future research directions.展开更多
The classification for handwritten Chinese character recognition can be viewed as a transformation in discrete vector space. In this paper, from the point of discrete vector space transformation, a new 4-corner codes ...The classification for handwritten Chinese character recognition can be viewed as a transformation in discrete vector space. In this paper, from the point of discrete vector space transformation, a new 4-corner codes classifier based on decision tree inductive learning algorithm ID3 for handwritten Chinese characters is presented. With a feature extraction controller, the classifier can reduce the number of extracted features and accelerate classification speed. Experimental results show that the 4-corner codes classifier performs well on both recognition accuracy and speed.展开更多
Music is an extraordinary bridge between people all over the world so much as to be called a universal language. Idols and B-boys stages are fun, touching, and fantastic. Today, South Korean students are excited and e...Music is an extraordinary bridge between people all over the world so much as to be called a universal language. Idols and B-boys stages are fun, touching, and fantastic. Today, South Korean students are excited and enthusiastic about their colorful dance moves. The study is about creative educational methods that use K-pop music videos to learn the proverbs and old words that our ancestors learned to keep in mind and teach. K-pop lyrics are a rich reflection of the experiences of life and the world in which people are living today. Accordingly, this study can present new teaching and learning method examples that are used in class related to the old language associated with K-pop lyrics and can also introduce interesting class materials.展开更多
A local and global context representation learning model for Chinese characters is designed and a Chinese word segmentation method based on character representations is proposed in this paper. First, the proposed Chin...A local and global context representation learning model for Chinese characters is designed and a Chinese word segmentation method based on character representations is proposed in this paper. First, the proposed Chinese character learning model uses the semanties of loeal context and global context to learn the representation of Chinese characters. Then, Chinese word segmentation model is built by a neural network, while the segmentation model is trained with the eharaeter representations as its input features. Finally, experimental results show that Chinese charaeter representations can effectively learn the semantic information. Characters with similar semantics cluster together in the visualize space. Moreover, the proposed Chinese word segmentation model also achieves a pretty good improvement on precision, recall and f-measure.展开更多
An N-gram Chinese language model incorporating linguistic rules is presented. By constructing elements lattice, rules information is incorporated in statistical frame. To facilitate the hybrid modeling, novel methods ...An N-gram Chinese language model incorporating linguistic rules is presented. By constructing elements lattice, rules information is incorporated in statistical frame. To facilitate the hybrid modeling, novel methods such as MI-based rule evaluating, weighted rule quantification and element-based n-gram probability approximation are presented. Dynamic Viterbi algorithm is adopted to search the best path in lattice. To strengthen the model, transformation-based error-driven rules learning is adopted. Applying proposed model to Chinese Pinyin-to-character conversion, high performance has been achieved in accuracy, flexibility and robustness simultaneously. Tests show correct rate achieves 94.81% instead of 90.53% using bi-gram Markov model alone. Many long-distance dependency and recursion in language can be processed effectively.展开更多
There is a positive transfer from native language vocabulary learning strategy to that of the second language. The comparison between them shows that the traditional Chinese character learning strategies have profound...There is a positive transfer from native language vocabulary learning strategy to that of the second language. The comparison between them shows that the traditional Chinese character learning strategies have profound effect on English vocabulary learning on the basis of morphology, lexicon as well as discourse categories. If the mutual effect can be applied in English vocabulary learning effectively, positive transfer emerges.展开更多
Embodied semantics theory asserts that the meaning of action-related words is neurally represented through networks that overlap with or are identical to networks involved in sensory-motor processing. While some studi...Embodied semantics theory asserts that the meaning of action-related words is neurally represented through networks that overlap with or are identical to networks involved in sensory-motor processing. While some studies supporting this theory have focused on Chinese characters, less attention has been paid to their semantic radicals. Indeed, there is still disagreement about whether these radicals are processed independently. The present study investigated whether radicals are processed separately and, if so, whether this processing occurs in sensory-motor regions. Materials consisted of 72 high-frequency Chinese characters, with 18 in each of four categories: hand-action verbs with and without hand-radicals, and verbs not related to hand actions, with and without hand-radicals. Twenty-eight participants underwent functional MRI scans while reading the characters. Compared to characters without hand-radicals, reading characters with hand-radicals activated the right medial frontal gyrus. Verbs involving hand-action activated the left inferior parietal lobule, possibly reflecting integration of information in the radical with the semantic meaning of the verb. The findings may be consistent with embodied semantics theory and suggest that neural representation of radicals is indispensable in processing Chinese characters.展开更多
The past decade has seen the rapid development of text detection based on deep learning.However,current methods of Chinese character detection and recognition have proven to be poor.The accuracy of segmenting text box...The past decade has seen the rapid development of text detection based on deep learning.However,current methods of Chinese character detection and recognition have proven to be poor.The accuracy of segmenting text boxes in natural scenes is not impressive.The reasons for this strait can be summarized into two points:the complexity of natural scenes and numerous types of Chinese characters.In response to these problems,we proposed a lightweight neural network architecture named CTSF.It consists of two modules,one is a text detection network that combines CTPN and the image feature extraction modules of PVANet,named CDSE.The other is a literacy network based on spatial pyramid pool and fusion of Chinese character skeleton features named SPPCNN-SF,so as to realize the text detection and recognition,respectively.Our model performs much better than the original model on ICDAR2011 and ICDAR2013(achieved 85%and 88%F-measures)and enhanced the processing speed in training phase.In addition,our method achieves extremely performance on three Chinese datasets,with accuracy of 95.12%,95.56%and 96.01%.展开更多
目的为了提升生成对抗网络汉字风格迁移的图像生成质量,实现汉字智能生成在字库产业中的实际应用,提出了一种基于直观汉字构形学的条件生成对抗网络字体生成优化方法(Optimizationof Conditional Fonts Generation with Chinese Charact...目的为了提升生成对抗网络汉字风格迁移的图像生成质量,实现汉字智能生成在字库产业中的实际应用,提出了一种基于直观汉字构形学的条件生成对抗网络字体生成优化方法(Optimizationof Conditional Fonts Generation with Chinese Character Configuration GANs,C^(3)-GAN)。方法建构了直观汉字构形模组(C^(3)Module),该模组包含了利于条件生成对抗网络进行汉字构形语义特征学习的全特征汉字字符集。C^(3)-GAN在条件生成对抗网络模型下进行字体生成训练,降低了必要训练样本数量,实现对字体生成效果的优化。结果使用C^(3)-GAN生成汉字图像的清晰度更高、字形更准确。在图像相似性定量评估中,使用C^(3)-GAN的实验组相比于其他模型,获得了更高的相似值和更小的误差值。结论使用C^(3)-GAN可以降低必要训练样本数量、提升汉字图像质量。在实际项目中具有一定的应用性和可操作性。展开更多
文摘This paper analyzes the progress of handwritten Chinese character recognition technology,from two perspectives:traditional recognition methods and deep learning-based recognition methods.Firstly,the complexity of Chinese character recognition is pointed out,including its numerous categories,complex structure,and the problem of similar characters,especially the variability of handwritten Chinese characters.Subsequently,recognition methods based on feature optimization,model optimization,and fusion techniques are highlighted.The fusion studies between feature optimization and model improvement are further explored,and these studies further enhance the recognition effect through complementary advantages.Finally,the article summarizes the current challenges of Chinese character recognition technology,including accuracy improvement,model complexity,and real-time problems,and looks forward to future research directions.
文摘The classification for handwritten Chinese character recognition can be viewed as a transformation in discrete vector space. In this paper, from the point of discrete vector space transformation, a new 4-corner codes classifier based on decision tree inductive learning algorithm ID3 for handwritten Chinese characters is presented. With a feature extraction controller, the classifier can reduce the number of extracted features and accelerate classification speed. Experimental results show that the 4-corner codes classifier performs well on both recognition accuracy and speed.
文摘Music is an extraordinary bridge between people all over the world so much as to be called a universal language. Idols and B-boys stages are fun, touching, and fantastic. Today, South Korean students are excited and enthusiastic about their colorful dance moves. The study is about creative educational methods that use K-pop music videos to learn the proverbs and old words that our ancestors learned to keep in mind and teach. K-pop lyrics are a rich reflection of the experiences of life and the world in which people are living today. Accordingly, this study can present new teaching and learning method examples that are used in class related to the old language associated with K-pop lyrics and can also introduce interesting class materials.
基金Supported by the National Natural Science Foundation of China(No.61303179,U1135005,61175020)
文摘A local and global context representation learning model for Chinese characters is designed and a Chinese word segmentation method based on character representations is proposed in this paper. First, the proposed Chinese character learning model uses the semanties of loeal context and global context to learn the representation of Chinese characters. Then, Chinese word segmentation model is built by a neural network, while the segmentation model is trained with the eharaeter representations as its input features. Finally, experimental results show that Chinese charaeter representations can effectively learn the semantic information. Characters with similar semantics cluster together in the visualize space. Moreover, the proposed Chinese word segmentation model also achieves a pretty good improvement on precision, recall and f-measure.
文摘An N-gram Chinese language model incorporating linguistic rules is presented. By constructing elements lattice, rules information is incorporated in statistical frame. To facilitate the hybrid modeling, novel methods such as MI-based rule evaluating, weighted rule quantification and element-based n-gram probability approximation are presented. Dynamic Viterbi algorithm is adopted to search the best path in lattice. To strengthen the model, transformation-based error-driven rules learning is adopted. Applying proposed model to Chinese Pinyin-to-character conversion, high performance has been achieved in accuracy, flexibility and robustness simultaneously. Tests show correct rate achieves 94.81% instead of 90.53% using bi-gram Markov model alone. Many long-distance dependency and recursion in language can be processed effectively.
文摘There is a positive transfer from native language vocabulary learning strategy to that of the second language. The comparison between them shows that the traditional Chinese character learning strategies have profound effect on English vocabulary learning on the basis of morphology, lexicon as well as discourse categories. If the mutual effect can be applied in English vocabulary learning effectively, positive transfer emerges.
基金supported by a grant from Ministry of Education,Taiwan,China under the Aiming for the Top University Plan at Taiwan Normal University,China
文摘Embodied semantics theory asserts that the meaning of action-related words is neurally represented through networks that overlap with or are identical to networks involved in sensory-motor processing. While some studies supporting this theory have focused on Chinese characters, less attention has been paid to their semantic radicals. Indeed, there is still disagreement about whether these radicals are processed independently. The present study investigated whether radicals are processed separately and, if so, whether this processing occurs in sensory-motor regions. Materials consisted of 72 high-frequency Chinese characters, with 18 in each of four categories: hand-action verbs with and without hand-radicals, and verbs not related to hand actions, with and without hand-radicals. Twenty-eight participants underwent functional MRI scans while reading the characters. Compared to characters without hand-radicals, reading characters with hand-radicals activated the right medial frontal gyrus. Verbs involving hand-action activated the left inferior parietal lobule, possibly reflecting integration of information in the radical with the semantic meaning of the verb. The findings may be consistent with embodied semantics theory and suggest that neural representation of radicals is indispensable in processing Chinese characters.
基金This work is supported by the National Natural Science Foundation of China(61872231,61701297).
文摘The past decade has seen the rapid development of text detection based on deep learning.However,current methods of Chinese character detection and recognition have proven to be poor.The accuracy of segmenting text boxes in natural scenes is not impressive.The reasons for this strait can be summarized into two points:the complexity of natural scenes and numerous types of Chinese characters.In response to these problems,we proposed a lightweight neural network architecture named CTSF.It consists of two modules,one is a text detection network that combines CTPN and the image feature extraction modules of PVANet,named CDSE.The other is a literacy network based on spatial pyramid pool and fusion of Chinese character skeleton features named SPPCNN-SF,so as to realize the text detection and recognition,respectively.Our model performs much better than the original model on ICDAR2011 and ICDAR2013(achieved 85%and 88%F-measures)and enhanced the processing speed in training phase.In addition,our method achieves extremely performance on three Chinese datasets,with accuracy of 95.12%,95.56%and 96.01%.
文摘目的为了提升生成对抗网络汉字风格迁移的图像生成质量,实现汉字智能生成在字库产业中的实际应用,提出了一种基于直观汉字构形学的条件生成对抗网络字体生成优化方法(Optimizationof Conditional Fonts Generation with Chinese Character Configuration GANs,C^(3)-GAN)。方法建构了直观汉字构形模组(C^(3)Module),该模组包含了利于条件生成对抗网络进行汉字构形语义特征学习的全特征汉字字符集。C^(3)-GAN在条件生成对抗网络模型下进行字体生成训练,降低了必要训练样本数量,实现对字体生成效果的优化。结果使用C^(3)-GAN生成汉字图像的清晰度更高、字形更准确。在图像相似性定量评估中,使用C^(3)-GAN的实验组相比于其他模型,获得了更高的相似值和更小的误差值。结论使用C^(3)-GAN可以降低必要训练样本数量、提升汉字图像质量。在实际项目中具有一定的应用性和可操作性。