This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The go...This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The goal is to contribute to the preservation and understanding of historical texts,showcasing the potential of modern deep learning methods in archaeological research.Our research culminates in several key findings and scientific contributions.We comprehensively compare the performance of YOLOv8 and Roboflow 3.0 in the context of Palmyrene character segmentation—this comparative analysis mainly focuses on the strengths and weaknesses of each algorithm in this context.We also created and annotated an extensive dataset of Palmyrene inscriptions,a crucial resource for further research in the field.The dataset serves for training and evaluating the segmentation models.We employ comparative evaluation metrics to quantitatively assess the segmentation results,ensuring the reliability and reproducibility of our findings and we present custom visualization tools for predicted segmentation masks.Our study advances the state of the art in semi-automatic reading of Palmyrene inscriptions and establishes a benchmark for future research.The availability of the Palmyrene dataset and the insights into algorithm performance contribute to the broader understanding of historical text analysis.展开更多
Charaterization, in literature, is the presentation of the hero or heroine in order to make them credible to the readers. A book with welldrawn characters is sure to touch its readers. Atticus Finch, an elaboratelypor...Charaterization, in literature, is the presentation of the hero or heroine in order to make them credible to the readers. A book with welldrawn characters is sure to touch its readers. Atticus Finch, an elaboratelyportrayed character in Miss Lee′s book To Kill A Mockingbird is a fineexample. Mainly because of its successful and adroit characterization,展开更多
The purpose of this paper is to propose a new multi stage algorithm for the recognition of isolated characters. It was similar work done before using only the center of gravity (This paper is extended version of “A f...The purpose of this paper is to propose a new multi stage algorithm for the recognition of isolated characters. It was similar work done before using only the center of gravity (This paper is extended version of “A fast recognition system for isolated printed characters using center of gravity”, LAP LAMBERT Academic Publishing 2011, ISBN: 978-38465-0002-6), but here we add using principal axis in order to make the algorithm rotation invariant. In my previous work which is published in LAP LAMBERT, I face a big problem that when the character is rotated I can’t recognize the character. So this adds constrain on the document to be well oriented but here I use the principal axis in order to unify the orientation of the character set and the characters in the scanned document. The algorithm can be applied for any isolated character such as Latin, Chinese, Japanese, and Arabic characters but it has been applied in this paper for Arabic characters. The approach uses normalized and isolated characters of the same size and extracts an image signature based on the center of gravity of the character after making the character principal axis vertical, and then the system compares these values to a set of signatures for typical characters of the set. The system then provides the closeness of match to all other characters in the set.展开更多
In this paper, a kind of practical image segmentation algorithm for segment characters from car license plate is presented, based on morphology and labeling. First by morphological operation, noise in the binary image...In this paper, a kind of practical image segmentation algorithm for segment characters from car license plate is presented, based on morphology and labeling. First by morphological operation, noise in the binary image of license plate can be greatly decreased. Then, by labeling, each connected pixel component is given a unique label. Finally, by the known data of license plate, each character is extracted correctly. The advantage of this method is that it can deal with plates with different sizes and connected characters plates, and inclined plates. The experiment results show that it is an effective way to extract characters from the license plate, and can be put into practical use.展开更多
Manuscript preprocessing is the earliest stage in transliteration process of manuscripts in Javanese scripts. Manuscript preprocessing stage is aimed to produce images of letters which form the manuscripts to be proce...Manuscript preprocessing is the earliest stage in transliteration process of manuscripts in Javanese scripts. Manuscript preprocessing stage is aimed to produce images of letters which form the manuscripts to be processed further in manuscript transliteration system. There are four main steps in manuscript preprocessing, which are manuscript binarization, noise reduction, line segmentation, and character segmentation for every line image produced by line segmentation. The result of the test on parts of PB.A57 manuscript which contains 291 character images, with 95% level of confidence concluded that the success percentage of preprocessing in producing Javanese character images ranged 85.9% - 94.82%.展开更多
Recently,HUGY has become quite popular in the Chinese market.The character can been seen everywhere,from its emojis,memes,cartoon stories,and art toys,to T-shirts,candies,garments.HUGY is a cartoon of a cute puppy,who...Recently,HUGY has become quite popular in the Chinese market.The character can been seen everywhere,from its emojis,memes,cartoon stories,and art toys,to T-shirts,candies,garments.HUGY is a cartoon of a cute puppy,who is always smiling widely and reaching out his arms,ready to hug you.We invited the character’s creator,Lina Ju for an interview.Lina Ju comes from South Korea but has been working in China for 10 years.She is the chief designer of GENMEC,a trendy brand belonging to Sums Model,a company based in the south of China.展开更多
Vehicle license plate (VLP) character segmentation is an important part of the vehicle license plate recognition system (VLPRS).This paper proposes a least square method (LSM) to treat horizontal tilt and vertical til...Vehicle license plate (VLP) character segmentation is an important part of the vehicle license plate recognition system (VLPRS).This paper proposes a least square method (LSM) to treat horizontal tilt and vertical tilt in VLP images.Auxiliary lines are added into the image (or the tilt-corrected image) to make the separated parts of each Chinese character to be an interconnected region.The noise regions will be eliminated after two fusing images are merged according to the minimum principle of gray values. Then,the characters are segmented by projection method (PM) and the final character images are obtained.The experimental results show that this method features fast processing and good performance in segmentation.展开更多
A fast knowledge based recognition method of the harbor target in large gray remote-sensing image is presented. First, the distributed features and the inherent feature are analyzed according to the knowledge of harbo...A fast knowledge based recognition method of the harbor target in large gray remote-sensing image is presented. First, the distributed features and the inherent feature are analyzed according to the knowledge of harbor targets; then, two methods for extracting the candidate region of harbor are devised in accordance with different sizes of the harbors; after that, thresholds are used to segment the land and the sea with strategies of the segmentation error control; finally, harbor recognition is implemented according to its inherent character (semi-closed region of seawater).展开更多
Segmenting the touching objects in an image has been remaining as a hot subject due to the problematic complexities, and a vast number of algorithms designed to tackle this issue have come into being since a decade ag...Segmenting the touching objects in an image has been remaining as a hot subject due to the problematic complexities, and a vast number of algorithms designed to tackle this issue have come into being since a decade ago. In this paper, a new granule segmentation algorithm is developed using saddle point as the cutting point. The image is binarized and then sequentially eroded to form a gray-scale topographic counterpart, followed by using Hessian matrix computation to search for the saddle point. The segmentation is performed by cutting through the saddle point and along the maximal gradient path on the topographic surface. The results of the algorithm test on the given real images indicate certain superiorities in both the segmenting robustness and execution time to the referenced methods.展开更多
In many image analysis and processing problems, discriminating the size and shape of each individual object in an aggregate pile projected in an image is an important practice. It is relatively easy to distinguish the...In many image analysis and processing problems, discriminating the size and shape of each individual object in an aggregate pile projected in an image is an important practice. It is relatively easy to distinguish these features among the objects already separated from each other. The problems will be undoubtedly more complex and of greater challenge if the objects are touched or/and overlapped. This letter presents an algorithm that can be used to separate the touches and overlaps existing in the objects within a 2-D image. The approach is first to convert the gray-scale image to its corresponding binary one and then to the 3-D topographic one using the erosion operations. A template (or mask) is engineered to search the topographic surface for the saddle point, from which the segmenting orientation is determined followed by the desired separating operation. The algorithm is tested on a real image and the running result is adequately satisfying and encouraging.展开更多
An image segmentation algorithm of the restrained fuzzy Kohonen clustering network (RFKCN) based on high- dimension fuzzy character is proposed. The algorithm includes two steps. The first step is the fuzzification ...An image segmentation algorithm of the restrained fuzzy Kohonen clustering network (RFKCN) based on high- dimension fuzzy character is proposed. The algorithm includes two steps. The first step is the fuzzification of pixels in which two redundant images are built by fuzzy mean value and fuzzy median value. The second step is to construct a three-dimensional (3-D) feature vector of redundant images and their original images and cluster the feature vector through RFKCN, to realize image seg- mentation. The proposed algorithm fully takes into account not only gray distribution information of pixels, but also relevant information and fuzzy information among neighboring pixels in constructing 3- D character space. Based on the combination of competitiveness, redundancy and complementary of the information, the proposed algorithm improves the accuracy of clustering. Theoretical anal- yses and experimental results demonstrate that the proposed algorithm has a good segmentation performance.展开更多
A local and global context representation learning model for Chinese characters is designed and a Chinese word segmentation method based on character representations is proposed in this paper. First, the proposed Chin...A local and global context representation learning model for Chinese characters is designed and a Chinese word segmentation method based on character representations is proposed in this paper. First, the proposed Chinese character learning model uses the semanties of loeal context and global context to learn the representation of Chinese characters. Then, Chinese word segmentation model is built by a neural network, while the segmentation model is trained with the eharaeter representations as its input features. Finally, experimental results show that Chinese charaeter representations can effectively learn the semantic information. Characters with similar semantics cluster together in the visualize space. Moreover, the proposed Chinese word segmentation model also achieves a pretty good improvement on precision, recall and f-measure.展开更多
This paper discusses methods for character extraction on the basis of statistical and structural features of gray_level images,and proposes a dynamic local contrast threshold method accommodating to line width.Precise...This paper discusses methods for character extraction on the basis of statistical and structural features of gray_level images,and proposes a dynamic local contrast threshold method accommodating to line width.Precise locating of character string is realized by exploiting horizontal projection and character arrangements of binary images in horizontal and vertical directions respectively.Also discussed is the method for segmentation of characters in binary images,which is based on projection taking stroke width and character sizes into account.A new method for character identification is explored,which is based on compound neural networks.A complex neural network consists of two sub_nets,the first sub_net performs self_association of patterns via 2_dimentional local_connected 3_order networks,the second sub_net,linking with a locally connected BP networks,performs classification.The reliability of the network recognition is reinforced by introducing conditions for identification denial.Experiments confirm that the proposed methods possess the advantages of impressive robustness,rapid processing and high accuracy of identification.展开更多
A new approach to extract and segment characters in natural scenes was proposed in this paper. First, a set of intrinsic features were calculated based on connected components (CCs) extracted by a non-linear Nilblack ...A new approach to extract and segment characters in natural scenes was proposed in this paper. First, a set of intrinsic features were calculated based on connected components (CCs) extracted by a non-linear Nilblack algorithm. Then, feature propagation was conducted for feature enhancement, under the constraint of the layout relations. Next, candidate CCs were fed into classifiers with the enhanced feature vector. At last, a model-based hierarchical merging (MHM) procedure was presented to obtain understandable characters. The proposed merging algorithm utilized the constraint of text lines for specific languages and dynamically merges CCs into characters. The whole algorithm was evaluated at both pixel level and character level, experimental results showed that the proposed method is effective in detecting scene characters with significant geometric variations, uneven illumination, extremely low contrast and cluttered background.展开更多
To understand the responses of flag leaf shape in rice to elevated CO2 environment and their genetic characteristics, quantitative trait loci (QTLs) for flag leaf shape in rice were mapped onto the molecular marker ...To understand the responses of flag leaf shape in rice to elevated CO2 environment and their genetic characteristics, quantitative trait loci (QTLs) for flag leaf shape in rice were mapped onto the molecular marker linkage map of chromosome segment substitution lines (CSSLs) derived from a cross between a japonica variety Asominori and an indica variety IR24 under free air carbon dioxide enrichment (FACE, 200 μmol/mol above current levels) and current CO2 concentration (Ambient, about 370 μmol/mol). Three flag-leaf traits, flag-leaf length (LL), width (LW) and the ratio of LL to LW (RLW), were estimated for each CSSL and their parental varieties. The differences in LL, LW and RLW between parents and in LL and LW within IR24 between FACE and Ambient were significant at 1% level. The continuous distributions and transgressive segregations of LL, LW and RLW were also observed in CSSL population, showing that the three traits were quantitatively inherited under both FACE and Ambient. A total of 16 QTLs for the three traits were detected on chromosomes 1, 2, 3, 4, 6, 8 and 11 with LOD (Log10-1ikelihood ratio) scores ranging from 3.0 to 6.7. Among them, four QTLs (qLL-6*, qLL-8* qLW-4* and qRLW-6*) were commonly detected under both FACE and Ambient. Therefore, based on the different responses to elevated CO2 in comparison with current CO2 level, it can be suggested that the expressions of several QTLs associated with flag-leaf shape in rice could be induced by the high CO2 level.展开更多
This paper presents a methodology for off-line handwritten Chinese character recognition based on mergence of consecutive segments of adaptive duration. The handwritten Chinese character string is partitioned into a s...This paper presents a methodology for off-line handwritten Chinese character recognition based on mergence of consecutive segments of adaptive duration. The handwritten Chinese character string is partitioned into a sequence of consecutive segments, which are combined to implement dissimilarity evaluation within a sliding window whose durations are determined adaptively by the integration of shapes and context of evaluations. The average stroke width is estimated for the handwritten Chinese character string, and a set of candidate character segmentation boundaries is found by using the integration of pixel and stroke features. The final decisions on segmentation and recognition are made under minimal arithmetical mean dissimilarities. Experiments proved that the proposed approach of adaptive duration outperforms the method of fixed duration, and is very effective for the recognition of overlapped, broken, touched, loosely configured Chinese characters.展开更多
ESA is an unsupervised approach to word segmentation previously proposed by Wang, which is an iterative process consisting of three phases: Evaluation, Selection and Adjustment. In this article, we propose Ex ESA, the...ESA is an unsupervised approach to word segmentation previously proposed by Wang, which is an iterative process consisting of three phases: Evaluation, Selection and Adjustment. In this article, we propose Ex ESA, the extension of ESA. In Ex ESA, the original approach is extended to a 2-pass process and the ratio of different word lengths is introduced as the third type of information combined with cohesion and separation. A maximum strategy is adopted to determine the best segmentation of a character sequence in the phrase of Selection. Besides, in Adjustment, Ex ESA re-evaluates separation information and individual information to overcome the overestimation frequencies. Additionally, a smoothing algorithm is applied to alleviate sparseness. The experiment results show that Ex ESA can further improve the performance and is time-saving by properly utilizing more information from un-annotated corpora. Moreover, the parameters of Ex ESA can be predicted by a set of empirical formulae or combined with the minimum description length principle.展开更多
The segmentation of individual words into characters is a vital process in handwritten character recognition systems. In this paper, a novel approach is proposed to segment handwritten Arabic text (words). We consider...The segmentation of individual words into characters is a vital process in handwritten character recognition systems. In this paper, a novel approach is proposed to segment handwritten Arabic text (words). We consider the “Naskh” font style. The segmentation algorithm employs seven agents in order to detect regions where segmentation is illegal. Feature points (end points) are extracted from the remaining regions of the word-image. Initially, the middle of every two successive end points is considered as a candidate segmentation point based on a set of rules. The experimental results are very promising as we achieved a success rate of 86%.展开更多
基金The results and knowledge included herein have been obtained owing to support from the following institutional grant.Internal grant agency of the Faculty of Economics and Management,Czech University of Life Sciences Prague,Grant No.2023A0004-“Text Segmentation Methods of Historical Alphabets in OCR Development”.https://iga.pef.czu.cz/.Funds were granted to T.Novák,A.Hamplová,O.Svojše,and A.Veselýfrom the author team.
文摘This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The goal is to contribute to the preservation and understanding of historical texts,showcasing the potential of modern deep learning methods in archaeological research.Our research culminates in several key findings and scientific contributions.We comprehensively compare the performance of YOLOv8 and Roboflow 3.0 in the context of Palmyrene character segmentation—this comparative analysis mainly focuses on the strengths and weaknesses of each algorithm in this context.We also created and annotated an extensive dataset of Palmyrene inscriptions,a crucial resource for further research in the field.The dataset serves for training and evaluating the segmentation models.We employ comparative evaluation metrics to quantitatively assess the segmentation results,ensuring the reliability and reproducibility of our findings and we present custom visualization tools for predicted segmentation masks.Our study advances the state of the art in semi-automatic reading of Palmyrene inscriptions and establishes a benchmark for future research.The availability of the Palmyrene dataset and the insights into algorithm performance contribute to the broader understanding of historical text analysis.
文摘Charaterization, in literature, is the presentation of the hero or heroine in order to make them credible to the readers. A book with welldrawn characters is sure to touch its readers. Atticus Finch, an elaboratelyportrayed character in Miss Lee′s book To Kill A Mockingbird is a fineexample. Mainly because of its successful and adroit characterization,
文摘The purpose of this paper is to propose a new multi stage algorithm for the recognition of isolated characters. It was similar work done before using only the center of gravity (This paper is extended version of “A fast recognition system for isolated printed characters using center of gravity”, LAP LAMBERT Academic Publishing 2011, ISBN: 978-38465-0002-6), but here we add using principal axis in order to make the algorithm rotation invariant. In my previous work which is published in LAP LAMBERT, I face a big problem that when the character is rotated I can’t recognize the character. So this adds constrain on the document to be well oriented but here I use the principal axis in order to unify the orientation of the character set and the characters in the scanned document. The algorithm can be applied for any isolated character such as Latin, Chinese, Japanese, and Arabic characters but it has been applied in this paper for Arabic characters. The approach uses normalized and isolated characters of the same size and extracts an image signature based on the center of gravity of the character after making the character principal axis vertical, and then the system compares these values to a set of signatures for typical characters of the set. The system then provides the closeness of match to all other characters in the set.
文摘In this paper, a kind of practical image segmentation algorithm for segment characters from car license plate is presented, based on morphology and labeling. First by morphological operation, noise in the binary image of license plate can be greatly decreased. Then, by labeling, each connected pixel component is given a unique label. Finally, by the known data of license plate, each character is extracted correctly. The advantage of this method is that it can deal with plates with different sizes and connected characters plates, and inclined plates. The experiment results show that it is an effective way to extract characters from the license plate, and can be put into practical use.
文摘Manuscript preprocessing is the earliest stage in transliteration process of manuscripts in Javanese scripts. Manuscript preprocessing stage is aimed to produce images of letters which form the manuscripts to be processed further in manuscript transliteration system. There are four main steps in manuscript preprocessing, which are manuscript binarization, noise reduction, line segmentation, and character segmentation for every line image produced by line segmentation. The result of the test on parts of PB.A57 manuscript which contains 291 character images, with 95% level of confidence concluded that the success percentage of preprocessing in producing Javanese character images ranged 85.9% - 94.82%.
文摘Recently,HUGY has become quite popular in the Chinese market.The character can been seen everywhere,from its emojis,memes,cartoon stories,and art toys,to T-shirts,candies,garments.HUGY is a cartoon of a cute puppy,who is always smiling widely and reaching out his arms,ready to hug you.We invited the character’s creator,Lina Ju for an interview.Lina Ju comes from South Korea but has been working in China for 10 years.She is the chief designer of GENMEC,a trendy brand belonging to Sums Model,a company based in the south of China.
基金Scientific Research Fund of Hunan Province,PRC (No.07JJ6141)Scientific Research Fund of Hunan Provincial Education Department,PRC (No.05C720).
文摘Vehicle license plate (VLP) character segmentation is an important part of the vehicle license plate recognition system (VLPRS).This paper proposes a least square method (LSM) to treat horizontal tilt and vertical tilt in VLP images.Auxiliary lines are added into the image (or the tilt-corrected image) to make the separated parts of each Chinese character to be an interconnected region.The noise regions will be eliminated after two fusing images are merged according to the minimum principle of gray values. Then,the characters are segmented by projection method (PM) and the final character images are obtained.The experimental results show that this method features fast processing and good performance in segmentation.
文摘A fast knowledge based recognition method of the harbor target in large gray remote-sensing image is presented. First, the distributed features and the inherent feature are analyzed according to the knowledge of harbor targets; then, two methods for extracting the candidate region of harbor are devised in accordance with different sizes of the harbors; after that, thresholds are used to segment the land and the sea with strategies of the segmentation error control; finally, harbor recognition is implemented according to its inherent character (semi-closed region of seawater).
基金Ningbo Natural Science Foundation (No.2006A610016)Foundation of the Ministry of Education Ministry for Returned Overseas Students & Scholars (SRF for ROCS, SEM. No.2006699).
文摘Segmenting the touching objects in an image has been remaining as a hot subject due to the problematic complexities, and a vast number of algorithms designed to tackle this issue have come into being since a decade ago. In this paper, a new granule segmentation algorithm is developed using saddle point as the cutting point. The image is binarized and then sequentially eroded to form a gray-scale topographic counterpart, followed by using Hessian matrix computation to search for the saddle point. The segmentation is performed by cutting through the saddle point and along the maximal gradient path on the topographic surface. The results of the algorithm test on the given real images indicate certain superiorities in both the segmenting robustness and execution time to the referenced methods.
基金Suppprted by the Scientific Research Start-up foundation of Ningbo University (No.2004037)Zhejiang Provincial Foundation for Returned Overseas Students and Scholars (No.2004884).
文摘In many image analysis and processing problems, discriminating the size and shape of each individual object in an aggregate pile projected in an image is an important practice. It is relatively easy to distinguish these features among the objects already separated from each other. The problems will be undoubtedly more complex and of greater challenge if the objects are touched or/and overlapped. This letter presents an algorithm that can be used to separate the touches and overlaps existing in the objects within a 2-D image. The approach is first to convert the gray-scale image to its corresponding binary one and then to the 3-D topographic one using the erosion operations. A template (or mask) is engineered to search the topographic surface for the saddle point, from which the segmenting orientation is determined followed by the desired separating operation. The algorithm is tested on a real image and the running result is adequately satisfying and encouraging.
基金supported by the National Natural Science Foundation of China(61073106)the Aerospace Science and Technology Innovation Fund(CASC201105)
文摘An image segmentation algorithm of the restrained fuzzy Kohonen clustering network (RFKCN) based on high- dimension fuzzy character is proposed. The algorithm includes two steps. The first step is the fuzzification of pixels in which two redundant images are built by fuzzy mean value and fuzzy median value. The second step is to construct a three-dimensional (3-D) feature vector of redundant images and their original images and cluster the feature vector through RFKCN, to realize image seg- mentation. The proposed algorithm fully takes into account not only gray distribution information of pixels, but also relevant information and fuzzy information among neighboring pixels in constructing 3- D character space. Based on the combination of competitiveness, redundancy and complementary of the information, the proposed algorithm improves the accuracy of clustering. Theoretical anal- yses and experimental results demonstrate that the proposed algorithm has a good segmentation performance.
基金Supported by the National Natural Science Foundation of China(No.61303179,U1135005,61175020)
文摘A local and global context representation learning model for Chinese characters is designed and a Chinese word segmentation method based on character representations is proposed in this paper. First, the proposed Chinese character learning model uses the semanties of loeal context and global context to learn the representation of Chinese characters. Then, Chinese word segmentation model is built by a neural network, while the segmentation model is trained with the eharaeter representations as its input features. Finally, experimental results show that Chinese charaeter representations can effectively learn the semantic information. Characters with similar semantics cluster together in the visualize space. Moreover, the proposed Chinese word segmentation model also achieves a pretty good improvement on precision, recall and f-measure.
文摘This paper discusses methods for character extraction on the basis of statistical and structural features of gray_level images,and proposes a dynamic local contrast threshold method accommodating to line width.Precise locating of character string is realized by exploiting horizontal projection and character arrangements of binary images in horizontal and vertical directions respectively.Also discussed is the method for segmentation of characters in binary images,which is based on projection taking stroke width and character sizes into account.A new method for character identification is explored,which is based on compound neural networks.A complex neural network consists of two sub_nets,the first sub_net performs self_association of patterns via 2_dimentional local_connected 3_order networks,the second sub_net,linking with a locally connected BP networks,performs classification.The reliability of the network recognition is reinforced by introducing conditions for identification denial.Experiments confirm that the proposed methods possess the advantages of impressive robustness,rapid processing and high accuracy of identification.
文摘A new approach to extract and segment characters in natural scenes was proposed in this paper. First, a set of intrinsic features were calculated based on connected components (CCs) extracted by a non-linear Nilblack algorithm. Then, feature propagation was conducted for feature enhancement, under the constraint of the layout relations. Next, candidate CCs were fed into classifiers with the enhanced feature vector. At last, a model-based hierarchical merging (MHM) procedure was presented to obtain understandable characters. The proposed merging algorithm utilized the constraint of text lines for specific languages and dynamically merges CCs into characters. The whole algorithm was evaluated at both pixel level and character level, experimental results showed that the proposed method is effective in detecting scene characters with significant geometric variations, uneven illumination, extremely low contrast and cluttered background.
基金The study was supported by the National Natural Science Foundation, China (Grant Nos. 30270800 and 40231003)
文摘To understand the responses of flag leaf shape in rice to elevated CO2 environment and their genetic characteristics, quantitative trait loci (QTLs) for flag leaf shape in rice were mapped onto the molecular marker linkage map of chromosome segment substitution lines (CSSLs) derived from a cross between a japonica variety Asominori and an indica variety IR24 under free air carbon dioxide enrichment (FACE, 200 μmol/mol above current levels) and current CO2 concentration (Ambient, about 370 μmol/mol). Three flag-leaf traits, flag-leaf length (LL), width (LW) and the ratio of LL to LW (RLW), were estimated for each CSSL and their parental varieties. The differences in LL, LW and RLW between parents and in LL and LW within IR24 between FACE and Ambient were significant at 1% level. The continuous distributions and transgressive segregations of LL, LW and RLW were also observed in CSSL population, showing that the three traits were quantitatively inherited under both FACE and Ambient. A total of 16 QTLs for the three traits were detected on chromosomes 1, 2, 3, 4, 6, 8 and 11 with LOD (Log10-1ikelihood ratio) scores ranging from 3.0 to 6.7. Among them, four QTLs (qLL-6*, qLL-8* qLW-4* and qRLW-6*) were commonly detected under both FACE and Ambient. Therefore, based on the different responses to elevated CO2 in comparison with current CO2 level, it can be suggested that the expressions of several QTLs associated with flag-leaf shape in rice could be induced by the high CO2 level.
文摘This paper presents a methodology for off-line handwritten Chinese character recognition based on mergence of consecutive segments of adaptive duration. The handwritten Chinese character string is partitioned into a sequence of consecutive segments, which are combined to implement dissimilarity evaluation within a sliding window whose durations are determined adaptively by the integration of shapes and context of evaluations. The average stroke width is estimated for the handwritten Chinese character string, and a set of candidate character segmentation boundaries is found by using the integration of pixel and stroke features. The final decisions on segmentation and recognition are made under minimal arithmetical mean dissimilarities. Experiments proved that the proposed approach of adaptive duration outperforms the method of fixed duration, and is very effective for the recognition of overlapped, broken, touched, loosely configured Chinese characters.
基金supported in part by National Science Foundation of China under Grants No. 61303105 and 61402304the Humanity & Social Science general project of Ministry of Education under Grants No.14YJAZH046+2 种基金the Beijing Natural Science Foundation under Grants No. 4154065the Beijing Educational Committee Science and Technology Development Planned under Grants No.KM201410028017Beijing Key Disciplines of Computer Application Technology
文摘ESA is an unsupervised approach to word segmentation previously proposed by Wang, which is an iterative process consisting of three phases: Evaluation, Selection and Adjustment. In this article, we propose Ex ESA, the extension of ESA. In Ex ESA, the original approach is extended to a 2-pass process and the ratio of different word lengths is introduced as the third type of information combined with cohesion and separation. A maximum strategy is adopted to determine the best segmentation of a character sequence in the phrase of Selection. Besides, in Adjustment, Ex ESA re-evaluates separation information and individual information to overcome the overestimation frequencies. Additionally, a smoothing algorithm is applied to alleviate sparseness. The experiment results show that Ex ESA can further improve the performance and is time-saving by properly utilizing more information from un-annotated corpora. Moreover, the parameters of Ex ESA can be predicted by a set of empirical formulae or combined with the minimum description length principle.
文摘The segmentation of individual words into characters is a vital process in handwritten character recognition systems. In this paper, a novel approach is proposed to segment handwritten Arabic text (words). We consider the “Naskh” font style. The segmentation algorithm employs seven agents in order to detect regions where segmentation is illegal. Feature points (end points) are extracted from the remaining regions of the word-image. Initially, the middle of every two successive end points is considered as a candidate segmentation point based on a set of rules. The experimental results are very promising as we achieved a success rate of 86%.