Aim To study the parking management in the condition of vehicles' increasing. Methods The methods of pattern recognition and image processing were used to analyze the eigenvalues of parking lot images. Results ...Aim To study the parking management in the condition of vehicles' increasing. Methods The methods of pattern recognition and image processing were used to analyze the eigenvalues of parking lot images. Results The automatic identification of every parking place in the parking plot was realized. The automatic measuring of parked vehicle count and parking lot utilization was completed. Conclusion It can complete the real time recognition, and has some practicabilities.展开更多
The paper covers analysis and investigation of lighting automation system in low-traffic long-roads. The main objective is to provide optimal solution between expensive safe design that utilizes continuous street ligh...The paper covers analysis and investigation of lighting automation system in low-traffic long-roads. The main objective is to provide optimal solution between expensive safe design that utilizes continuous street lighting system at night for the entire road, or inexpensive design that sacrifices the safety, relying on using vehicles lighting, to eliminate the problem of high cost energy consumption during the night operation of the road. By taking into account both of these factors, smart lighting automation system is proposed using Pattern Recognition Technique applied on vehicle number-plates. In this proposal, the road is sectionalized into zones, and based on smart Pattern Recognition Technique, the control system of the road lighting illuminates only the zone that the vehicles pass through. Economic analysis is provided in this paper to support the value of using this design of lighting control system.展开更多
With characteristics of log cross-section image taken into consideration,this paper presents a new image processing algorithm for recognization and measurement of log cross sections,by which the number and area of qua...With characteristics of log cross-section image taken into consideration,this paper presents a new image processing algorithm for recognization and measurement of log cross sections,by which the number and area of quasi-circular log cross sections can be calculated automatically,thereby obtaining the total cross-section area and log volume.展开更多
In this study,microcomputer image processing and pattern recognition technology,and the knowledge of morphology and optical characteristics of Cryptococcus neoformans were used for identification of Cryptococcus neofo...In this study,microcomputer image processing and pattern recognition technology,and the knowledge of morphology and optical characteristics of Cryptococcus neoformans were used for identification of Cryptococcus neoformans.Four groups of mice were lethally infected with standard strain,Wuhan strain,American B-2643 strain and Var.Shanghainesis of the Cryptococcus neoformans.The samplescollected included mice brain,lung,kidney,liver,small intestine tissue and were observed under a light microscope.More than 600 images of the fungus were input into a microcomputer.A system of computer for automatic identification of the Cryptoccocus neoformans was developed. The technique involved image preprocessing,imagesegmenting,coding of line-length on the edge,curve fitting,extracting of image feature,building of image library and feature data bank etc..And then,768 images of the clinical samples and other fungus samples whose morphological features tend to be confused with Cryptococcus neoformans were input into microcomputer and subjected to automatic identification.The Cryptococcus neoformans was accurately identified within 15 min,and the consistence rate with results of routine culture was 98%.展开更多
The objective of this work is to provide an automatic system to count white blood cells in a blood smear. To do so an experiment was assembled, composed by a standard microscope with two step motors coupled to its kno...The objective of this work is to provide an automatic system to count white blood cells in a blood smear. To do so an experiment was assembled, composed by a standard microscope with two step motors coupled to its knobs in order to move the microscope in x and y directions and a web cam which was mounted in the top of the microscope responsible for to acquire images from the smear. The step motors and the web cam are controlled by a microcomputer PC standard via software developed inDelphi. The motors use the parallel port to communicate with the PC and the camera use the USB port. The main idea is to set an initial point into the smear and the automated system will carry over the smear acquiring images (frames with 640 × 480 pixels) and counting the white blood cells encountered. The double histogram threshold technique is implemented to initially exclude the red cells from the image leaving only the white ones. Preliminaries results are obtained and show that the system is quite fast and has a good capacity of selection, even when different kinds of smear are used.展开更多
A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and less intra-class variance. In the sec-...A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and less intra-class variance. In the sec- ond step, the discriminant-based methods or clustering-based methods are performed on the reformed distribution. It is focused on the typical clustering methods-Gaussian mixture model (GMM) and its variant to demonstrate the feasibility of the framework. Due to the independence of the first step in its second step, it can be integrated into the pixel-based and the histogram-based methods to improve their segmentation quality. The experiments on artificial and real images show that the framework can achieve effective and robust segmentation results.展开更多
This letter proposes an effective method for recognizing face images by combining two-Dimen- sional Principal Component Analysis (2DPCA) with IMage Euclidean Distance (IMED) method. The proposed method is comprised of...This letter proposes an effective method for recognizing face images by combining two-Dimen- sional Principal Component Analysis (2DPCA) with IMage Euclidean Distance (IMED) method. The proposed method is comprised of four main stages. The first stage uses the wavelet decomposition to extract low fre- quency subimages from original face images and omits the other three subimages. The second stage concerns the application of IMED to face images. In the third stage, 2DPCA is employed to extract the face features from the processed results in the second stage. Finally, Support Vector Machine (SVM) is applied to classify the extracted face features. Experimental results on the AR face image database show that the proposed method yields better recognition performance in comparison with the 2DPCA method that is not combined with IMED.展开更多
The pharmaceutical industry increasingly values medicinal plants due to their perceived safety and costeffectiveness compared to modern drugs.Throughout the extensive history of medicinal plant usage,various plant par...The pharmaceutical industry increasingly values medicinal plants due to their perceived safety and costeffectiveness compared to modern drugs.Throughout the extensive history of medicinal plant usage,various plant parts,including flowers,leaves,and roots,have been acknowledged for their healing properties and employed in plant identification.Leaf images,however,stand out as the preferred and easily accessible source of information.Manual plant identification by plant taxonomists is intricate,time-consuming,and prone to errors,relying heavily on human perception.Artificial intelligence(AI)techniques offer a solution by automating plant recognition processes.This study thoroughly examines cutting-edge AI approaches for leaf image-based plant identification,drawing insights from literature across renowned repositories.This paper critically summarizes relevant literature based on AI algorithms,extracted features,and results achieved.Additionally,it analyzes extensively used datasets in automated plant classification research.It also offers deep insights into implemented techniques and methods employed for medicinal plant recognition.Moreover,this rigorous review study discusses opportunities and challenges in employing these AI-based approaches.Furthermore,in-depth statistical findings and lessons learned from this survey are highlighted with novel research areas with the aim of offering insights to the readers and motivating new research directions.This review is expected to serve as a foundational resource for future researchers in the field of AI-based identification of medicinal plants.展开更多
Depending on the techniques of pattern recognition and image processing, we established a computer analytic system for photocclusal image. The analysing results made by this system are more accurate and reliable than ...Depending on the techniques of pattern recognition and image processing, we established a computer analytic system for photocclusal image. The analysing results made by this system are more accurate and reliable than those by the naked eye and grid for analysing photocclusal image. We analysed photocclusal images for a patient with prematurity of lower first right molar be fore and after occlusal adjustment with the system. The result appeared that occlusal adjustment mainly brought about distributive variation of occlusal stress rather than alteration of absolute value of overall occlusal force.展开更多
Arabic Sign Language recognition is an emerging field of research. Previous attempts at automatic vision-based recog-nition of Arabic Sign Language mainly focused on finger spelling and recognizing isolated gestures. ...Arabic Sign Language recognition is an emerging field of research. Previous attempts at automatic vision-based recog-nition of Arabic Sign Language mainly focused on finger spelling and recognizing isolated gestures. In this paper we report the first continuous Arabic Sign Language by building on existing research in feature extraction and pattern recognition. The development of the presented work required collecting a continuous Arabic Sign Language database which we designed and recorded in cooperation with a sign language expert. We intend to make the collected database available for the research community. Our system which we based on spatio-temporal feature extraction and hidden Markov models has resulted in an average word recognition rate of 94%, keeping in the mind the use of a high perplex-ity vocabulary and unrestrictive grammar. We compare our proposed work against existing sign language techniques based on accumulated image difference and motion estimation. The experimental results section shows that the pro-posed work outperforms existing solutions in terms of recognition accuracy.展开更多
A novel example-based process for Automated Colorization of grayscale images using Texture Descriptors (ACTD) without any human intervention is proposed. By analyzing a set of sample color images, coherent regions of ...A novel example-based process for Automated Colorization of grayscale images using Texture Descriptors (ACTD) without any human intervention is proposed. By analyzing a set of sample color images, coherent regions of homogeneous textures are extracted. A multi-channel filtering technique is used for texture-based image segmentation, combined with a modified Fuzzy C-means (FCM) clustering algorithm. This modified FCM clustering algorithm includes both the local spatial information from neighboring pixels, and the spatial Euclidian distance to the cluster’s center of gravity. For each area of interest, state-of-the-art texture descriptors are then computed and stored, along with corresponding color information. These texture descriptors and the color information are used for colorization of a grayscale image with similar textures. Given a grayscale image to be colorized, the segmentation and feature extraction processes are repeated. The texture descriptors are used to perform Content-Based Image Retrieval (CBIR). The colorization process is performed by Chroma replacement. This research finds numerous applications, ranging from classic film restoration and enhancement, to adding valuable information into medical and satellite imaging. Also, this can be used to enhance the detection of objects from x-ray images at the airports.展开更多
Diabetic retinopathy(DR),the main cause of irreversible blindness,is one of the most common complications of diabetes.At present,deep convolutional neural networks have achieved promising performance in automatic DR d...Diabetic retinopathy(DR),the main cause of irreversible blindness,is one of the most common complications of diabetes.At present,deep convolutional neural networks have achieved promising performance in automatic DR detection tasks.The convolution operation of methods is a local cross-correlation operation,whose receptive field de-termines the size of the local neighbourhood for processing.However,for retinal fundus photographs,there is not only the local information but also long-distance dependence between the lesion features(e.g.hemorrhages and exudates)scattered throughout the whole image.The proposed method incorporates correlations between long-range patches into the deep learning framework to improve DR detection.Patch-wise re-lationships are used to enhance the local patch features since lesions of DR usually appear as plaques.The Long-Range unit in the proposed network with a residual structure can be flexibly embedded into other trained networks.Extensive experimental results demon-strate that the proposed approach can achieve higher accuracy than existing state-of-the-art models on Messidor and EyePACS datasets.展开更多
Transformer tracking always takes paired template and search images as encoder input and conduct feature extraction and target‐search feature correlation by self and/or cross attention operations,thus the model compl...Transformer tracking always takes paired template and search images as encoder input and conduct feature extraction and target‐search feature correlation by self and/or cross attention operations,thus the model complexity will grow quadratically with the number of input images.To alleviate the burden of this tracking paradigm and facilitate practical deployment of Transformer‐based trackers,we propose a dual pooling transformer tracking framework,dubbed as DPT,which consists of three components:a simple yet efficient spatiotemporal attention model(SAM),a mutual correlation pooling Trans-former(MCPT)and a multiscale aggregation pooling Transformer(MAPT).SAM is designed to gracefully aggregates temporal dynamics and spatial appearance information of multi‐frame templates along space‐time dimensions.MCPT aims to capture multi‐scale pooled and correlated contextual features,which is followed by MAPT that aggregates multi‐scale features into a unified feature representation for tracking prediction.DPT tracker achieves AUC score of 69.5 on LaSOT and precision score of 82.8 on Track-ingNet while maintaining a shorter sequence length of attention tokens,fewer parameters and FLOPs compared to existing state‐of‐the‐art(SOTA)Transformer tracking methods.Extensive experiments demonstrate that DPT tracker yields a strong real‐time tracking baseline with a good trade‐off between tracking performance and inference efficiency.展开更多
文摘Aim To study the parking management in the condition of vehicles' increasing. Methods The methods of pattern recognition and image processing were used to analyze the eigenvalues of parking lot images. Results The automatic identification of every parking place in the parking plot was realized. The automatic measuring of parked vehicle count and parking lot utilization was completed. Conclusion It can complete the real time recognition, and has some practicabilities.
文摘The paper covers analysis and investigation of lighting automation system in low-traffic long-roads. The main objective is to provide optimal solution between expensive safe design that utilizes continuous street lighting system at night for the entire road, or inexpensive design that sacrifices the safety, relying on using vehicles lighting, to eliminate the problem of high cost energy consumption during the night operation of the road. By taking into account both of these factors, smart lighting automation system is proposed using Pattern Recognition Technique applied on vehicle number-plates. In this proposal, the road is sectionalized into zones, and based on smart Pattern Recognition Technique, the control system of the road lighting illuminates only the zone that the vehicles pass through. Economic analysis is provided in this paper to support the value of using this design of lighting control system.
文摘With characteristics of log cross-section image taken into consideration,this paper presents a new image processing algorithm for recognization and measurement of log cross sections,by which the number and area of quasi-circular log cross sections can be calculated automatically,thereby obtaining the total cross-section area and log volume.
文摘In this study,microcomputer image processing and pattern recognition technology,and the knowledge of morphology and optical characteristics of Cryptococcus neoformans were used for identification of Cryptococcus neoformans.Four groups of mice were lethally infected with standard strain,Wuhan strain,American B-2643 strain and Var.Shanghainesis of the Cryptococcus neoformans.The samplescollected included mice brain,lung,kidney,liver,small intestine tissue and were observed under a light microscope.More than 600 images of the fungus were input into a microcomputer.A system of computer for automatic identification of the Cryptoccocus neoformans was developed. The technique involved image preprocessing,imagesegmenting,coding of line-length on the edge,curve fitting,extracting of image feature,building of image library and feature data bank etc..And then,768 images of the clinical samples and other fungus samples whose morphological features tend to be confused with Cryptococcus neoformans were input into microcomputer and subjected to automatic identification.The Cryptococcus neoformans was accurately identified within 15 min,and the consistence rate with results of routine culture was 98%.
文摘The objective of this work is to provide an automatic system to count white blood cells in a blood smear. To do so an experiment was assembled, composed by a standard microscope with two step motors coupled to its knobs in order to move the microscope in x and y directions and a web cam which was mounted in the top of the microscope responsible for to acquire images from the smear. The step motors and the web cam are controlled by a microcomputer PC standard via software developed inDelphi. The motors use the parallel port to communicate with the PC and the camera use the USB port. The main idea is to set an initial point into the smear and the automated system will carry over the smear acquiring images (frames with 640 × 480 pixels) and counting the white blood cells encountered. The double histogram threshold technique is implemented to initially exclude the red cells from the image leaving only the white ones. Preliminaries results are obtained and show that the system is quite fast and has a good capacity of selection, even when different kinds of smear are used.
基金Supported by the National Natural Science Foundation of China(60505004,60773061)~~
文摘A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and less intra-class variance. In the sec- ond step, the discriminant-based methods or clustering-based methods are performed on the reformed distribution. It is focused on the typical clustering methods-Gaussian mixture model (GMM) and its variant to demonstrate the feasibility of the framework. Due to the independence of the first step in its second step, it can be integrated into the pixel-based and the histogram-based methods to improve their segmentation quality. The experiments on artificial and real images show that the framework can achieve effective and robust segmentation results.
文摘This letter proposes an effective method for recognizing face images by combining two-Dimen- sional Principal Component Analysis (2DPCA) with IMage Euclidean Distance (IMED) method. The proposed method is comprised of four main stages. The first stage uses the wavelet decomposition to extract low fre- quency subimages from original face images and omits the other three subimages. The second stage concerns the application of IMED to face images. In the third stage, 2DPCA is employed to extract the face features from the processed results in the second stage. Finally, Support Vector Machine (SVM) is applied to classify the extracted face features. Experimental results on the AR face image database show that the proposed method yields better recognition performance in comparison with the 2DPCA method that is not combined with IMED.
文摘The pharmaceutical industry increasingly values medicinal plants due to their perceived safety and costeffectiveness compared to modern drugs.Throughout the extensive history of medicinal plant usage,various plant parts,including flowers,leaves,and roots,have been acknowledged for their healing properties and employed in plant identification.Leaf images,however,stand out as the preferred and easily accessible source of information.Manual plant identification by plant taxonomists is intricate,time-consuming,and prone to errors,relying heavily on human perception.Artificial intelligence(AI)techniques offer a solution by automating plant recognition processes.This study thoroughly examines cutting-edge AI approaches for leaf image-based plant identification,drawing insights from literature across renowned repositories.This paper critically summarizes relevant literature based on AI algorithms,extracted features,and results achieved.Additionally,it analyzes extensively used datasets in automated plant classification research.It also offers deep insights into implemented techniques and methods employed for medicinal plant recognition.Moreover,this rigorous review study discusses opportunities and challenges in employing these AI-based approaches.Furthermore,in-depth statistical findings and lessons learned from this survey are highlighted with novel research areas with the aim of offering insights to the readers and motivating new research directions.This review is expected to serve as a foundational resource for future researchers in the field of AI-based identification of medicinal plants.
文摘Depending on the techniques of pattern recognition and image processing, we established a computer analytic system for photocclusal image. The analysing results made by this system are more accurate and reliable than those by the naked eye and grid for analysing photocclusal image. We analysed photocclusal images for a patient with prematurity of lower first right molar be fore and after occlusal adjustment with the system. The result appeared that occlusal adjustment mainly brought about distributive variation of occlusal stress rather than alteration of absolute value of overall occlusal force.
文摘Arabic Sign Language recognition is an emerging field of research. Previous attempts at automatic vision-based recog-nition of Arabic Sign Language mainly focused on finger spelling and recognizing isolated gestures. In this paper we report the first continuous Arabic Sign Language by building on existing research in feature extraction and pattern recognition. The development of the presented work required collecting a continuous Arabic Sign Language database which we designed and recorded in cooperation with a sign language expert. We intend to make the collected database available for the research community. Our system which we based on spatio-temporal feature extraction and hidden Markov models has resulted in an average word recognition rate of 94%, keeping in the mind the use of a high perplex-ity vocabulary and unrestrictive grammar. We compare our proposed work against existing sign language techniques based on accumulated image difference and motion estimation. The experimental results section shows that the pro-posed work outperforms existing solutions in terms of recognition accuracy.
文摘A novel example-based process for Automated Colorization of grayscale images using Texture Descriptors (ACTD) without any human intervention is proposed. By analyzing a set of sample color images, coherent regions of homogeneous textures are extracted. A multi-channel filtering technique is used for texture-based image segmentation, combined with a modified Fuzzy C-means (FCM) clustering algorithm. This modified FCM clustering algorithm includes both the local spatial information from neighboring pixels, and the spatial Euclidian distance to the cluster’s center of gravity. For each area of interest, state-of-the-art texture descriptors are then computed and stored, along with corresponding color information. These texture descriptors and the color information are used for colorization of a grayscale image with similar textures. Given a grayscale image to be colorized, the segmentation and feature extraction processes are repeated. The texture descriptors are used to perform Content-Based Image Retrieval (CBIR). The colorization process is performed by Chroma replacement. This research finds numerous applications, ranging from classic film restoration and enhancement, to adding valuable information into medical and satellite imaging. Also, this can be used to enhance the detection of objects from x-ray images at the airports.
基金National Natural Science Foundation of China,Grant/Award Numbers:62001141,62272319Science,Technology and Innovation Commission of Shenzhen Municipality,Grant/Award Numbers:GJHZ20210705141812038,JCYJ20210324094413037,JCYJ20210324131800002,RCBS20210609103820029Stable Support Projects for Shenzhen Higher Education Institutions,Grant/Award Number:20220715183602001。
文摘Diabetic retinopathy(DR),the main cause of irreversible blindness,is one of the most common complications of diabetes.At present,deep convolutional neural networks have achieved promising performance in automatic DR detection tasks.The convolution operation of methods is a local cross-correlation operation,whose receptive field de-termines the size of the local neighbourhood for processing.However,for retinal fundus photographs,there is not only the local information but also long-distance dependence between the lesion features(e.g.hemorrhages and exudates)scattered throughout the whole image.The proposed method incorporates correlations between long-range patches into the deep learning framework to improve DR detection.Patch-wise re-lationships are used to enhance the local patch features since lesions of DR usually appear as plaques.The Long-Range unit in the proposed network with a residual structure can be flexibly embedded into other trained networks.Extensive experimental results demon-strate that the proposed approach can achieve higher accuracy than existing state-of-the-art models on Messidor and EyePACS datasets.
基金the National Natural Science Foundation of China,Grant/Award Number:62006065the Science and Technology Research Program of Chongqing Municipal Education Commission,Grant/Award Number:KJQN202100634+1 种基金the Natural Science Foundation of Chongqing,Grant/Award Number:CSTB2022NSCQ‐MSX1202Chongqing Municipal Education Commission,Grant/Award Number:KJQN202100634。
文摘Transformer tracking always takes paired template and search images as encoder input and conduct feature extraction and target‐search feature correlation by self and/or cross attention operations,thus the model complexity will grow quadratically with the number of input images.To alleviate the burden of this tracking paradigm and facilitate practical deployment of Transformer‐based trackers,we propose a dual pooling transformer tracking framework,dubbed as DPT,which consists of three components:a simple yet efficient spatiotemporal attention model(SAM),a mutual correlation pooling Trans-former(MCPT)and a multiscale aggregation pooling Transformer(MAPT).SAM is designed to gracefully aggregates temporal dynamics and spatial appearance information of multi‐frame templates along space‐time dimensions.MCPT aims to capture multi‐scale pooled and correlated contextual features,which is followed by MAPT that aggregates multi‐scale features into a unified feature representation for tracking prediction.DPT tracker achieves AUC score of 69.5 on LaSOT and precision score of 82.8 on Track-ingNet while maintaining a shorter sequence length of attention tokens,fewer parameters and FLOPs compared to existing state‐of‐the‐art(SOTA)Transformer tracking methods.Extensive experiments demonstrate that DPT tracker yields a strong real‐time tracking baseline with a good trade‐off between tracking performance and inference efficiency.