Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (...Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals.展开更多
To improve the feature extraction of ship-radiated noise in a complex ocean environment,a novel feature extraction method for ship-radiated noise based on complete ensemble empirical mode decomposition with adaptive s...To improve the feature extraction of ship-radiated noise in a complex ocean environment,a novel feature extraction method for ship-radiated noise based on complete ensemble empirical mode decomposition with adaptive selective noise(CEEMDASN) and refined composite multiscale fluctuation-based dispersion entropy(RCMFDE) is proposed.CEEMDASN is proposed in this paper which takes into account the high frequency intermittent components when decomposing the signal.In addition,RCMFDE is also proposed in this paper which refines the preprocessing process of the original signal based on composite multi-scale theory.Firstly,the original signal is decomposed into several intrinsic mode functions(IMFs)by CEEMDASN.Energy distribution ratio(EDR) and average energy distribution ratio(AEDR) of all IMF components are calculated.Then,the IMF with the minimum difference between EDR and AEDR(MEDR)is selected as characteristic IMF.The RCMFDE of characteristic IMF is estimated as the feature vectors of ship-radiated noise.Finally,these feature vectors are sent to self-organizing map(SOM) for classifying and identifying.The proposed method is applied to the feature extraction of ship-radiated noise.The result shows its effectiveness and universality.展开更多
The motivation for this article is to propose new damage classifiers based on a supervised learning problem for locating and quantifying damage.A new feature extraction approach using time series analysis is introduce...The motivation for this article is to propose new damage classifiers based on a supervised learning problem for locating and quantifying damage.A new feature extraction approach using time series analysis is introduced to extract damage-sensitive features from auto-regressive models.This approach sets out to improve current feature extraction techniques in the context of time series modeling.The coefficients and residuals of the AR model obtained from the proposed approach are selected as the main features and are applied to the proposed supervised learning classifiers that are categorized as coefficient-based and residual-based classifiers.These classifiers compute the relative errors in the extracted features between the undamaged and damaged states.Eventually,the abilities of the proposed methods to localize and quantify single and multiple damage scenarios are verified by applying experimental data for a laboratory frame and a four-story steel structure.Comparative analyses are performed to validate the superiority of the proposed methods over some existing techniques.Results show that the proposed classifiers,with the aid of extracted features from the proposed feature extraction approach,are able to locate and quantify damage;however,the residual-based classifiers yield better results than the coefficient-based classifiers.Moreover,these methods are superior to some classical techniques.展开更多
A novel approach to extract edge features from wideband echo is proposed. The set of extracted features not only represents the echo waveform in a concise way, but also is sufficient and well suited for classification...A novel approach to extract edge features from wideband echo is proposed. The set of extracted features not only represents the echo waveform in a concise way, but also is sufficient and well suited for classification of non-stationary echo data from objects with different property.The feature extraction is derived from the Discrete Dyadic Wavlet Transform (DDWT) of the echo through the undecimated algorithm. The motivation we use the DDWT is that it is time-shift-invariant which is beneficial for localization of edge, and the wavelet coefficients at larger scale represent the main shape feature of echo, i.e. edge, and the noise and modulated high-frequency components are reduced with scale increased. Some experimental results using real data which contain 144 samples from 4 classes of lake bottoms with different sediments are provided. The results show that our approach is a prospective way to represent wideband echo for reliable recognition of nonstationary echo with great variability.展开更多
To study the characteristics of license plate characters recognition,this paper proposes a method for fea- ture extraction of license plate characters based on two-dimensional wavelet packet.We decompose license plate...To study the characteristics of license plate characters recognition,this paper proposes a method for fea- ture extraction of license plate characters based on two-dimensional wavelet packet.We decompose license plate character images with two dimensional-wavelet packet and search for the optimal wavelet packet basis.This paper pre- sents a criterion of searching for the optimal wavelet packet basis,and a practical algorithm.The obtained optimal wavelet packet basis is used as the feature of license plate character,and a BP neural network is used to classify the character.The test- ing results show that the proposed method achieved higher recognition rate than the traditional methods.展开更多
In this paper,a novel face recognition method,named as wavelet-curvelet-fractal technique,is proposed. Based on the similarities embedded in the images,we propose to utilize the wave-let-curvelet-fractal technique to ...In this paper,a novel face recognition method,named as wavelet-curvelet-fractal technique,is proposed. Based on the similarities embedded in the images,we propose to utilize the wave-let-curvelet-fractal technique to extract facial features. Thus we have the wavelet’s details in diagonal,vertical,and horizontal directions,and the eight curvelet details at different angles. Then we adopt the Euclidean minimum distance classifier to recognize different faces. Extensive comparison tests on dif-ferent data sets are carried out,and higher recognition rate is obtained by the proposed technique.展开更多
This research proposes and implements an Arabic Sub-Words Recognition System (ASWR). The system focuses on employing a combination of statistical and structural features to provide complete pattern's description an...This research proposes and implements an Arabic Sub-Words Recognition System (ASWR). The system focuses on employing a combination of statistical and structural features to provide complete pattern's description and enhances the recognition rate. Support Vector Machines (SVMs) is utilized as a promising pattern recognition tool. In addition to that, the problems of dots and holes are solved in a completely different way from the ones previously employed. The proposed system proceeds in several phases as follows: (1) image acquisition, (2) binarisation, (3) morphological processing, (4) feature extraction, which includes statistical features, i.e., moment invariants, and structural features, i.e., dot number, dot position, and number of holes, features, and (5) classification, using multi-class SVMs and applying a one-against-all technique. The proposed system has been tested using different sets of words and subwords and has achieved a nearly 98.90% recogiaition rate. Comparative results with NNs are also presented.展开更多
In order to effectively make biological image classification and identification, this paper studies the biological owned characteristics, gives an encryption algorithm, and presents a biological classification algorit...In order to effectively make biological image classification and identification, this paper studies the biological owned characteristics, gives an encryption algorithm, and presents a biological classification algorithm based on the encryption process. Through studying the composition characteristics of palm, this paper uses the biological classification algorithm to carry out the classification or recognition of palm, improves the accuracy and efficiency of the existing biological classification and recognition approaches, and compares it with existing main approaches of palm classification by experiments. Experimental results show that this classification approach has the better classification effect, the faster computing speed and the higher classification rate which is improved averagely by 1.46% than those of the main classification approaches.展开更多
An efficient face recognition system with face image representation using averaged wavelet packet coefficients, compact and meaningful feature vectors dimensional reduction and recognition using radial basis function ...An efficient face recognition system with face image representation using averaged wavelet packet coefficients, compact and meaningful feature vectors dimensional reduction and recognition using radial basis function (RBF) neural network is presented. The face images are decomposed by 2-level two-dimensional (2-D) wavelet packet transformation. The wavelet packet coefficients obtained from the wavelet packet transformation are averaged using two different proposed methods. In the first method, wavelet packet coefficients of individual samples of a class are averaged then decomposed. The wavelet packet coefficients of all the samples of a class are averaged in the second method. The averaged wavelet packet coefficients are recognized by a RBF network. The proposed work tested on three face databases such as Olivetti-Oracle Research Lab (ORL), Japanese Female Facial Expression (JAFFE) and Essexface database. The proposed methods result in dimensionality reduction, low computational complexity and provide better recognition rates. The computational complexity is low as the dimensionality of the input pattern is reduced.展开更多
Agriculture plays an important role in the economy of all countries.However,plant diseases may badly affect the quality of food,production,and ultimately the economy.For plant disease detection and management,agricult...Agriculture plays an important role in the economy of all countries.However,plant diseases may badly affect the quality of food,production,and ultimately the economy.For plant disease detection and management,agriculturalists spend a huge amount of money.However,the manual detection method of plant diseases is complicated and time-consuming.Consequently,automated systems for plant disease detection using machine learning(ML)approaches are proposed.However,most of the existing ML techniques of plants diseases recognition are based on handcrafted features and they rarely deal with huge amount of input data.To address the issue,this article proposes a fully automated method for plant disease detection and recognition using deep neural networks.In the proposed method,AlexNet and VGG19 CNNs are considered as pre-trained architectures.It is capable to obtain the feature extraction of the given data with fine-tuning details.After convolutional neural network feature extraction,it selects the best subset of features through the correlation coefficient and feeds them to the number of classifiers including K-Nearest Neighbor,Support Vector Machine,Probabilistic Neural Network,Fuzzy logic,and Artificial Neural Network.The validation of the proposed method is carried out on a self-collected dataset generated through the augmentation step.The achieved average accuracy of our method is more than 96%and outperforms the recent techniques.展开更多
Over the years,the continuous development of new technology has promoted research in the field of posture recognition and also made the application field of posture recognition have been greatly expanded.The purpose o...Over the years,the continuous development of new technology has promoted research in the field of posture recognition and also made the application field of posture recognition have been greatly expanded.The purpose of this paper is to introduce the latest methods of posture recognition and review the various techniques and algorithms of posture recognition in recent years,such as scale-invariant feature transform,histogram of oriented gradients,support vectormachine(SVM),Gaussian mixturemodel,dynamic time warping,hiddenMarkovmodel(HMM),lightweight network,convolutional neural network(CNN).We also investigate improved methods of CNN,such as stacked hourglass networks,multi-stage pose estimation networks,convolutional posemachines,and high-resolution nets.The general process and datasets of posture recognition are analyzed and summarized,and several improved CNNmethods and threemain recognition techniques are compared.In addition,the applications of advanced neural networks in posture recognition,such as transfer learning,ensemble learning,graph neural networks,and explainable deep neural networks,are introduced.It was found that CNN has achieved great success in posture recognition and is favored by researchers.Still,a more in-depth research is needed in feature extraction,information fusion,and other aspects.Among classification methods,HMM and SVM are the most widely used,and lightweight network gradually attracts the attention of researchers.In addition,due to the lack of 3Dbenchmark data sets,data generation is a critical research direction.展开更多
With the arrival of new data acquisition platforms derived from the Internet of Things(IoT),this paper goes beyond the understanding of traditional remote sensing technologies.Deep fusion of remote sensing and compute...With the arrival of new data acquisition platforms derived from the Internet of Things(IoT),this paper goes beyond the understanding of traditional remote sensing technologies.Deep fusion of remote sensing and computer vision has hit the industrial world and makes it possible to apply Artificial intelligence to solve problems such as automatic extraction of information and image interpretation.However,due to the complex architecture of IoT and the lack of a unified security protection mechanism,devices in remote sensing are vulnerable to privacy leaks when sharing data.It is necessary to design a security scheme suitable for computation‐limited devices in IoT,since traditional encryption methods are based on computational complexity.Visual Cryptography(VC)is a threshold scheme for images that can be decoded directly by the human visual system when superimposing encrypted images.The stacking‐to‐see feature and simple Boolean decryption operation make VC an ideal solution for privacy‐preserving recognition for large‐scale remote sensing images in IoT.In this study,the secure and efficient transmission of high‐resolution remote sensing images by meaningful VC is achieved.By diffusing the error between the encryption block and the original block to adjacent blocks,the degradation of quality in recovery images is mitigated.By fine‐tuning the pre‐trained model from large‐scale datasets,we improve the recognition performance of small encryption datasets for remote sensing images.The experimental results show that the proposed lightweight privacy‐preserving recognition framework maintains high recognition performance while enhancing security.展开更多
The thermal-based imaging technique has recently attracted the attention of researchers who are interested in the recognition of human affects dueto its ability to measure the facial transient temperature, which is co...The thermal-based imaging technique has recently attracted the attention of researchers who are interested in the recognition of human affects dueto its ability to measure the facial transient temperature, which is correlated withhuman affects and robustness against illumination changes. Therefore, studieshave increasingly used the thermal imaging as a potential and supplemental solution to overcome the challenges of visual (RGB) imaging, such as the variation oflight conditions and revealing original human affect. Moreover, the thermal-basedimaging has shown promising results in the detection of psychophysiological signals, such as pulse rate and respiration rate in a contactless and noninvasive way.This paper presents a brief review on human affects and focuses on the advantages and challenges of the thermal imaging technique. In addition, this paper discusses the stages of thermal-based human affective state recognition, such asdataset type, preprocessing stage, region of interest (ROI), feature descriptors,and classification approaches with a brief performance analysis based on a number of works in the literature. This analysis could help beginners in the thermalimaging and affective recognition domain to explore numerous approaches usedby researchers to construct an affective state system based on thermal imaging.展开更多
This paper deals with a real-life application of epilepsy classification, where three phases of absence seizure, namely pre-seizure, seizure and seizure-free, are classified using real clinical data. Artificial neural...This paper deals with a real-life application of epilepsy classification, where three phases of absence seizure, namely pre-seizure, seizure and seizure-free, are classified using real clinical data. Artificial neural network (ANN) and support vector machines (SVMs) combined with su- pervised learning algorithms, and k-means clustering (k-MC) combined with unsupervised techniques are employed to classify the three seizure phases. Different techniques to combine binary SVMs, namely One Vs One (OvO), One Vs All (OVA) and Binary Decision Tree (BDT), are employed for multiclass classification. Comparisons are performed with two traditional classification methods, namely, k-Nearest Neighbour (k- NN) and Naive Bayes classifier. It is concluded that SVM-based classifiers outperform the traditional ones in terms of recognition accuracy and robustness property when the original clinical data is distorted with noise. Furthermore, SVM-based classifier with OvO provides the highest recognition accuracy, whereas ANN-based classifier overtakes by demonstrating maximum accuracy in the presence of noise.展开更多
In this work,a system for recognition of newspaper printed in Gurumukhi script is presented.Four feature extraction techniques,namely,zoning features,diagonal features,parabola curve fitting based features,and power c...In this work,a system for recognition of newspaper printed in Gurumukhi script is presented.Four feature extraction techniques,namely,zoning features,diagonal features,parabola curve fitting based features,and power curve fitting based features are considered for extracting the statistical properties of the characters printed in the newspaper.Different combinations of these features are also applied to improve the recognition accuracy.For recognition,four classification techniques,namely,k-NN,linear-SVM,decision tree,and random forest are used.A database for the experiments is collected from three major Gurumukhi script newspapers which are Ajit,Jagbani and Punjabi Tribune.Using 5-fold cross validation and random forest classifier,a recognition accuracy of 96.19%with a combination of zoning features,diagonal features and parabola curve fitting based features has been reported.A recognition accuracy of 95.21%with a partitioning strategy of data set(70%data as training data and remaining 30%data as testing data)has been achieved.展开更多
Facial expression recognition(FER) in video has attracted the increasing interest and many approaches have been made.The crucial problem of classifying a given video sequence into several basic emotions is how to fuse...Facial expression recognition(FER) in video has attracted the increasing interest and many approaches have been made.The crucial problem of classifying a given video sequence into several basic emotions is how to fuse facial features of individual frames.In this paper, a frame-level attention module is integrated into an improved VGG-based frame work and a lightweight facial expression recognition method is proposed.The proposed network takes a sub video cut from an experimental video sequence as its input and generates a fixed-dimension representation.The VGG-based network with an enhanced branch embeds face images into feature vectors.The frame-level attention module learns weights which are used to adaptively aggregate the feature vectors to form a single discriminative video representation.Finally, a regression module outputs the classification results.The experimental results on CK+and AFEW databases show that the recognition rates of the proposed method can achieve the state-of-the-art performance.展开更多
Biometrics represents the technology for measuring the characteristics of the human body.Biometric authentication currently allows for secure,easy,and fast access by recognizing a person based on facial,voice,and fing...Biometrics represents the technology for measuring the characteristics of the human body.Biometric authentication currently allows for secure,easy,and fast access by recognizing a person based on facial,voice,and fingerprint traits.Iris authentication is one of the essential biometric methods for identifying a person.This authentication type has become popular in research and practical applications.Unlike the face and hands,the iris is an internal organ,protected and therefore less likely to be damaged.However,the number of helpful information collected from the iris is much greater than the other biometric human organs.This work proposes a new iris identification model based on a multilevel thresholding technique and modified Fuzzy cmeans algorithm.The multilevel thresholding technique extracts the iris from its surroundings,such as specular reflections,eyelashes,pupils,and sclera.On the other hand,the modified Fuzzy c-means is used to combine and classify the most useful statistical features to maximize the accuracy of the collected information.Therefore,having the most optimal iris recognition.The proposed model results are validated using True Success Rate(TSR)and compared to other existing models.The results show how effective the combination of the two stages of the proposed model is:the Otsu method and modified Fuzzy c-means for the 400 tested images representing 40 people.展开更多
In this paper, an expert system for security based on biometric human features that can be obtained without any contact with the registering sensor is presented. These features are extracted from human’s voice, so th...In this paper, an expert system for security based on biometric human features that can be obtained without any contact with the registering sensor is presented. These features are extracted from human’s voice, so the system is called Voice Recognition System (VRS). The proposed system?consists of a combination of three stages: signal pre-processing, features extraction by using?Wavelet Packet Transform (WPT) and features matching by using Artificial Neural Networks (ANNs). The features vectors are formed after two steps: firstly, decomposing the speech signal at level 7 with Daubechies 20-tap (db20), secondly, the energy corresponding to each WPT node is calculated which collected to form a features vector. One hundred twenty eight features vector for each speaker was fed to the Feed Forward Back-propagation Neural Network (FFBPNN). The data used in this paper are drawn from the English Language Speech Database for Speaker Recognition (ELSDSR) database which composes of audio files for training and other files for testing. The performance of the proposed system is evaluated by using the test files. Our results showed that the rate of correct recognition of the proposed system is about 100% for training files and 95.7% for one testing file for each speaker from the ELSDSR database. The proposed method showed efficiency results were better than the well-known Mel Frequency Cepstral Coefficient (MFCC) and the Zak transform.展开更多
文摘Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals.
基金supported by the National Natural Science Foundation of China under Grant 51709228。
文摘To improve the feature extraction of ship-radiated noise in a complex ocean environment,a novel feature extraction method for ship-radiated noise based on complete ensemble empirical mode decomposition with adaptive selective noise(CEEMDASN) and refined composite multiscale fluctuation-based dispersion entropy(RCMFDE) is proposed.CEEMDASN is proposed in this paper which takes into account the high frequency intermittent components when decomposing the signal.In addition,RCMFDE is also proposed in this paper which refines the preprocessing process of the original signal based on composite multi-scale theory.Firstly,the original signal is decomposed into several intrinsic mode functions(IMFs)by CEEMDASN.Energy distribution ratio(EDR) and average energy distribution ratio(AEDR) of all IMF components are calculated.Then,the IMF with the minimum difference between EDR and AEDR(MEDR)is selected as characteristic IMF.The RCMFDE of characteristic IMF is estimated as the feature vectors of ship-radiated noise.Finally,these feature vectors are sent to self-organizing map(SOM) for classifying and identifying.The proposed method is applied to the feature extraction of ship-radiated noise.The result shows its effectiveness and universality.
文摘The motivation for this article is to propose new damage classifiers based on a supervised learning problem for locating and quantifying damage.A new feature extraction approach using time series analysis is introduced to extract damage-sensitive features from auto-regressive models.This approach sets out to improve current feature extraction techniques in the context of time series modeling.The coefficients and residuals of the AR model obtained from the proposed approach are selected as the main features and are applied to the proposed supervised learning classifiers that are categorized as coefficient-based and residual-based classifiers.These classifiers compute the relative errors in the extracted features between the undamaged and damaged states.Eventually,the abilities of the proposed methods to localize and quantify single and multiple damage scenarios are verified by applying experimental data for a laboratory frame and a four-story steel structure.Comparative analyses are performed to validate the superiority of the proposed methods over some existing techniques.Results show that the proposed classifiers,with the aid of extracted features from the proposed feature extraction approach,are able to locate and quantify damage;however,the residual-based classifiers yield better results than the coefficient-based classifiers.Moreover,these methods are superior to some classical techniques.
文摘A novel approach to extract edge features from wideband echo is proposed. The set of extracted features not only represents the echo waveform in a concise way, but also is sufficient and well suited for classification of non-stationary echo data from objects with different property.The feature extraction is derived from the Discrete Dyadic Wavlet Transform (DDWT) of the echo through the undecimated algorithm. The motivation we use the DDWT is that it is time-shift-invariant which is beneficial for localization of edge, and the wavelet coefficients at larger scale represent the main shape feature of echo, i.e. edge, and the noise and modulated high-frequency components are reduced with scale increased. Some experimental results using real data which contain 144 samples from 4 classes of lake bottoms with different sediments are provided. The results show that our approach is a prospective way to represent wideband echo for reliable recognition of nonstationary echo with great variability.
基金This work was supported by the Natural Science Foundation of Jiangsu Province(Grant No.BK2004077).
文摘To study the characteristics of license plate characters recognition,this paper proposes a method for fea- ture extraction of license plate characters based on two-dimensional wavelet packet.We decompose license plate character images with two dimensional-wavelet packet and search for the optimal wavelet packet basis.This paper pre- sents a criterion of searching for the optimal wavelet packet basis,and a practical algorithm.The obtained optimal wavelet packet basis is used as the feature of license plate character,and a BP neural network is used to classify the character.The test- ing results show that the proposed method achieved higher recognition rate than the traditional methods.
基金Supported by the College of Heilongjiang Province, Electronic Engineering Key Lab Project dzzd200602Heilongjiang Province Educational Bureau Scientific Technology Important Project 11531z18
文摘In this paper,a novel face recognition method,named as wavelet-curvelet-fractal technique,is proposed. Based on the similarities embedded in the images,we propose to utilize the wave-let-curvelet-fractal technique to extract facial features. Thus we have the wavelet’s details in diagonal,vertical,and horizontal directions,and the eight curvelet details at different angles. Then we adopt the Euclidean minimum distance classifier to recognize different faces. Extensive comparison tests on dif-ferent data sets are carried out,and higher recognition rate is obtained by the proposed technique.
文摘This research proposes and implements an Arabic Sub-Words Recognition System (ASWR). The system focuses on employing a combination of statistical and structural features to provide complete pattern's description and enhances the recognition rate. Support Vector Machines (SVMs) is utilized as a promising pattern recognition tool. In addition to that, the problems of dots and holes are solved in a completely different way from the ones previously employed. The proposed system proceeds in several phases as follows: (1) image acquisition, (2) binarisation, (3) morphological processing, (4) feature extraction, which includes statistical features, i.e., moment invariants, and structural features, i.e., dot number, dot position, and number of holes, features, and (5) classification, using multi-class SVMs and applying a one-against-all technique. The proposed system has been tested using different sets of words and subwords and has achieved a nearly 98.90% recogiaition rate. Comparative results with NNs are also presented.
文摘In order to effectively make biological image classification and identification, this paper studies the biological owned characteristics, gives an encryption algorithm, and presents a biological classification algorithm based on the encryption process. Through studying the composition characteristics of palm, this paper uses the biological classification algorithm to carry out the classification or recognition of palm, improves the accuracy and efficiency of the existing biological classification and recognition approaches, and compares it with existing main approaches of palm classification by experiments. Experimental results show that this classification approach has the better classification effect, the faster computing speed and the higher classification rate which is improved averagely by 1.46% than those of the main classification approaches.
文摘An efficient face recognition system with face image representation using averaged wavelet packet coefficients, compact and meaningful feature vectors dimensional reduction and recognition using radial basis function (RBF) neural network is presented. The face images are decomposed by 2-level two-dimensional (2-D) wavelet packet transformation. The wavelet packet coefficients obtained from the wavelet packet transformation are averaged using two different proposed methods. In the first method, wavelet packet coefficients of individual samples of a class are averaged then decomposed. The wavelet packet coefficients of all the samples of a class are averaged in the second method. The averaged wavelet packet coefficients are recognized by a RBF network. The proposed work tested on three face databases such as Olivetti-Oracle Research Lab (ORL), Japanese Female Facial Expression (JAFFE) and Essexface database. The proposed methods result in dimensionality reduction, low computational complexity and provide better recognition rates. The computational complexity is low as the dimensionality of the input pattern is reduced.
基金the MSIT(Ministry of Science and ICT),Korea,under the ITRC(Information Technology Research Center)support program(IITP-2020-2016-0-00312)supervised by the IITP(Institute for Information&Communications Technology Planning&Evaluation)in part by the MSIP(Ministry of Science,ICT&Future Planning),Korea,under the National Program for Excellence in SW)(2015-0-00938)supervised by the IITP(Institute for Information&communications Technology Planning&Evaluation).
文摘Agriculture plays an important role in the economy of all countries.However,plant diseases may badly affect the quality of food,production,and ultimately the economy.For plant disease detection and management,agriculturalists spend a huge amount of money.However,the manual detection method of plant diseases is complicated and time-consuming.Consequently,automated systems for plant disease detection using machine learning(ML)approaches are proposed.However,most of the existing ML techniques of plants diseases recognition are based on handcrafted features and they rarely deal with huge amount of input data.To address the issue,this article proposes a fully automated method for plant disease detection and recognition using deep neural networks.In the proposed method,AlexNet and VGG19 CNNs are considered as pre-trained architectures.It is capable to obtain the feature extraction of the given data with fine-tuning details.After convolutional neural network feature extraction,it selects the best subset of features through the correlation coefficient and feeds them to the number of classifiers including K-Nearest Neighbor,Support Vector Machine,Probabilistic Neural Network,Fuzzy logic,and Artificial Neural Network.The validation of the proposed method is carried out on a self-collected dataset generated through the augmentation step.The achieved average accuracy of our method is more than 96%and outperforms the recent techniques.
基金supported by British Heart Foundation Accelerator Award,UK(AA/18/3/34220)Royal Society International Exchanges Cost Share Award,UK(RP202G0230)+7 种基金Hope Foundation for Cancer Research,UK(RM60G0680)Medical Research Council Confidence in Concept Award,UK(MC_PC_17171)Sino-UK Industrial Fund,UK(RP202G0289)Global Challenges Research Fund(GCRF),UK(P202PF11)LIAS Pioneering Partnerships award,UK(P202ED10)Data Science Enhancement Fund,UK(P202RE237)Fight for Sight,UK(24NN201)Sino-UK Education Fund,UK(OP202006).
文摘Over the years,the continuous development of new technology has promoted research in the field of posture recognition and also made the application field of posture recognition have been greatly expanded.The purpose of this paper is to introduce the latest methods of posture recognition and review the various techniques and algorithms of posture recognition in recent years,such as scale-invariant feature transform,histogram of oriented gradients,support vectormachine(SVM),Gaussian mixturemodel,dynamic time warping,hiddenMarkovmodel(HMM),lightweight network,convolutional neural network(CNN).We also investigate improved methods of CNN,such as stacked hourglass networks,multi-stage pose estimation networks,convolutional posemachines,and high-resolution nets.The general process and datasets of posture recognition are analyzed and summarized,and several improved CNNmethods and threemain recognition techniques are compared.In addition,the applications of advanced neural networks in posture recognition,such as transfer learning,ensemble learning,graph neural networks,and explainable deep neural networks,are introduced.It was found that CNN has achieved great success in posture recognition and is favored by researchers.Still,a more in-depth research is needed in feature extraction,information fusion,and other aspects.Among classification methods,HMM and SVM are the most widely used,and lightweight network gradually attracts the attention of researchers.In addition,due to the lack of 3Dbenchmark data sets,data generation is a critical research direction.
基金supported in part by the National Natural Science Foundation of China under Grants(62250410365,62071084)the Guangdong Basic and Applied Basic Research Foundation of China(2022A1515011542)the Guangzhou Science and technology program of China(202201010606).
文摘With the arrival of new data acquisition platforms derived from the Internet of Things(IoT),this paper goes beyond the understanding of traditional remote sensing technologies.Deep fusion of remote sensing and computer vision has hit the industrial world and makes it possible to apply Artificial intelligence to solve problems such as automatic extraction of information and image interpretation.However,due to the complex architecture of IoT and the lack of a unified security protection mechanism,devices in remote sensing are vulnerable to privacy leaks when sharing data.It is necessary to design a security scheme suitable for computation‐limited devices in IoT,since traditional encryption methods are based on computational complexity.Visual Cryptography(VC)is a threshold scheme for images that can be decoded directly by the human visual system when superimposing encrypted images.The stacking‐to‐see feature and simple Boolean decryption operation make VC an ideal solution for privacy‐preserving recognition for large‐scale remote sensing images in IoT.In this study,the secure and efficient transmission of high‐resolution remote sensing images by meaningful VC is achieved.By diffusing the error between the encryption block and the original block to adjacent blocks,the degradation of quality in recovery images is mitigated.By fine‐tuning the pre‐trained model from large‐scale datasets,we improve the recognition performance of small encryption datasets for remote sensing images.The experimental results show that the proposed lightweight privacy‐preserving recognition framework maintains high recognition performance while enhancing security.
基金funded by the research university grant by Universiti Sains Malaysia[1001.PKOMP.8014001].
文摘The thermal-based imaging technique has recently attracted the attention of researchers who are interested in the recognition of human affects dueto its ability to measure the facial transient temperature, which is correlated withhuman affects and robustness against illumination changes. Therefore, studieshave increasingly used the thermal imaging as a potential and supplemental solution to overcome the challenges of visual (RGB) imaging, such as the variation oflight conditions and revealing original human affect. Moreover, the thermal-basedimaging has shown promising results in the detection of psychophysiological signals, such as pulse rate and respiration rate in a contactless and noninvasive way.This paper presents a brief review on human affects and focuses on the advantages and challenges of the thermal imaging technique. In addition, this paper discusses the stages of thermal-based human affective state recognition, such asdataset type, preprocessing stage, region of interest (ROI), feature descriptors,and classification approaches with a brief performance analysis based on a number of works in the literature. This analysis could help beginners in the thermalimaging and affective recognition domain to explore numerous approaches usedby researchers to construct an affective state system based on thermal imaging.
文摘This paper deals with a real-life application of epilepsy classification, where three phases of absence seizure, namely pre-seizure, seizure and seizure-free, are classified using real clinical data. Artificial neural network (ANN) and support vector machines (SVMs) combined with su- pervised learning algorithms, and k-means clustering (k-MC) combined with unsupervised techniques are employed to classify the three seizure phases. Different techniques to combine binary SVMs, namely One Vs One (OvO), One Vs All (OVA) and Binary Decision Tree (BDT), are employed for multiclass classification. Comparisons are performed with two traditional classification methods, namely, k-Nearest Neighbour (k- NN) and Naive Bayes classifier. It is concluded that SVM-based classifiers outperform the traditional ones in terms of recognition accuracy and robustness property when the original clinical data is distorted with noise. Furthermore, SVM-based classifier with OvO provides the highest recognition accuracy, whereas ANN-based classifier overtakes by demonstrating maximum accuracy in the presence of noise.
文摘In this work,a system for recognition of newspaper printed in Gurumukhi script is presented.Four feature extraction techniques,namely,zoning features,diagonal features,parabola curve fitting based features,and power curve fitting based features are considered for extracting the statistical properties of the characters printed in the newspaper.Different combinations of these features are also applied to improve the recognition accuracy.For recognition,four classification techniques,namely,k-NN,linear-SVM,decision tree,and random forest are used.A database for the experiments is collected from three major Gurumukhi script newspapers which are Ajit,Jagbani and Punjabi Tribune.Using 5-fold cross validation and random forest classifier,a recognition accuracy of 96.19%with a combination of zoning features,diagonal features and parabola curve fitting based features has been reported.A recognition accuracy of 95.21%with a partitioning strategy of data set(70%data as training data and remaining 30%data as testing data)has been achieved.
基金Supported by the Future Network Scientific Research Fund Project of Jiangsu Province (No. FNSRFP2021YB26)the Jiangsu Key R&D Fund on Social Development (No. BE2022789)the Science Foundation of Nanjing Institute of Technology (No. ZKJ202003)。
文摘Facial expression recognition(FER) in video has attracted the increasing interest and many approaches have been made.The crucial problem of classifying a given video sequence into several basic emotions is how to fuse facial features of individual frames.In this paper, a frame-level attention module is integrated into an improved VGG-based frame work and a lightweight facial expression recognition method is proposed.The proposed network takes a sub video cut from an experimental video sequence as its input and generates a fixed-dimension representation.The VGG-based network with an enhanced branch embeds face images into feature vectors.The frame-level attention module learns weights which are used to adaptively aggregate the feature vectors to form a single discriminative video representation.Finally, a regression module outputs the classification results.The experimental results on CK+and AFEW databases show that the recognition rates of the proposed method can achieve the state-of-the-art performance.
基金This research is supported by the faculty of computers and information Technology and the Industrial Innovation and Robotics Center,University of Tabuk.
文摘Biometrics represents the technology for measuring the characteristics of the human body.Biometric authentication currently allows for secure,easy,and fast access by recognizing a person based on facial,voice,and fingerprint traits.Iris authentication is one of the essential biometric methods for identifying a person.This authentication type has become popular in research and practical applications.Unlike the face and hands,the iris is an internal organ,protected and therefore less likely to be damaged.However,the number of helpful information collected from the iris is much greater than the other biometric human organs.This work proposes a new iris identification model based on a multilevel thresholding technique and modified Fuzzy cmeans algorithm.The multilevel thresholding technique extracts the iris from its surroundings,such as specular reflections,eyelashes,pupils,and sclera.On the other hand,the modified Fuzzy c-means is used to combine and classify the most useful statistical features to maximize the accuracy of the collected information.Therefore,having the most optimal iris recognition.The proposed model results are validated using True Success Rate(TSR)and compared to other existing models.The results show how effective the combination of the two stages of the proposed model is:the Otsu method and modified Fuzzy c-means for the 400 tested images representing 40 people.
文摘In this paper, an expert system for security based on biometric human features that can be obtained without any contact with the registering sensor is presented. These features are extracted from human’s voice, so the system is called Voice Recognition System (VRS). The proposed system?consists of a combination of three stages: signal pre-processing, features extraction by using?Wavelet Packet Transform (WPT) and features matching by using Artificial Neural Networks (ANNs). The features vectors are formed after two steps: firstly, decomposing the speech signal at level 7 with Daubechies 20-tap (db20), secondly, the energy corresponding to each WPT node is calculated which collected to form a features vector. One hundred twenty eight features vector for each speaker was fed to the Feed Forward Back-propagation Neural Network (FFBPNN). The data used in this paper are drawn from the English Language Speech Database for Speaker Recognition (ELSDSR) database which composes of audio files for training and other files for testing. The performance of the proposed system is evaluated by using the test files. Our results showed that the rate of correct recognition of the proposed system is about 100% for training files and 95.7% for one testing file for each speaker from the ELSDSR database. The proposed method showed efficiency results were better than the well-known Mel Frequency Cepstral Coefficient (MFCC) and the Zak transform.