Automatic modulation recognition (AMR) of radiation source signals is a research focus in the field of cognitive radio. However, AMR of radiation source signals at low SNRs remains a great challenge. Therefore, an AMR method for radiation source signals based on a two-dimensional data matrix and an improved residual neural network is proposed in this paper. First, the time series of the radiation source signals are reconstructed into a two-dimensional data matrix, which greatly simplifies signal preprocessing. Second, a residual neural network based on depthwise convolution and large-size convolutional kernels (DLRNet) is proposed to improve the feature extraction capability of the AMR model. Finally, the model performs feature extraction and classification on the two-dimensional data matrix to obtain the recognition vector that represents the signal modulation type. Theoretical analysis and simulation results show that the proposed method can significantly improve AMR accuracy. The recognition accuracy of the proposed method remains above 90% even at -14 dB SNR.
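The abstract above does not say how the time series is rearranged into a two-dimensional data matrix. As a minimal sketch of one plausible reading, the snippet below folds a one-dimensional sample sequence row by row into a matrix. The function name `to_matrix`, the row-wise layout, and the choice to drop leftover samples are all assumptions for illustration, not the authors' procedure.

```python
def to_matrix(samples, n_cols):
    """Fold a 1-D sequence into rows of length n_cols.

    Assumption: samples that do not fill a complete row are dropped.
    """
    if n_cols <= 0:
        raise ValueError("n_cols must be positive")
    n_rows = len(samples) // n_cols
    return [list(samples[r * n_cols:(r + 1) * n_cols]) for r in range(n_rows)]


# Ten samples folded into rows of four: two full rows, two samples dropped.
matrix = to_matrix(list(range(10)), n_cols=4)
```

A matrix of this kind can be fed to a convolutional model as a single-channel image, which is presumably what makes the preprocessing step simple.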
Mandarin Chinese tone patterns fall into one of four categories: (1) high level; (2) rising; (3) low falling and rising; and (4) high falling. The present study examines the efficacy of an artificial neural network in recognizing these tone patterns. Speech data were recorded from 12 children (3-6 years of age) and 15 adults. All subjects were native Mandarin Chinese speakers. The fundamental frequencies (F0) of each monosyllabic word of the speech data were extracted with an autocorrelation method. The pitch data (i.e., the F0 contours) were the inputs to a feed-forward backpropagation artificial neural network. The number of inputs to the neural network varied from 1 to 16, and the number of neurons in the hidden layer also varied from 1 to 16. The output of the network consisted of four neurons representing the four tone patterns of Mandarin Chinese. After being trained with Levenberg-Marquardt optimization, the neural network was able to classify the tone patterns with an accuracy of about 90% for speech samples from both adults and children. The artificial neural network may provide an objective and effective way of assessing tone production in prelingually deafened children who have received cochlear implants.
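The abstract above names autocorrelation as the F0 extraction method without giving details. The following is a minimal sketch of a generic autocorrelation pitch estimator for a single voiced frame. The function name `estimate_f0`, the search range of 75 to 500 Hz, and the absence of any voicing decision or smoothing are assumptions for illustration, not the study's settings.

```python
import math


def estimate_f0(frame, sample_rate, f_min=75.0, f_max=500.0):
    """Estimate F0 of one frame from the lag of the autocorrelation peak."""
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(int(sample_rate / f_min), len(frame) - 1)
    if lag_max < lag_min:
        raise ValueError("frame too short for the requested F0 range")
    best_lag, best_value = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        # Unnormalized autocorrelation at this lag.
        value = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if value > best_value:
            best_lag, best_value = lag, value
    return sample_rate / best_lag


# Synthetic check: a pure 200 Hz tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(800)]
f0 = estimate_f0(tone, sample_rate)
```

Running the estimator over successive frames of a syllable would give an F0 contour, which could then be resampled to the fixed number of network inputs the abstract describes.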
Natural language processing technologies have become more widely available in recent years, making them more useful in everyday situations. Machine learning systems that employ accessible datasets and corporate work to serve the whole spectrum of problems addressed in computational linguistics have lately yielded a number of promising breakthroughs. These methods were particularly advantageous for regional languages, as they were provided with cutting-edge language processing tools as soon as the requisite corporate information was generated. Most people today are unconcerned about the importance of reading. Reading aloud, on the other hand, is an effective technique for nourishing feelings as well as a necessary skill in the learning process. This paper proposes a novel approach for speech recognition based on neural networks. The attention mechanism is first used to determine the speech accuracy and fluency assessments, with the spectrum map as the feature extraction input. To increase phoneme identification accuracy, the reading precision assessment, for example, employs a new type of deep speech. The experimental setting makes use of the exportchapter tool, which provides a corpus, as well as the TensorFlow framework. The experimental findings reveal that the suggested model can assess spoken speech accuracy and reading fluency more effectively than the old model, and that its evaluation model’s scores are more accurate.
A graphics processing unit (GPU)-accelerated biological species recognition method using a partially connected neural evolutionary network model is introduced in this paper. The partially connected neural evolutionary network adopted in the paper can overcome the disadvantage of traditional neural networks, which accept only small inputs. The whole image is used as the input of the neural network, so the maximum number of features is kept for recognition. To speed up the recognition process, a fast implementation of the partially connected neural network was built on an NVIDIA Tesla C1060 using the NVIDIA compute unified device architecture (CUDA) framework. Image sets of eight biological species were obtained to test the GPU implementation and its serial CPU counterpart. The experimental results showed that the GPU implementation works effectively in terms of both recognition rate and speed, and gained a speedup of 343 over the CPU implementation. Compared with a feature-based recognition method on the same task, the method also achieved an acceptable correct rate of 84.6% when tested on the eight biological species.
Underwater target recognition is a key technology for underwater acoustic countermeasures. How to classify and recognize underwater targets from their noise information has been a hot topic in the field of underwater acoustic signals. In this paper, a deep learning model is applied to underwater target recognition. Improved anti-noise Power-Normalized Cepstral Coefficients (ia-PNCC) are proposed, based on PNCC applied to underwater noise. Multitaper and normalized Gammatone filter banks are applied to improve the anti-noise capacity. The method is combined with a convolutional neural network in order to recognize the underwater target. Experimental results show that the acoustic feature produced by ia-PNCC has lower noise and is well suited to underwater target recognition using a convolutional neural network. Compared with the combination of a convolutional neural network with a single acoustic feature, such as MFCC (Mel-scale Frequency Cepstral Coefficients) or LPCC (Linear Prediction Cepstral Coefficients), the combination of ia-PNCC with a convolutional neural network offers better accuracy for underwater target recognition.
Human body posture recognition in wireless body area networks (WBAN) has attracted considerable attention in recent years. Many recognition algorithms have been proposed to recognize human body posture precisely. However, the recognition rate is relatively low. In this paper, we apply a back propagation (BP) neural network as a classifier to recognize human body posture, where signals are collected from a VG350 acceleration sensor and a posture signal collection system based on WBAN is designed. The human body signal vector magnitude (SVM) and tri-axial acceleration sensor data are used to describe the human body postures. We are able to recognize four postures: Walk, Run, Squat and Sit. Our posture recognition rate is up to 91.67%. Furthermore, we find an implied relationship between the number of hidden layer neurons and the posture recognition rate. The proposed human body posture recognition algorithm lays the foundation for subsequent applications.
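The abstract above uses the signal vector magnitude (SVM) of tri-axial acceleration as a feature but does not state its formula. A minimal sketch follows, assuming the common definition as the Euclidean norm of the three axis readings. The function names and the sample values are hypothetical.

```python
import math


def signal_vector_magnitude(ax, ay, az):
    """Euclidean norm of one tri-axial acceleration sample (assumed definition)."""
    return math.sqrt(ax * ax + ay * ay + az * az)


def svm_series(samples):
    """Apply the magnitude to a sequence of (ax, ay, az) tuples."""
    return [signal_vector_magnitude(ax, ay, az) for ax, ay, az in samples]


# Hypothetical readings in units of g: the second sample has magnitude 5.
readings = [(0.0, 0.0, 1.0), (3.0, 4.0, 0.0)]
magnitudes = svm_series(readings)
```

Because the magnitude does not depend on sensor orientation, it complements the raw per-axis values as a classifier input.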
Effective vibration recognition can improve the performance of vibration control and structural damage detection and is in high demand for signal processing and advanced classification. Signal-processing methods can extract the potent time-frequency-domain characteristics of signals; however, the performance of conventional characteristics-based classification needs to be improved. Widely used deep learning algorithms (e.g., convolutional neural networks (CNNs)) can conduct classification by extracting high-dimensional data features, with outstanding performance. Hence, combining the advantages of signal processing and deep-learning algorithms can significantly enhance vibration recognition performance. A novel vibration recognition method based on signal processing and deep neural networks is proposed herein. First, environmental vibration signals are collected; then, signal processing is conducted to obtain the coefficient matrices of the time-frequency-domain characteristics using three typical algorithms: the wavelet transform, the Hilbert-Huang transform, and the Mel frequency cepstral coefficient extraction method. Subsequently, CNNs, long short-term memory (LSTM) networks, and combined deep CNN-LSTM networks are trained for vibration recognition according to the time-frequency-domain characteristics. Finally, the performance of the trained deep neural networks is evaluated and validated. The results confirm the effectiveness of the proposed vibration recognition method combining signal preprocessing and deep learning.
Drill wear not only affects the surface smoothness of the hole, but also influences the life of the drill. Drill wear state recognition is important in the manufacturing process. Here, feature extraction consists of two steps: first, decomposing cutting torque components from the original signals by wavelet packet decomposition (WPD); second, extracting wavelet coefficients of different wear states (i.e., slight, normal, or severe wear) with signal features adapted to the Welch spectrum. Finally, monitoring and recognition of the feature vectors of the cutting torque signal are performed using K-means clustering and a radial basis function neural network (RBFNN). Experiments on multivariable features for different tool wear states show that the monitoring and recognition results are significant and effective.
In this study, we developed a system based on deep space-time neural networks for gesture recognition. When users change or the number of gesture categories increases, the accuracy of gesture recognition decreases considerably because most gesture recognition systems cannot accommodate both user differentiation and gesture diversity. To overcome the limitations of existing methods, we designed a one-dimensional parallel long short-term memory-fully convolutional network (LSTM-FCN) model to extract gesture features of different dimensions. LSTM can learn complex temporal dynamics, whereas FCN can predict gestures efficiently by extracting the deep, abstract features of gestures in the spatial dimension. In the experiment, 50 types of gestures from five users were collected and evaluated. The experimental results demonstrate the effectiveness of this system and its robustness to various gestures and individual changes. Statistical analysis of the recognition results indicated that an average accuracy of approximately 98.9% was achieved.
In many existing multi-view gait recognition methods based on images or video sequences, gait sequences are usually superimposed and synthesized into images to construct an energy-like template. However, information may be lost during image compositing and the capture of EMG signals. Factors such as period detection may also introduce errors and affect recognition accuracy. To better solve these problems, a multi-view gait recognition method using a deep convolutional neural network and a channel attention mechanism is proposed. First, the sliding time window method is used to capture EMG signals. Then, the back-propagation learning algorithm is used to train each convolution layer, which improves the learning ability of the convolutional neural network. Finally, the channel attention mechanism is integrated into the neural network to improve its ability to express gait features, and a classifier is used to classify gait. Experimental results on two public datasets, OULP and CASIA-B, show that the recognition rate of the proposed method reaches 88.44% and 97.25%, respectively. Comparative experiments show that the proposed method recognizes gait better than several other recent convolutional neural network methods. Therefore, the combination of a convolutional neural network and a channel attention mechanism is of great value for gait recognition.
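The sliding time window step mentioned in the abstract above can be sketched as follows. The function name `sliding_windows` and the window and step sizes are hypothetical, since the abstract does not report them. Windows that would run past the end of the recording are skipped in this sketch.

```python
def sliding_windows(samples, window, step):
    """Cut a sequence into fixed-length, possibly overlapping windows."""
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    return [
        list(samples[start:start + window])
        for start in range(0, len(samples) - window + 1, step)
    ]


# Six samples, window of four, step of two: 50% overlap, two windows.
segments = sliding_windows(list(range(6)), window=4, step=2)
```

Each window would then be passed to the network as one training or inference example.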
Plant recognition has great potential in forestry research and management. A new method that combines a back propagation neural network and a radial basis function neural network is proposed to identify tree species using a few features and samples. The process was carried out in three steps: image pretreatment, feature extraction, and leaf recognition. For image pretreatment, an image segmentation method based on the hue, saturation and value color space and connected component labeling was presented, which can obtain the complete leaf image without veins and background. The BP-RBF hybrid neural network was used to test the influence of shape and texture on species recognition. The recognition accuracy of different classifiers was used to compare classification performance. The accuracy of the BP-RBF hybrid neural network using nine-dimensional features was 96.2%, the highest among all the classifiers.
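As a rough illustration of segmentation in the hue, saturation and value color space, the sketch below marks a pixel as leaf when it is sufficiently saturated and bright, which separates a colored leaf from a white or gray background. The thresholds `s_min` and `v_min` and the function names are hypothetical, and the connected component labeling step named in the abstract is not shown.

```python
import colorsys


def is_leaf_pixel(rgb, s_min=0.25, v_min=0.2):
    """Return True if an 8-bit RGB pixel is saturated and bright enough."""
    r, g, b = (channel / 255.0 for channel in rgb)
    _, s, v = colorsys.rgb_to_hsv(r, g, b)
    return s >= s_min and v >= v_min


def leaf_mask(image):
    """Build a boolean mask for an image given as rows of (R, G, B) tuples."""
    return [[is_leaf_pixel(pixel) for pixel in row] for row in image]


# One white background pixel and one green leaf pixel.
mask = leaf_mask([[(255, 255, 255), (40, 160, 60)]])
```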
Speech recognition, or speech to text, includes capturing and digitizing sound waves, transforming them into basic linguistic units or phonemes, constructing words from phonemes, and contextually analyzing the words to ensure the correct spelling of words that sound the same. Approach: This work studies the possibility of designing a software system using neural networks, one of the techniques of artificial intelligence, in which the system is able to distinguish the sound signals and neural networks of irregular users. Fixed weights are first trained on those forms, and the system then gives the matching output for each of these formats at high speed. The proposed neural network study is based on solutions of speech recognition tasks, detecting signals using angular modulation and detection of modulated techniques.
A new concept, the generalized inverse group (GIG) of a signal, is proposed for the first time, and its properties, leaking coefficients and implementation with neural networks are presented. Theoretical analysis and computational simulation have shown that (1) for any given finite signal there is a group of finite-length generalized inverse signals, which forms the GIG; (2) each inverse group has different leaking coefficients, and thus different abnormal states; (3) each GIG can be implemented by a grouped and improved single-layer perceptron, which converges quickly. When used in deconvolution, the proposed GIG yields a new parallel finite-length filtering deconvolution method. In off-line processing, the computational time is reduced from O(N^2) to O(N). The smaller the leaking coefficient, the more reliable the deconvolution.
Handwritten signature and character recognition has become a challenging research topic due to its numerous applications. In this paper, we propose a system that has three subsystems. The three subsystems focus on offline recognition of handwritten English alphabetic characters (uppercase and lowercase), numeric characters (0-9) and individual signatures, respectively. The system includes several stages: image preprocessing, post-processing, segmentation, detection of the required amount of the character and signature, feature extraction and, finally, neural network recognition. First, the scanned image is converted into a gray image and filtered. Then an image cropping method is applied to detect the signature. Accurate recognition is then ensured by post-processing the cropped images. MATLAB was used to design the system. The subsystems were tested on several samples, and the results were found satisfactory, with a success rate of about 97%. The quality of the image plays a vital role, as images of poor or mediocre quality may lead to unsuccessful recognition and verification.
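The grayscale conversion and cropping stages described above can be sketched in a few lines. The original system was built in MATLAB; the Python below is only an illustration, and the luma weights (the widely used 0.299, 0.587, 0.114 combination), the darkness threshold and the function names are assumptions rather than the paper's settings.

```python
def to_gray(rgb_image):
    """Convert rows of (R, G, B) tuples to gray levels with standard luma weights."""
    return [[0.299 * r + 0.587 * g + 0.114 * b for r, g, b in row] for row in rgb_image]


def crop_to_ink(gray, threshold=128):
    """Crop a gray image to the bounding box of pixels darker than threshold."""
    rows = [i for i, row in enumerate(gray) if any(p < threshold for p in row)]
    if not rows:
        return []  # no ink found
    cols = [j for j in range(len(gray[0])) if any(row[j] < threshold for row in gray)]
    return [row[cols[0]:cols[-1] + 1] for row in gray[rows[0]:rows[-1] + 1]]


# A 3x3 white page with a single black pixel in the middle.
page = [[(255, 255, 255)] * 3, [(255, 255, 255), (0, 0, 0), (255, 255, 255)], [(255, 255, 255)] * 3]
cropped = crop_to_ink(to_gray(page))
```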
This paper describes a modified speed-sensorless control for an induction motor (IM) based on space vector pulse width modulation and a neural network. An Elman ANN method to identify the IM speed is proposed, with IM parameters employed as associated elements. The BP algorithm is used to provide an adaptive estimation of the motor speed. The effectiveness of the proposed method is verified by simulation results. An implementation on a TMS320F240 fixed-point DSP is provided.
An improved pulse width modulation (PWM) neural network VLSI circuit for fault diagnosis is presented, which differs from the software-based fault diagnosis approach and exploits the merits of neural network VLSI circuits. A simple synapse multiplier is introduced, which has high precision, a large linear range and fewer switching noise effects. A voltage-mode sigmoid circuit with adjustable gain is introduced to realize different neuron activation functions. A voltage-pulse conversion circuit required for PWM is also introduced, which has high conversion precision and linearity. These three circuits are used to design a PWM VLSI neural network circuit for noise fault diagnosis of a main bearing. It can classify the fault samples directly. After signal processing, feature extraction and neural network computation on the analog noise signals containing fault information, the voltage on each output capacitor of the VLSI circuit is obtained, which represents the Euclidean distance between the corresponding fault signal template and the signal under diagnosis. Real-time online recognition of the noise fault signal can also be realized.
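In the abstract above, each circuit output represents the Euclidean distance between a fault template and the signal under diagnosis. A software analogue of that decision rule, given here only as a sketch, picks the template with the smallest distance. The template names and feature values are hypothetical, and the paper itself realizes the computation in analog VLSI rather than in code.

```python
import math


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def diagnose(signal, templates):
    """Return the name of the template closest to the signal."""
    return min(templates, key=lambda name: euclidean_distance(signal, templates[name]))


# Hypothetical two-dimensional feature templates.
templates = {"normal": [0.0, 0.0], "bearing_fault": [1.0, 1.0]}
label = diagnose([0.9, 0.8], templates)
```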
Photoplethysmography (PPG) biometrics have received considerable attention. Although deep learning has achieved good performance for PPG biometrics, several challenges remain open: 1) how to effectively extract the feature fusion representation from time and frequency PPG signals; 2) how to effectively capture a series of PPG signal transition information; 3) how to extract time-varying information from one-dimensional time-frequency sequential data. To address these challenges, we propose a dual-domain and multiscale fusion deep neural network (DMFDNN) for PPG biometric recognition. The DMFDNN is mainly composed of a two-branch deep learning framework for PPG biometrics, which can learn time-varying and multiscale discriminative features from the time and frequency domains. Meanwhile, we design a multiscale extraction module to capture transition information, which consists of multiple convolution layers with different receptive fields. In addition, a dual-domain attention module is proposed to strengthen whichever of the time-domain and frequency-domain data contributes more to PPG biometrics. Experiments on four datasets demonstrate that DMFDNN outperforms state-of-the-art methods for PPG biometrics.
This paper describes an investigation devoted to establishing suitable weights in a feed-forward neural network that realizes the narrow-band filtering map for adaptive line enhancement (ALE), using the optimum common learning rate back propagation (OCLR BP) algorithm. It is found that a feed-forward network with 64 linear input and output neurons and 8 odd sigmoid neurons in the hidden layer, i.e. a (64→8→64) architecture, could establish the specific input-output function in the case of a relatively low signal-to-noise ratio. Only an input signal consisting of mixed periodic and broad-band components is available to the network system. After learning, both the 'fanning-in-connection patterns', each of which consists of weights fanning into a hidden neuron from all the outputs of input neurons, and the 'fanning-out-connection patterns', each of which consists of weights fanning out from a hidden neuron to all the inputs of output neurons, are tuned to the periodic signals. The nonlinear map formed by this neural network provided substantial improvement in performance over that formed by an Adaline-ALE with the same frequency resolution.
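The (64→8→64) architecture described above can be illustrated by a forward pass with linear input and output neurons and an odd sigmoid in the hidden layer. In this sketch `tanh` stands in for the odd sigmoid, the weights are random rather than trained, biases are omitted, and OCLR BP training is not shown. All of these are assumptions for illustration.

```python
import math
import random


def make_weights(n_out, n_in, rng, scale=0.1):
    """Random weight matrix with n_out rows and n_in columns."""
    return [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]


def forward(x, w_hidden, w_out):
    """64 linear inputs -> 8 tanh hidden neurons -> 64 linear outputs."""
    hidden = [math.tanh(sum(w * v for w, v in zip(row, x))) for row in w_hidden]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w_out]


rng = random.Random(0)
w_hidden = make_weights(8, 64, rng)   # fanning-in weights of each hidden neuron
w_out = make_weights(64, 8, rng)      # fanning-out weights of each hidden neuron
x = [math.sin(2 * math.pi * k / 16) for k in range(64)]
y = forward(x, w_hidden, w_out)
```

Each row of `w_hidden` corresponds to one fanning-in connection pattern, and each column of `w_out` to one fanning-out connection pattern in the abstract's terminology. Without biases the map is odd, so negating the input negates the output.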
Funding (automatic modulation recognition study): National Natural Science Foundation of China under Grant No. 61973037; China Postdoctoral Science Foundation under Grant No. 2022M720419.
Funding (neural-network speech recognition study): Deanship of Scientific Research at Umm Al-Qura University, Grant Code 22UQU4170008DSR06.
Funding (GPU-accelerated species recognition study): National Natural Science Foundation of China (No. 60975084); Natural Science Foundation of Fujian Province, China (No. 2011J05159).
Funding (underwater target recognition study): National Natural Science Foundation of China (Nos. 61772152, 61502037); Basic Research Project (Nos. JCKY2016206B001, JCKY2014206C002, JCKY2017604C010); Technical Foundation Project (No. JSQB2017206C002).
Funding (human body posture recognition study): National Natural Science Foundation of China (Nos. 61074165 and 61273064); Jilin Provincial Science & Technology Department Key Scientific and Technological Project (No. 20140204034GX); Jilin Province Development and Reform Commission Project (No. 2015Y043).
文摘Human body posture recognition has attracted considerable attention in recent years in wireless body area networks(WBAN). In order to precisely recognize human body posture,many recognition algorithms have been proposed.However, the recognition rate is relatively low. In this paper, we apply back propagation(BP) neural network as a classifier to recognizing human body posture, where signals are collected from VG350 acceleration sensor and a posture signal collection system based on WBAN is designed. Human body signal vector magnitude(SVM) and tri-axial acceleration sensor data are used to describe the human body postures. We are able to recognize 4postures: Walk, Run, Squat and Sit. Our posture recognition rate is up to 91.67%. Furthermore, we find an implied relationship between hidden layer neurons and the posture recognition rate. The proposed human body posture recognition algorithm lays the foundation for the subsequent applications.
文摘Effective vibration recognition can improve the performance of vibration control and structural damage detection and is in high demand for signal processing and advanced classification.Signal-processing methods can extract the potent time-frequency-domain characteristics of signals;however,the performance of conventional characteristics-based classification needs to be improved.Widely used deep learning algorithms(e.g.,convolutional neural networks(CNNs))can conduct classification by extracting high-dimensional data features,with outstanding performance.Hence,combining the advantages of signal processing and deep-learning algorithms can significantly enhance vibration recognition performance.A novel vibration recognition method based on signal processing and deep neural networks is proposed herein.First,environmental vibration signals are collected;then,signal processing is conducted to obtain the coefficient matrices of the time-frequency-domain characteristics using three typical algorithms:the wavelet transform,Hilbert-Huang transform,and Mel frequency cepstral coefficient extraction method.Subsequently,CNNs,long short-term memory(LSTM)networks,and combined deep CNN-LSTM networks are trained for vibration recognition,according to the time-frequencydomain characteristics.Finally,the performance of the trained deep neural networks is evaluated and validated.The results confirm the effectiveness of the proposed vibration recognition method combining signal preprocessing and deep learning.
Abstract: Drill wear not only affects the surface smoothness of the hole, but also influences the life of the drill. Drill wear state recognition is important in the manufacturing process and consists of two steps: first, cutting torque components are decomposed from the original signals by wavelet packet decomposition (WPD); second, wavelet coefficients of different wear states (i.e., slight, normal, or severe wear) are extracted, with signal features adapted to the Welch spectrum. Finally, monitoring and recognition of the feature vectors of the cutting torque signal are performed using K-means clustering and a radial basis function neural network (RBFNN). Experiments on the multivariable features of different tool wear states show that the monitoring and recognition results are significant and effective.
Funding: Supported in part by the National Natural Science Foundation of China under Grant 61461013, in part by the Natural Science Foundation of Guangxi Province under Grant 2018GXNSFAA281179, and in part by the Dean Project of Guangxi Key Laboratory of Wireless Broadband Communication and Signal Processing under Grant GXKL06160103.
Abstract: In this study, we developed a system based on deep space–time neural networks for gesture recognition. When users change or the number of gesture categories increases, the accuracy of gesture recognition decreases considerably because most gesture recognition systems cannot accommodate both user differentiation and gesture diversity. To overcome the limitations of existing methods, we designed a one-dimensional parallel long short-term memory–fully convolutional network (LSTM–FCN) model to extract gesture features of different dimensions. LSTM can learn complex time dynamic information, whereas FCN can predict gestures efficiently by extracting the deep, abstract features of gestures in the spatial dimension. In the experiment, 50 types of gestures of five users were collected and evaluated. The experimental results demonstrate the effectiveness of this system and its robustness to various gestures and individual changes. Statistical analysis of the recognition results indicated that an average accuracy of approximately 98.9% was achieved.
Funding: This work was supported by the Natural Science Foundation of China (No. 61902133), the Fujian natural science foundation project (No. 2018J05106), and the Xiamen Collaborative Innovation projects of Produces study grinds (3502Z20173046).
Abstract: In many existing multi-view gait recognition methods based on images or video sequences, gait sequences are usually superimposed to synthesize images and construct energy-like templates. However, information may be lost during image compositing and EMG signal capture, and factors such as period detection may introduce errors and reduce recognition accuracy. To better solve these problems, a multi-view gait recognition method using a deep convolutional neural network and a channel attention mechanism is proposed. First, the sliding time window method is used to capture EMG signals. Then, the back-propagation learning algorithm is used to train each convolution layer, which improves the learning ability of the convolutional neural network. Finally, the channel attention mechanism is integrated into the neural network to improve its ability to express gait features, and a classifier is used to classify gait. Experimental results on two public datasets, OULP and CASIA-B, show that the recognition rate of the proposed method reaches 88.44% and 97.25%, respectively. Comparative experiments show that the proposed method achieves better recognition than several other recent convolutional neural network methods. Therefore, the combination of a convolutional neural network and a channel attention mechanism is of great value for gait recognition.
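The sliding time window step named in the abstract above can be illustrated with a short sketch. This is an assumption-laden illustration rather than the authors' implementation: the function name `sliding_windows`, the window length, and the step size are all hypothetical, and the abstract does not state whether the windows overlap.

```python
def sliding_windows(signal, window_size, step):
    """Split a 1-D sequence into fixed-length windows taken every `step` samples.

    Trailing samples that do not fill a whole window are dropped.
    """
    if window_size <= 0 or step <= 0:
        raise ValueError("window_size and step must be positive")
    last_start = len(signal) - window_size
    return [signal[start:start + window_size]
            for start in range(0, last_start + 1, step)]

# Hypothetical signal of 10 samples, windows of 4 samples with 50% overlap.
samples = list(range(10))
windows = sliding_windows(samples, window_size=4, step=2)
print(windows)  # [[0, 1, 2, 3], [2, 3, 4, 5], [4, 5, 6, 7], [6, 7, 8, 9]]
```

Each window would then be passed to the feature extractor as one input segment. Choosing `step` smaller than `window_size` gives overlapping windows, while setting them equal gives disjoint segments.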
Funding: This work is supported by the Fundamental Research Funds for the Central Universities (No. 2572020BC07) and the Project of National Science Foundation of China (No. 31570712).
Abstract: Plant recognition has great potential in forestry research and management. A new method that combines a back propagation neural network and a radial basis function neural network is proposed to identify tree species using a few features and samples. The process was carried out in three steps: image pretreatment, feature extraction, and leaf recognition. For image pretreatment, an image segmentation method based on the hue, saturation and value color space and connected component labeling was presented, which can obtain the complete leaf image without veins and background. The BP-RBF hybrid neural network was used to test the influence of shape and texture on species recognition. The recognition accuracy of different classifiers was used to compare classification performance. The accuracy of the BP-RBF hybrid neural network using nine-dimensional features was 96.2%, the highest among all the classifiers.
Abstract: Speech recognition, or speech to text, includes capturing and digitizing the sound waves, transforming them into basic linguistic units or phonemes, constructing words from phonemes, and contextually analyzing the words to ensure the correct spelling of words that sound the same. Approach: This work studies the possibility of designing a software system using one of the techniques of artificial intelligence, neural networks, in which the system is able to distinguish the sound signals of irregular users. Fixed weights are trained on those forms first, and then the system gives the matching output for each of these formats at high speed. The proposed neural network study is based on solutions of speech recognition tasks, detecting signals using angular modulation and detection of modulated techniques.
Funding: Supported partly by the Natural Science Foundation of China and the Aviation Science Grant of China.
Abstract: A new concept, the generalized inverse group (GIG) of a signal, is proposed for the first time, and its properties, leaking coefficients and implementation with neural networks are presented. Theoretical analysis and computational simulation have shown that (1) for any given finite signal there is a group of finite-length generalized inverse signals, which forms the GIG; (2) each inverse group has different leaking coefficients and thus different abnormal states; (3) each GIG can be implemented by a grouped and improved single-layer perceptron, which converges fast. When used in deconvolution, the proposed GIG can form a new parallel finite-length filtering deconvolution method. In off-line processing, the computational time is reduced from O(N^2) to O(N). The smaller the leaking coefficient is, the more reliable the deconvolution will be.
Abstract: Handwritten signature and character recognition has become a challenging research topic due to its numerous applications. In this paper, we propose a system that has three subsystems. The three subsystems focus on offline recognition of handwritten English alphabetic characters (uppercase and lowercase), numeric characters (0-9) and individual signatures, respectively. The system includes several stages: image preprocessing, post-processing, segmentation, detection of the required amount of the character and signature, feature extraction and finally neural network recognition. First, the scanned image is converted into a gray image and then filtered. Then an image cropping method is applied to detect the signature. Accurate recognition is then ensured by post-processing the cropped images. MATLAB has been used to design the system. The subsystems are then tested on several samples, and the results are found satisfactory, with a success rate of about 97%. The quality of the image plays a vital role, as images of poor or mediocre quality may lead to unsuccessful recognition and verification.
Funding: This project was supported by the National Natural Science Foundation of China (No. 69874086).
Abstract: This paper describes a modified speed-sensorless control for an induction motor (IM) based on space vector pulse width modulation and a neural network. An Elman ANN method to identify the IM speed is proposed, with IM parameters employed as associated elements. The BP algorithm is used to provide an adaptive estimation of the motor speed. The effectiveness of the proposed method is verified by simulation results. The implementation on a TMS320F240 fixed-point DSP is provided.
Funding: Supported by the National Natural Science Foundation (60274015) and the "863" Program of P. R. China (2002AA412420).
Abstract: An improved pulse width modulation (PWM) neural network VLSI circuit for fault diagnosis is presented, which differs from the software-based fault diagnosis approach and exploits the merits of neural network VLSI circuits. A simple synapse multiplier is introduced, which has high precision, a large linear range and less switching noise. A voltage-mode sigmoid circuit with adjustable gain is introduced to realize different neuron activation functions. A voltage-pulse conversion circuit required for PWM is also introduced, which has high conversion precision and linearity. These three circuits are used to design a PWM VLSI neural network circuit to solve noise fault diagnosis for a main bearing. It can classify the fault samples directly. After signal processing, feature extraction and neural network computation for the analog noise signals that include fault information, the voltage on each output capacitor of the VLSI circuit can be obtained, which represents the Euclidean distance between the corresponding fault signal template and the signal being diagnosed. Real-time online recognition of the noise fault signal can also be realized.
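The abstract above describes outputs that represent the Euclidean distance between each fault signal template and the signal under diagnosis. A software analogue of that idea is sketched below. It is an illustrative assumption, not the circuit described in the paper: the template values, the labels, and the nearest-template decision rule are hypothetical.

```python
import math

def euclidean_distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def classify(feature, templates):
    """Return (label of the nearest template, dict of all distances)."""
    distances = {label: euclidean_distance(feature, template)
                 for label, template in templates.items()}
    return min(distances, key=distances.get), distances

# Hypothetical two-dimensional fault templates (not from the paper).
templates = {"normal": [0.0, 0.0], "fault_a": [3.0, 4.0]}
label, distances = classify([2.5, 3.5], templates)
print(label)  # fault_a
```

In the hardware described by the abstract, each distance would be read out as a capacitor voltage; here the dictionary `distances` plays that role.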
Funding: Supported by the National Nature Science Foundation of China (No. 62276093) and in part by the Natural Science Foundation of Shandong Province, China (No. 2022MF86).
Abstract: Photoplethysmography (PPG) biometrics have received considerable attention. Although deep learning has achieved good performance for PPG biometrics, several challenges remain open: 1) how to effectively extract the feature fusion representation from time and frequency PPG signals; 2) how to effectively capture a series of PPG signal transition information; 3) how to extract time-varying information from one-dimensional time-frequency sequential data. To address these challenges, we propose a dual-domain and multiscale fusion deep neural network (DMFDNN) for PPG biometric recognition. The DMFDNN is mainly composed of a two-branch deep learning framework for PPG biometrics, which can learn the time-varying and multiscale discriminative features from the time and frequency domains. Meanwhile, we design a multiscale extraction module to capture transition information, which consists of multiple convolution layers with different receptive fields for capturing multiscale transition information. In addition, the dual-domain attention module is proposed to strengthen the domain of greater contributions from time-domain and frequency-domain data for PPG biometrics. Experiments on the four datasets demonstrate that DMFDNN outperforms the state-of-the-art methods for PPG biometrics.
Abstract: This paper describes an investigation devoted to establishing suitable weights in a feed-forward neural network that realizes the narrow-band filtering map in the case of adaptive line enhancement (ALE), using the optimum common learning rate back propagation (OCLR BP) algorithm. It is found that a feed-forward network with 64 linear input and output neurons and 8 odd sigmoid neurons in the hidden layer, i.e. a (64→8→64) architecture, could establish the specific input-output function at a relatively low signal-to-noise ratio. Only an input signal consisting of mixed periodic and broad-band components is available to the network system. After learning, both the 'fanning-in-connection patterns', each of which consists of weights fanning into a hidden neuron from all the outputs of the input neurons, and the 'fanning-out-connection patterns', each of which consists of weights fanning out from a hidden neuron to all the inputs of the output neurons, are tuned to the periodic signals. The nonlinear map formed by this neural network provided a substantial improvement in performance over that formed by an Adaline-ALE with the same frequency resolution.
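To make the (64→8→64) architecture in the abstract above concrete, the following sketch runs one forward pass through such a network. Several points are assumptions and are not stated in the abstract: `tanh` stands in for the "odd sigmoid", bias terms are omitted, the weights are random and untrained, and the input signal is synthetic. The OCLR BP training itself is not shown.

```python
import math
import random

N_IN, N_HIDDEN, N_OUT = 64, 8, 64

def init_weights(rows, cols, rng, scale=0.1):
    """Return a rows x cols matrix of small random weights (untrained)."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)]
            for _ in range(rows)]

def forward(x, w_in, w_out):
    """Linear inputs -> 8 tanh hidden neurons -> 64 linear outputs."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in w_in]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w_out]

rng = random.Random(0)
w_in = init_weights(N_HIDDEN, N_IN, rng)    # fan-in weights, one row per hidden neuron
w_out = init_weights(N_OUT, N_HIDDEN, rng)  # fan-out weights, one row per output neuron

# Synthetic input: a sinusoid (periodic part) plus Gaussian noise (broad-band part).
x = [math.sin(2 * math.pi * 4 * n / N_IN) + rng.gauss(0.0, 1.0)
     for n in range(N_IN)]
y = forward(x, w_in, w_out)
print(len(y))  # 64
```

Row `j` of `w_in` corresponds to what the abstract calls a fanning-in-connection pattern for hidden neuron `j`, and column `j` of `w_out` to its fanning-out-connection pattern. According to the abstract, these patterns become tuned to the periodic signal after training.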