A new grey forecasting model based on BP neural network and Markov chain was proposed. In order to combine the grey forecasting model with neural network, an important theorem that the grey differential equation is eq...A new grey forecasting model based on BP neural network and Markov chain was proposed. In order to combine the grey forecasting model with neural network, an important theorem that the grey differential equation is equivalent to the time response model, was proved by analyzing the features of grey forecasting model(GM(1,1)). Based on this, the differential equation parameters were included in the network when the BP neural network was constructed, and the neural network was trained by extracting samples from grey system's known data. When BP network was converged, the whitened grey differential equation parameters were extracted and then the grey neural network forecasting model (GNNM(1,1)) was built. In order to reduce stochastic phenomenon in GNNM(1,1), the state transition probability between two states was defined and the Markov transition matrix was established by building the residual sequences between grey forecasting and actual value. Thus, the new grey forecasting model(MNNGM(1,1)) was proposed by combining Markov chain with GNNM(1,1). Based on the above discussion, three different approaches were put forward for forecasting China electricity demands. By comparing GM(1, 1) and GNNM(1,1) with the proposed model, the results indicate that the absolute mean error of MNNGM(1,1) is about 0.4 times of GNNM(1,1) and 0.2 times of GM(I, 1), and the mean square error of MNNGM(1,1) is about 0.25 times of GNNM(1,1) and 0.1 times of GM(1,1).展开更多
Based on the theory of modal acoustic emission(AE),when the convolutional neural network(CNN)is used to identify rotor rub-impact faults,the training data has a small sample size,and the AE sound segment belongs to a ...Based on the theory of modal acoustic emission(AE),when the convolutional neural network(CNN)is used to identify rotor rub-impact faults,the training data has a small sample size,and the AE sound segment belongs to a single channel signal with less pixel-level information and strong local correlation.Due to the convolutional pooling operations of CNN,coarse-grained and edge information are lost,and the top-level information dimension in CNN network is low,which can easily lead to overfitting.To solve the above problems,we first propose the use of sound spectrograms and their differential features to construct multi-channel image input features suitable for CNN and fully exploit the intrinsic characteristics of the sound spectra.Then,the traditional CNN network structure is improved,and the outputs of all convolutional layers are connected as one layer constitutes a fused feature that contains information at each layer,and is input into the network’s fully connected layer for classification and identification.Experiments indicate that the improved CNN recognition algorithm has significantly improved recognition rate compared with CNN and dynamical neural network(DNN)algorithms.展开更多
With the development of industrial production modernization, FMS and CIMS will become more and more popularized. For its control system is increasingly modeled, intellectualized and automatized, in order to raise the ...With the development of industrial production modernization, FMS and CIMS will become more and more popularized. For its control system is increasingly modeled, intellectualized and automatized, in order to raise the reliability and stability in the manufacturing process, the comprehensive monitoring and diagnosis aimed at cutting tool wear and chatter become more and more important and get rapid development. The paper tried to discuss of the intellectual status identification method based on acoustics-vibra characteristics of machining process, and propose that the working conditions may be taken as a core, complex fuzzy inference neural network model based on artificial neural network theory, and by using various kinds of modernized signal processing method to abstract enough characteristics parameters which will reflect overall processing status from machining acoustics-vibra signal as information source, to identify different working condition, and provide guarantee for automation and intelligence in machining process. The complex network is composed of NNw and NNs, Each of them is composed of BP model network, NNw is weight network at rule condition, NNs is decision-making network of each status. Y out is final inference result which is to take subordinate degree as weight from NNw, to weight reflecting result from NNs and obtain status inference of monitoring system. In the process of machining, the acoustics-vibor signal were gotten by the acoustimeter and the acceleration piezoelectricity detector, the date is analysed by the signal processing software in time and frequency domain, then form multi feature parameter vector of criterion pattern samples for the different stage of cutting chatter and acoustics-vibra multi feature parameter vector. The vector can give a accurate and comprehensive description for the cutting process, and have the characteristic which are speediness of time domain and veracity of frequency domain. The research works have been practically applied in identification of tool wear, cutting chatter, experiment results showed that it is practicable to identify the cutting chatter based on fuzzy neural network, and the new method based on fuzzy neural network can be applied to other state identification in machining process.展开更多
Hornik, Stinchcombe & White have shown that the multilayer feed forward networks with enough hidden layers are universal approximators. Roux & Bengio have proved that adding hidden units yield a strictly impro...Hornik, Stinchcombe & White have shown that the multilayer feed forward networks with enough hidden layers are universal approximators. Roux & Bengio have proved that adding hidden units yield a strictly improved modeling power, and Restricted Boltzmann Machines (RBM) are universal approximators of discrete distributions. In this paper, we provide yet another proof. The advantage of this new proof is that it will lead to several new learning algorithms. We prove that the Deep Neural Networks implement an expansion and the expansion is complete. First, we briefly review the basic Boltzmann Machine and that the invariant distributions of the Boltzmann Machine generate Markov chains. We then review the θ-transformation and its completeness, i.e. any function can be expanded by θ-transformation. We further review ABM (Attrasoft Boltzmann Machine). The invariant distribution of the ABM is a θ-transformation;therefore, an ABM can simulate any distribution. We discuss how to convert an ABM into a Deep Neural Network. Finally, by establishing the equivalence between an ABM and the Deep Neural Network, we prove that the Deep Neural Network is complete.展开更多
In the present study,an expert weed seeds recognition system combining acoustic emissions analysis,Multilayer Feedforward Neural Network(MFNN)classifier was developed and tested for classifying wheat seeds.This experi...In the present study,an expert weed seeds recognition system combining acoustic emissions analysis,Multilayer Feedforward Neural Network(MFNN)classifier was developed and tested for classifying wheat seeds.This experiment was performed for classifying two major important wheat varieties from five species of weed seeds.In order to produce sound signals,a 60o inclined glass plate was used.Fast Fourier Transform(FFT),Phase and Power Spectral Density(PSD)of impact signals were calculated.All features of sound signals are computed via a 1024-point FFT.After feature generation,60%of data sets were used for training,20%for validation,and remaining samples were selected for testing.The optimized MFNN model was found to have 500-12-2 and 500-10-2 architectures for“101”and“Shiroodi”wheat varieties,respectively.The selection of the optimal model was based on the evaluation of mean square error(MSE)and correct separation rate(CSR).The CSR percentages for two wheat varieties were 100%.Considering the overall aspects of the results,it can be stated that the developed system was successful enough to correlate the acoustic features with wheat seed type.展开更多
Randomize-then-optimize (RTO) is widely used for sampling from posterior distributions in Bayesian inverse problems. However, RTO can be computationally intensive forcomplexity problems due to repetitive evaluations o...Randomize-then-optimize (RTO) is widely used for sampling from posterior distributions in Bayesian inverse problems. However, RTO can be computationally intensive forcomplexity problems due to repetitive evaluations of the expensive forward model and itsgradient. In this work, we present a novel goal-oriented deep neural networks (DNN) surrogate approach to substantially reduce the computation burden of RTO. In particular,we propose to drawn the training points for the DNN-surrogate from a local approximatedposterior distribution – yielding a flexible and efficient sampling algorithm that convergesto the direct RTO approach. We present a Bayesian inverse problem governed by ellipticPDEs to demonstrate the computational accuracy and efficiency of our DNN-RTO approach, which shows that DNN-RTO can significantly outperform the traditional RTO.展开更多
Accurate wind speed and consequently wind power forecasts form a critical enabling tool for large scale wind energy adoption.Probabilistic machine learning models such as Bayesian Neural Network(BNN)models are often p...Accurate wind speed and consequently wind power forecasts form a critical enabling tool for large scale wind energy adoption.Probabilistic machine learning models such as Bayesian Neural Network(BNN)models are often preferred in the forecasting task as they facilitate estimates of predictive uncertainty and automatic relevance determination(ARD).Hybrid Monte Carlo(HMC)is widely used to perform asymptotically exact inference of the network parameters.A significant limitation to the increased adoption of HMC in inference for large scale machine learning systems is the exponential degradation of the acceptance rates and the corresponding effective sample sizes with increasing model dimensionality due to numerical integration errors.This paper presents a solution to this problem by sampling from a modified or shadow Hamiltonian that is conserved to a higher-order by the leapfrog integrator.BNNs trained using Separable Shadow Hamiltonian Hybrid Monte Carlo(S2HMC)are used to forecast one hour ahead wind speeds on the Wind Atlas for South Africa(WASA)datasets.Experimental results find that S2HMC yields higher effective sample sizes than the competing HMC.The predictive performance of S2HMC and HMC based BNNs is found to be similar.We further perform hierarchical inference for BNN parameters by combining the S2HMC sampler with Gibbs sampling of hyperparameters for relevance determination.A generalisable ARD committee framework is introduced to synthesise the various sampler ARD outputs into robust feature selections.Experimental results show that this ARD committee approach selects features of high predictive information value.Further,the results show that dimensionality reduction performed through this approach improves the sampling performance of samplers that suffer from random walk behaviour such as Metropolis–Hastings(MH).展开更多
Acoustic signals play an essential role in machine state monitoring. Efficient processing of real-time machine acoustic signals improves production quality. However, generating semantically useful information from sou...Acoustic signals play an essential role in machine state monitoring. Efficient processing of real-time machine acoustic signals improves production quality. However, generating semantically useful information from sound signals is an ill-defined problem that exhibits a highly non-linear relationship between sound and subjective perceptions. This paper outlines two neural network models to analyze and classify acoustic signals emanating from machines:(i) a backpropagation neural network (BPNN);and (ii) a convolutional neural network (CNN). Microphones are used to collect acoustic data for training models from a computer numeric control (CNC) lathe. Numerical experiments demonstrate that CNN performs better than the BP-NN.展开更多
基金Project(70572090) supported by the National Natural Science Foundation of China
文摘A new grey forecasting model based on BP neural network and Markov chain was proposed. In order to combine the grey forecasting model with neural network, an important theorem that the grey differential equation is equivalent to the time response model, was proved by analyzing the features of grey forecasting model(GM(1,1)). Based on this, the differential equation parameters were included in the network when the BP neural network was constructed, and the neural network was trained by extracting samples from grey system's known data. When BP network was converged, the whitened grey differential equation parameters were extracted and then the grey neural network forecasting model (GNNM(1,1)) was built. In order to reduce stochastic phenomenon in GNNM(1,1), the state transition probability between two states was defined and the Markov transition matrix was established by building the residual sequences between grey forecasting and actual value. Thus, the new grey forecasting model(MNNGM(1,1)) was proposed by combining Markov chain with GNNM(1,1). Based on the above discussion, three different approaches were put forward for forecasting China electricity demands. By comparing GM(1, 1) and GNNM(1,1) with the proposed model, the results indicate that the absolute mean error of MNNGM(1,1) is about 0.4 times of GNNM(1,1) and 0.2 times of GM(I, 1), and the mean square error of MNNGM(1,1) is about 0.25 times of GNNM(1,1) and 0.1 times of GM(1,1).
基金The authors would like to acknowledge the Six Talent Peaks Project in Jiangsu Province[XCL-CXTD-007]China Postdoctoral Science Foundation[2018M630559]for their financial support in this project。
文摘Based on the theory of modal acoustic emission(AE),when the convolutional neural network(CNN)is used to identify rotor rub-impact faults,the training data has a small sample size,and the AE sound segment belongs to a single channel signal with less pixel-level information and strong local correlation.Due to the convolutional pooling operations of CNN,coarse-grained and edge information are lost,and the top-level information dimension in CNN network is low,which can easily lead to overfitting.To solve the above problems,we first propose the use of sound spectrograms and their differential features to construct multi-channel image input features suitable for CNN and fully exploit the intrinsic characteristics of the sound spectra.Then,the traditional CNN network structure is improved,and the outputs of all convolutional layers are connected as one layer constitutes a fused feature that contains information at each layer,and is input into the network’s fully connected layer for classification and identification.Experiments indicate that the improved CNN recognition algorithm has significantly improved recognition rate compared with CNN and dynamical neural network(DNN)algorithms.
文摘With the development of industrial production modernization, FMS and CIMS will become more and more popularized. For its control system is increasingly modeled, intellectualized and automatized, in order to raise the reliability and stability in the manufacturing process, the comprehensive monitoring and diagnosis aimed at cutting tool wear and chatter become more and more important and get rapid development. The paper tried to discuss of the intellectual status identification method based on acoustics-vibra characteristics of machining process, and propose that the working conditions may be taken as a core, complex fuzzy inference neural network model based on artificial neural network theory, and by using various kinds of modernized signal processing method to abstract enough characteristics parameters which will reflect overall processing status from machining acoustics-vibra signal as information source, to identify different working condition, and provide guarantee for automation and intelligence in machining process. The complex network is composed of NNw and NNs, Each of them is composed of BP model network, NNw is weight network at rule condition, NNs is decision-making network of each status. Y out is final inference result which is to take subordinate degree as weight from NNw, to weight reflecting result from NNs and obtain status inference of monitoring system. In the process of machining, the acoustics-vibor signal were gotten by the acoustimeter and the acceleration piezoelectricity detector, the date is analysed by the signal processing software in time and frequency domain, then form multi feature parameter vector of criterion pattern samples for the different stage of cutting chatter and acoustics-vibra multi feature parameter vector. The vector can give a accurate and comprehensive description for the cutting process, and have the characteristic which are speediness of time domain and veracity of frequency domain. The research works have been practically applied in identification of tool wear, cutting chatter, experiment results showed that it is practicable to identify the cutting chatter based on fuzzy neural network, and the new method based on fuzzy neural network can be applied to other state identification in machining process.
文摘Hornik, Stinchcombe & White have shown that the multilayer feed forward networks with enough hidden layers are universal approximators. Roux & Bengio have proved that adding hidden units yield a strictly improved modeling power, and Restricted Boltzmann Machines (RBM) are universal approximators of discrete distributions. In this paper, we provide yet another proof. The advantage of this new proof is that it will lead to several new learning algorithms. We prove that the Deep Neural Networks implement an expansion and the expansion is complete. First, we briefly review the basic Boltzmann Machine and that the invariant distributions of the Boltzmann Machine generate Markov chains. We then review the θ-transformation and its completeness, i.e. any function can be expanded by θ-transformation. We further review ABM (Attrasoft Boltzmann Machine). The invariant distribution of the ABM is a θ-transformation;therefore, an ABM can simulate any distribution. We discuss how to convert an ABM into a Deep Neural Network. Finally, by establishing the equivalence between an ABM and the Deep Neural Network, we prove that the Deep Neural Network is complete.
文摘In the present study,an expert weed seeds recognition system combining acoustic emissions analysis,Multilayer Feedforward Neural Network(MFNN)classifier was developed and tested for classifying wheat seeds.This experiment was performed for classifying two major important wheat varieties from five species of weed seeds.In order to produce sound signals,a 60o inclined glass plate was used.Fast Fourier Transform(FFT),Phase and Power Spectral Density(PSD)of impact signals were calculated.All features of sound signals are computed via a 1024-point FFT.After feature generation,60%of data sets were used for training,20%for validation,and remaining samples were selected for testing.The optimized MFNN model was found to have 500-12-2 and 500-10-2 architectures for“101”and“Shiroodi”wheat varieties,respectively.The selection of the optimal model was based on the evaluation of mean square error(MSE)and correct separation rate(CSR).The CSR percentages for two wheat varieties were 100%.Considering the overall aspects of the results,it can be stated that the developed system was successful enough to correlate the acoustic features with wheat seed type.
基金LY’s work was supported by the NSF of China(No.11771081)the science challenge project,China(No.TZ2018001)+4 种基金Zhishan Young Scholar Program of SEU,China.TZ’s work was supported by the National Key R&D Program of China(No.2020YFA0712000)the NSF of China(under grant numbers 11822111,11688101 and 11731006)the science challenge project(No.TZ2018001)the Strategic Priority Research Program of Chinese Academy of Sciences(No.XDA25000404)youth innovation promotion association(CAS),China.
文摘Randomize-then-optimize (RTO) is widely used for sampling from posterior distributions in Bayesian inverse problems. However, RTO can be computationally intensive forcomplexity problems due to repetitive evaluations of the expensive forward model and itsgradient. In this work, we present a novel goal-oriented deep neural networks (DNN) surrogate approach to substantially reduce the computation burden of RTO. In particular,we propose to drawn the training points for the DNN-surrogate from a local approximatedposterior distribution – yielding a flexible and efficient sampling algorithm that convergesto the direct RTO approach. We present a Bayesian inverse problem governed by ellipticPDEs to demonstrate the computational accuracy and efficiency of our DNN-RTO approach, which shows that DNN-RTO can significantly outperform the traditional RTO.
文摘Accurate wind speed and consequently wind power forecasts form a critical enabling tool for large scale wind energy adoption.Probabilistic machine learning models such as Bayesian Neural Network(BNN)models are often preferred in the forecasting task as they facilitate estimates of predictive uncertainty and automatic relevance determination(ARD).Hybrid Monte Carlo(HMC)is widely used to perform asymptotically exact inference of the network parameters.A significant limitation to the increased adoption of HMC in inference for large scale machine learning systems is the exponential degradation of the acceptance rates and the corresponding effective sample sizes with increasing model dimensionality due to numerical integration errors.This paper presents a solution to this problem by sampling from a modified or shadow Hamiltonian that is conserved to a higher-order by the leapfrog integrator.BNNs trained using Separable Shadow Hamiltonian Hybrid Monte Carlo(S2HMC)are used to forecast one hour ahead wind speeds on the Wind Atlas for South Africa(WASA)datasets.Experimental results find that S2HMC yields higher effective sample sizes than the competing HMC.The predictive performance of S2HMC and HMC based BNNs is found to be similar.We further perform hierarchical inference for BNN parameters by combining the S2HMC sampler with Gibbs sampling of hyperparameters for relevance determination.A generalisable ARD committee framework is introduced to synthesise the various sampler ARD outputs into robust feature selections.Experimental results show that this ARD committee approach selects features of high predictive information value.Further,the results show that dimensionality reduction performed through this approach improves the sampling performance of samplers that suffer from random walk behaviour such as Metropolis–Hastings(MH).
文摘Acoustic signals play an essential role in machine state monitoring. Efficient processing of real-time machine acoustic signals improves production quality. However, generating semantically useful information from sound signals is an ill-defined problem that exhibits a highly non-linear relationship between sound and subjective perceptions. This paper outlines two neural network models to analyze and classify acoustic signals emanating from machines:(i) a backpropagation neural network (BPNN);and (ii) a convolutional neural network (CNN). Microphones are used to collect acoustic data for training models from a computer numeric control (CNC) lathe. Numerical experiments demonstrate that CNN performs better than the BP-NN.