To improve the feature extraction of ship-radiated noise in a complex ocean environment,a novel feature extraction method for ship-radiated noise based on complete ensemble empirical mode decomposition with adaptive s...To improve the feature extraction of ship-radiated noise in a complex ocean environment,a novel feature extraction method for ship-radiated noise based on complete ensemble empirical mode decomposition with adaptive selective noise(CEEMDASN) and refined composite multiscale fluctuation-based dispersion entropy(RCMFDE) is proposed.CEEMDASN is proposed in this paper which takes into account the high frequency intermittent components when decomposing the signal.In addition,RCMFDE is also proposed in this paper which refines the preprocessing process of the original signal based on composite multi-scale theory.Firstly,the original signal is decomposed into several intrinsic mode functions(IMFs)by CEEMDASN.Energy distribution ratio(EDR) and average energy distribution ratio(AEDR) of all IMF components are calculated.Then,the IMF with the minimum difference between EDR and AEDR(MEDR)is selected as characteristic IMF.The RCMFDE of characteristic IMF is estimated as the feature vectors of ship-radiated noise.Finally,these feature vectors are sent to self-organizing map(SOM) for classifying and identifying.The proposed method is applied to the feature extraction of ship-radiated noise.The result shows its effectiveness and universality.展开更多
Underwater target recognition is a key technology for underwater acoustic countermeasure.How to classify and recognize underwater targets according to the noise information of underwater targets has been a hot topic i...Underwater target recognition is a key technology for underwater acoustic countermeasure.How to classify and recognize underwater targets according to the noise information of underwater targets has been a hot topic in the field of underwater acoustic signals.In this paper,the deep learning model is applied to underwater target recognition.Improved anti-noise Power-Normalized Cepstral Coefficients(ia-PNCC)is proposed,based on PNCC applied to underwater noises.Multitaper and normalized Gammatone filter banks are applied to improve the anti-noise capacity.The method is combined with a convolutional neural network in order to recognize the underwater target.Experiment results show that the acoustic feature presented by ia-PNCC has lower noise and are wellsuited to underwater target recognition using a convolutional neural network.Compared with the combination of convolutional neural network with single acoustic feature,such as MFCC(Mel-scale Frequency Cepstral Coefficients)or LPCC(Linear Prediction Cepstral Coefficients),the combination of the ia-PNCC with a convolutional neural network offers better accuracy for underwater target recognition.展开更多
The deployment of vehicle micro-motors has witnessed an expansion owing to the progression in electrification and intelligent technologies.However,some micro-motors may exhibit design deficiencies,component wear,assem...The deployment of vehicle micro-motors has witnessed an expansion owing to the progression in electrification and intelligent technologies.However,some micro-motors may exhibit design deficiencies,component wear,assembly errors,and other imperfections that may arise during the design or manufacturing phases.Conse-quently,these micro-motors might generate anomalous noises during their operation,consequently exerting a substantial adverse influence on the overall comfort of drivers and passengers.Automobile micro-motors exhibit a diverse array of structural variations,consequently leading to the manifestation of a multitude of distinctive auditory irregularities.To address the identification of diverse forms of abnormal noise,this research presents a novel approach rooted in the utilization of vibro-acoustic fusion-convolutional neural network(VAF-CNN).This method entails the deployment of distinct network branches,each serving to capture disparate features from the multi-sensor data,all the while considering the auditory perception traits inherent in the human auditory sys-tem.The intermediary layer integrates the concept of adaptive weighting of multi-sensor features,thus affording a calibration mechanism for the features hailing from multiple sensors,thereby enabling a further refinement of features within the branch network.For optimal model efficacy,a feature fusion mechanism is implemented in the concluding layer.To substantiate the efficacy of the proposed approach,this paper initially employs an augmented data methodology inspired by modified SpecAugment,applied to the dataset of abnormal noise sam-ples,encompassing scenarios both with and without in-vehicle interior noise.This serves to mitigate the issue of limited sample availability.Subsequent comparative evaluations are executed,contrasting the performance of the model founded upon single-sensor data against other feature fusion models reliant on multi-sensor data.The experimental results substantiate that the suggested methodology yields heightened recognition accuracy and greater resilience against interference.Moreover,it holds notable practical significance in the engineering domain,as it furnishes valuable support for the targeted management of noise emanating from vehicle micro-motors.展开更多
This paper deals with modulation classification under the alpha-stable noise condition. Our goal is to discriminate orthogonal frequency division multiplexing (OFDM) modulation type from single carrier linear digital ...This paper deals with modulation classification under the alpha-stable noise condition. Our goal is to discriminate orthogonal frequency division multiplexing (OFDM) modulation type from single carrier linear digital (SCLD) modulations in this scenario. Based on the new results concerning the generalized cyclostationarity of these signals in alpha-stable noise which are presented in this paper, we construct new modulation classification features without any priori information of carrier frequency and timing offset of the received signals, and use support vector machine (SVM) as classifier to discriminate OFDM from SCLD. Simulation results show that the recognition accuracy of the proposed algorithm can be up to 95% when the mix signal to noise ratio (MSNR) is up to ?1 dB.展开更多
This pilot study focuses on employment of hybrid LMS-ICA system for in-vehicle background noise reduction.Modern vehicles are nowadays increasingly supporting voice commands,which are one of the pillars of autonomous ...This pilot study focuses on employment of hybrid LMS-ICA system for in-vehicle background noise reduction.Modern vehicles are nowadays increasingly supporting voice commands,which are one of the pillars of autonomous and SMART vehicles.Robust speaker recognition for context-aware in-vehicle applications is limited to a certain extent by in-vehicle back-ground noise.This article presents the new concept of a hybrid system which is implemented as a virtual instrument.The highly modular concept of the virtual car used in combination with real recordings of various driving scenarios enables effective testing of the investigated methods of in-vehicle background noise reduction.The study also presents a unique concept of an adaptive system using intelligent clusters of distributed next generation 5G data networks,which allows the exchange of interference information and/or optimal hybrid algorithm settings between individual vehicles.On average,the unfiltered voice commands were successfully recognized in 29.34%of all scenarios,while the LMS reached up to 71.81%,and LMS-ICA hybrid improved the performance further to 73.03%.展开更多
Refined composite multi-scale dispersion entropy(RCMDE),as a new and effective nonlinear dynamic method,has been applied in the field of medical diagnosis and fault diagnosis.In this paper,we first introduce RCMDE int...Refined composite multi-scale dispersion entropy(RCMDE),as a new and effective nonlinear dynamic method,has been applied in the field of medical diagnosis and fault diagnosis.In this paper,we first introduce RCMDE into the field of underwater acoustic signal processing for complexity feature extraction of ship radiated noise,and then propose a novel classification method for ship-radiated noise based on RCMDE and k-nearest neighbor(KNN),termed RCMDE-KNN.The results of a comparative experiment show that the proposed RCMDE-KNN classification method can effectively extract the complexity features of ship-radiated noise,and has better classification performance under one and two scales than the other three classification methods based on multi-scale permutation entropy(MPE)and KNN,multi-scale weighted-permutation entropy(MW-PE)and KNN,and multi-scale dispersion entropy(MDE)and KNN,termed MPE-KNN,MW-PE-KNN,and MDE-KNN.It is proved that the RCMDE-KNN classification method for ship-radiated noise is feasible and effective,and can obtain a very high recognition rate.展开更多
Wavelet forced de-noising algorithm is suitable for denoising of unsteady drilling fluid pulse signal, including baseline drift rectification and two-stage de-noising processing of frame synchronization signal and ins...Wavelet forced de-noising algorithm is suitable for denoising of unsteady drilling fluid pulse signal, including baseline drift rectification and two-stage de-noising processing of frame synchronization signal and instruction signal. Two-stage de-noising processing can reduce the impact of baseline drift and determine automatic peak detection threshold range for signal recognition by distinguishing different features of frame synchronization pulse and instruction pulse. Rising and falling edge relative protruding threshold is defined for peak detection in signal recognition, which can make full use of the degree of the signal peak change and detect peaks flexibly with rising and falling edge relative protruding threshold combination. A synchronous decoding method was designed to reduce position uncertainty of the frame synchronization pulse and eliminate the accumulative error of time base drift, which determines the first instruction pulse position according to position of the frame synchronization pulse and decodes subsequent instruction pulse by taking current instruction pulse as new bit synchronization pulse. Special tool software was developed to tune algorithm parameters, which has a decoding success rate of about 95% for the universal coded signals. For the special coded signals with check byte, decoding success rate using the automatic threshold adjustment algorithm is as high as 99%.展开更多
Low frequency infrasonic waves are emitted during the formation and movement of debris flows, which are detectable in a radius of several kilometers, thereby to serve as the precondition for their remote monitoring.Ho...Low frequency infrasonic waves are emitted during the formation and movement of debris flows, which are detectable in a radius of several kilometers, thereby to serve as the precondition for their remote monitoring.However, false message often arises from the simple mechanics of alarms under the ambient noise interference.To improve the accuracy of infrasound monitoring for early-warning against debris flows, it is necessary to analyze the monitor information to identify in them the infrasonic signals characteristic of debris flows.Therefore, a large amount of debris flow infrasound and ambient noises have been collected from different sources for analysis to sum up their frequency spectra, sound pressures, waveforms, time duration and other correlated characteristics so as to specify the key characteristic parameters for different sound sources in completing the development of the recognition system of debris flow infrasonic signals for identifying their possible existence in the monitor signals.The recognition performance of the system has been verified by simulating tests and long-term in-situ monitoring of debris flows in Jiangjia Gully,Dongchuan, China to be of high accuracy and applicability.The recognition system can provide the local government and residents with accurate precautionary information about debris flows in preparation for disaster mitigation and minimizing the loss of life and property.展开更多
Underwater acoustic signal processing is one of the research hotspots in underwater acoustics.Noise reduction of underwater acoustic signals is the key to underwater acoustic signal processing.Owing to the complexity ...Underwater acoustic signal processing is one of the research hotspots in underwater acoustics.Noise reduction of underwater acoustic signals is the key to underwater acoustic signal processing.Owing to the complexity of marine environment and the particularity of underwater acoustic channel,noise reduction of underwater acoustic signals has always been a difficult challenge in the field of underwater acoustic signal processing.In order to solve the dilemma,we proposed a novel noise reduction technique for underwater acoustic signals based on complete ensemble empirical mode decomposition with adaptive noise(CEEMDAN),minimum mean square variance criterion(MMSVC) and least mean square adaptive filter(LMSAF).This noise reduction technique,named CEEMDAN-MMSVC-LMSAF,has three main advantages:(i) as an improved algorithm of empirical mode decomposition(EMD) and ensemble EMD(EEMD),CEEMDAN can better suppress mode mixing,and can avoid selecting the number of decomposition in variational mode decomposition(VMD);(ii) MMSVC can identify noisy intrinsic mode function(IMF),and can avoid selecting thresholds of different permutation entropies;(iii) for noise reduction of noisy IMFs,LMSAF overcomes the selection of deco mposition number and basis function for wavelet noise reduction.Firstly,CEEMDAN decomposes the original signal into IMFs,which can be divided into noisy IMFs and real IMFs.Then,MMSVC and LMSAF are used to detect identify noisy IMFs and remove noise components from noisy IMFs.Finally,both denoised noisy IMFs and real IMFs are reconstructed and the final denoised signal is obtained.Compared with other noise reduction techniques,the validity of CEEMDAN-MMSVC-LMSAF can be proved by the analysis of simulation signals and real underwater acoustic signals,which has the better noise reduction effect and has practical application value.CEEMDAN-MMSVC-LMSAF also provides a reliable basis for the detection,feature extraction,classification and recognition of underwater acoustic signals.展开更多
To evaluate the influence of data set noise, the network in network(NIN) model is introduced and the negative effects of different types and proportions of noise on deep convolutional models are studied. Different typ...To evaluate the influence of data set noise, the network in network(NIN) model is introduced and the negative effects of different types and proportions of noise on deep convolutional models are studied. Different types and proportions of data noise are added to two reference data sets, Cifar-10 and Cifar-100. Then, this data containing noise is used to train deep convolutional models and classify the validation data set. The experimental results show that the noise in the data set has obvious adverse effects on deep convolutional network classification models. The adverse effects of random noise are small, but the cross-category noise among categories can significantly reduce the recognition ability of the model. Therefore, a solution is proposed to improve the quality of the data sets that are mixed into a single noise category. The model trained with a data set containing noise is used to evaluate the current training data and reclassify the categories of the anomalies to form a new data set. Repeating the above steps can greatly reduce the noise ratio, so the influence of cross-category noise can be effectively avoided.展开更多
To deal with the nonlinear separable problem, the generalized noise clustering (GNC) algorithm is extended to a kernel generalized noise clustering (KGNC) model. Different from the fuzzy c-means (FCM) model and ...To deal with the nonlinear separable problem, the generalized noise clustering (GNC) algorithm is extended to a kernel generalized noise clustering (KGNC) model. Different from the fuzzy c-means (FCM) model and the GNC model which are based on Euclidean distance, the presented model is based on kernel-induced distance by using kernel method. By kernel method the input data are nonlinearly and implicitly mapped into a high-dimensional feature space, where the nonlinear pattern appears linear and the GNC algorithm is performed. It is unnecessary to calculate in high-dimensional feature space because the kernel function can do it just in input space. The effectiveness of the proposed algorithm is verified by experiments on three data sets. It is concluded that the KGNC algorithm has better clustering accuracy than FCM and GNC in clustering data sets containing noisy data.展开更多
In this paper, a speech signal recovery algorithm is presented for a personalized voice command automatic recognition system in vehicle and restaurant environments. This novel algorithm is able to separate a mixed spe...In this paper, a speech signal recovery algorithm is presented for a personalized voice command automatic recognition system in vehicle and restaurant environments. This novel algorithm is able to separate a mixed speech source from multiple speakers, detect presence/absence of speakers by tracking the higher magnitude portion of speech power spectrum and adaptively suppress noises. An automatic speech recognition (ASR) process to deal with the multi-speaker task is designed and implemented. Evaluation tests have been carried out by using the speech da- tabase NOIZEUS and the experimental results show that the proposed algorithm achieves impressive performance improvements.展开更多
This paper describes a method for reducing sudden noise using noise detection and classification methods, and noise power estimation. Sudden noise detection and classification have been dealt with in our previous stud...This paper describes a method for reducing sudden noise using noise detection and classification methods, and noise power estimation. Sudden noise detection and classification have been dealt with in our previous study. In this paper, GMM-based noise reduction is performed using the detection and classification results. As a result of classification, we can determine the kind of noise we are dealing with, but the power is unknown. In this paper, this problem is solved by combining an estimation of noise power with the noise reduction method. In our experiments, the proposed method achieved good performance for recognition of utterances overlapped by sudden noises.展开更多
Based on the recognition of one-step singular correlation and the remedying methods obtained before,the correlation properties of the neighborhood pixels and the characteristics of image de-noising were analyzed.A kin...Based on the recognition of one-step singular correlation and the remedying methods obtained before,the correlation properties of the neighborhood pixels and the characteristics of image de-noising were analyzed.A kind of most relevant weighted filtering method based on one-step singular correlation recognition(OSSC-MRWF)was put forward.The simulation experiments were done and the comparison with some commonly used methods under salt-and-pepper noises was made.The results show that the proposed method can not only effectively recognize salt-and-pepper noises and mend up the noise points,but also protect the original information such as the edge details very well.The accuracy and performance indicators are further improved considerably.展开更多
Multi-media overcomes the defects of traditional teaching means so foreign language teaching rapidly develops with such technology. It becomes a bottleneck to restrict intelligence learning software development. To so...Multi-media overcomes the defects of traditional teaching means so foreign language teaching rapidly develops with such technology. It becomes a bottleneck to restrict intelligence learning software development. To solve the problem, this paper discuss basic knowledge in speech recognition and studies targeted corpus according to English pronunciation habit of Chinese people. Integrated with oral English learners' requirements with Chinese as native language, this paper applies DTW model-based speech recognition technology for Viterbi decoding speech, then it recognizes and scores through posterior probability. After experiment verification, English pronunciation recognition model in this paper is verified to be reasonable and credible and it can offer learners' timely, accurate and objective evaluation and feedback direction to correct pronunciation errors to improve oral English learning efficiency.展开更多
The structure of any Bangla numerical character is more complex compared to English numerical character. Two pairs of numerical character in Bangla resembles to be closed and they are: “one and nine” and “five and ...The structure of any Bangla numerical character is more complex compared to English numerical character. Two pairs of numerical character in Bangla resembles to be closed and they are: “one and nine” and “five and six”. We found that, handwritten Bangla numerical character cannot be recognized using single machine learning algorithm or discrete wavelet transform (DWT). Above phenomenon motivated us to use combination of DWT, Fuzzy Inference System (FIS) and Principal Component Analysis (PCA) to recognize numerical characters of Bangla in handwritten format. The four lowest spectral components of a preprocessed image are taken using DWT, which is considered as the feature vector to recognize the digits in first phase. The feature vector is then applied to FIS and PCA separately. The combined method provides recognition accuracy of 95.8% whereas application of individual method gives less rate of accuracy. Instead of storing the images itself in a folder, if we can store the feature vector of images achieved from DWT in tabular form. The records of table can be applied in FIS, PCA or other object detection algorithm. Although the technique used in the paper can detect objects with moderate rate of accuracy but can save huge storage against a benchmark database of images. If a tradeoff is made between storage requirements and accuracy of recognition, the model of the paper is preferable compared to other present state-of-art. Another finding of the paper is that, the spectral components of images acquired by DWT only matched with FIS and PCA for classification but do not match properly with unsupervised (K-mean clustering) and supervised (support vector machine) learning.展开更多
In this Letter, a method based on the effects of imperfect oscillators in lasers is proposed to distinguish targets in continuous wave tracking lidar. This technique is based on the fact that each lidar signal source ...In this Letter, a method based on the effects of imperfect oscillators in lasers is proposed to distinguish targets in continuous wave tracking lidar. This technique is based on the fact that each lidar signal source has a specific influence on the phase noise that makes real targets from the false ones. A simulated signal is produced by complex circuits, modulators, memory, and signal oscillators. For example, a deception laser beam has an unequal and variable phase noise from a real target. Thus, the phase noise of transmitted and received signals does not have the same power levels and patterns. To consider the performance of the suggested method, the probability of detection(PD) is shown for various signal-to-noise ratios and signal-to-jammer ratios based on experimental outcomes.展开更多
Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In ...Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In this paper,letter data obtained from images of invoices are denoised using a modified autoencoder based deep learning method.A stacked denoising autoencoder(SDAE)is implemented with two hidden layers each in encoder network and decoder network.In order to capture the most salient features of training samples,a undercomplete autoencoder is designed with non-linear encoder and decoder function.This autoencoder is regularized for denoising application using a combined loss function which considers both mean square error and binary cross entropy.A dataset consisting of 59,119 letter images,which contains both English alphabets(upper and lower case)and numbers(0 to 9)is prepared from many scanned invoices images and windows true type(.ttf)files,are used for training the neural network.Performance is analyzed in terms of Signal to Noise Ratio(SNR),Peak Signal to Noise Ratio(PSNR),Structural Similarity Index(SSIM)and Universal Image Quality Index(UQI)and compared with other filtering techniques like Nonlocal Means filter,Anisotropic diffusion filter,Gaussian filters and Mean filters.Denoising performance of proposed SDAE is compared with existing SDAE with single loss function in terms of SNR and PSNR values.Results show the superior performance of proposed SDAE method.展开更多
The features of the ship noises are analyzed by using the higher-order spectrum (HOS) after studying their distribution. The results show that the different ship noise has different ranges of the main frequency. The m...The features of the ship noises are analyzed by using the higher-order spectrum (HOS) after studying their distribution. The results show that the different ship noise has different ranges of the main frequency. The main frequencies of the first class ships are less than 120 Hz, while the second class ships drop in 130 Hz -- 320 Hz. The different relationship between w1 and w2 corresponds to different bispectrum graph. There are the same results in the trispectrum. The feature vector is consist of the wls which correspond to the maximum bispectrum B(wl, wl) and the maximum trispectrum B(wl, w1,wl) respectively, the al, w2 which correspond to the maximum bispectrum B(wl, w2).展开更多
The recognition rate of the auditory periphery features decreases when the model is used to identify underwater targets in practice. To solve this problem, an improved method based on Gammatone filter bank is proposed...The recognition rate of the auditory periphery features decreases when the model is used to identify underwater targets in practice. To solve this problem, an improved method based on Gammatone filter bank is proposed. Firstly, after the reason of the decreasing of the recognition results is analyzed, the mechanism of multichannel data acquisition in acoustic engineering may narrow down signal frequency range, which leads to time-frequency features distortion. Secondly, the Gammatone filter bank is implemented to simulate frequency decom- position characteristics of human ear basilar membrane. Since the class information of the underwater noise signal is mostly contained in low frequency range, the auditory features of the conventional model are interpolated and the channel number of the filter bank and the central frequency of each frequency band are adjusted accordingly to obtain a 27-dimensional feature vector of the narrow-band target signal. The adjusted model may reflect the target's time- frequency feature more precisely. Finally, the performance of the auditory features is tested by a Neural Network classifier. The experiment results show that the modified auditory model is more effective than the conventional ones. The major information contained in broadband signals is reserved and the classification ability for real targets is further enhanced. The recog- nition results are increased from 82.59% to 88.80%. The modified auditory features effectively improve the recognition rate for underwater target radiated noise signals.展开更多
基金supported by the National Natural Science Foundation of China under Grant 51709228。
文摘To improve the feature extraction of ship-radiated noise in a complex ocean environment,a novel feature extraction method for ship-radiated noise based on complete ensemble empirical mode decomposition with adaptive selective noise(CEEMDASN) and refined composite multiscale fluctuation-based dispersion entropy(RCMFDE) is proposed.CEEMDASN is proposed in this paper which takes into account the high frequency intermittent components when decomposing the signal.In addition,RCMFDE is also proposed in this paper which refines the preprocessing process of the original signal based on composite multi-scale theory.Firstly,the original signal is decomposed into several intrinsic mode functions(IMFs)by CEEMDASN.Energy distribution ratio(EDR) and average energy distribution ratio(AEDR) of all IMF components are calculated.Then,the IMF with the minimum difference between EDR and AEDR(MEDR)is selected as characteristic IMF.The RCMFDE of characteristic IMF is estimated as the feature vectors of ship-radiated noise.Finally,these feature vectors are sent to self-organizing map(SOM) for classifying and identifying.The proposed method is applied to the feature extraction of ship-radiated noise.The result shows its effectiveness and universality.
基金This work was funded by the National Natural Science Foundation of China under Grant(Nos.61772152,61502037)the Basic Research Project(Nos.JCKY2016206B001,JCKY2014206C002,JCKY2017604C010)and the Technical Foundation Project(No.JSQB2017206C002).
文摘Underwater target recognition is a key technology for underwater acoustic countermeasure.How to classify and recognize underwater targets according to the noise information of underwater targets has been a hot topic in the field of underwater acoustic signals.In this paper,the deep learning model is applied to underwater target recognition.Improved anti-noise Power-Normalized Cepstral Coefficients(ia-PNCC)is proposed,based on PNCC applied to underwater noises.Multitaper and normalized Gammatone filter banks are applied to improve the anti-noise capacity.The method is combined with a convolutional neural network in order to recognize the underwater target.Experiment results show that the acoustic feature presented by ia-PNCC has lower noise and are wellsuited to underwater target recognition using a convolutional neural network.Compared with the combination of convolutional neural network with single acoustic feature,such as MFCC(Mel-scale Frequency Cepstral Coefficients)or LPCC(Linear Prediction Cepstral Coefficients),the combination of the ia-PNCC with a convolutional neural network offers better accuracy for underwater target recognition.
基金The author received the funding from Sichuan Natural Science Foundation(2022NSFSC1892).
文摘The deployment of vehicle micro-motors has witnessed an expansion owing to the progression in electrification and intelligent technologies.However,some micro-motors may exhibit design deficiencies,component wear,assembly errors,and other imperfections that may arise during the design or manufacturing phases.Conse-quently,these micro-motors might generate anomalous noises during their operation,consequently exerting a substantial adverse influence on the overall comfort of drivers and passengers.Automobile micro-motors exhibit a diverse array of structural variations,consequently leading to the manifestation of a multitude of distinctive auditory irregularities.To address the identification of diverse forms of abnormal noise,this research presents a novel approach rooted in the utilization of vibro-acoustic fusion-convolutional neural network(VAF-CNN).This method entails the deployment of distinct network branches,each serving to capture disparate features from the multi-sensor data,all the while considering the auditory perception traits inherent in the human auditory sys-tem.The intermediary layer integrates the concept of adaptive weighting of multi-sensor features,thus affording a calibration mechanism for the features hailing from multiple sensors,thereby enabling a further refinement of features within the branch network.For optimal model efficacy,a feature fusion mechanism is implemented in the concluding layer.To substantiate the efficacy of the proposed approach,this paper initially employs an augmented data methodology inspired by modified SpecAugment,applied to the dataset of abnormal noise sam-ples,encompassing scenarios both with and without in-vehicle interior noise.This serves to mitigate the issue of limited sample availability.Subsequent comparative evaluations are executed,contrasting the performance of the model founded upon single-sensor data against other feature fusion models reliant on multi-sensor data.The experimental results substantiate that the suggested methodology yields heightened recognition accuracy and greater resilience against interference.Moreover,it holds notable practical significance in the engineering domain,as it furnishes valuable support for the targeted management of noise emanating from vehicle micro-motors.
文摘This paper deals with modulation classification under the alpha-stable noise condition. Our goal is to discriminate orthogonal frequency division multiplexing (OFDM) modulation type from single carrier linear digital (SCLD) modulations in this scenario. Based on the new results concerning the generalized cyclostationarity of these signals in alpha-stable noise which are presented in this paper, we construct new modulation classification features without any priori information of carrier frequency and timing offset of the received signals, and use support vector machine (SVM) as classifier to discriminate OFDM from SCLD. Simulation results show that the recognition accuracy of the proposed algorithm can be up to 95% when the mix signal to noise ratio (MSNR) is up to ?1 dB.
基金This research was funded by the European Regional Development Fund in the Research Centre of Advanced Mechatronic Systems project, project number CZ.02.1.01/0.0/0.0/16_019 /0000867by the Ministry of Education of the Czech Republic, Project No. SP2021/32.
文摘This pilot study focuses on employment of hybrid LMS-ICA system for in-vehicle background noise reduction.Modern vehicles are nowadays increasingly supporting voice commands,which are one of the pillars of autonomous and SMART vehicles.Robust speaker recognition for context-aware in-vehicle applications is limited to a certain extent by in-vehicle back-ground noise.This article presents the new concept of a hybrid system which is implemented as a virtual instrument.The highly modular concept of the virtual car used in combination with real recordings of various driving scenarios enables effective testing of the investigated methods of in-vehicle background noise reduction.The study also presents a unique concept of an adaptive system using intelligent clusters of distributed next generation 5G data networks,which allows the exchange of interference information and/or optimal hybrid algorithm settings between individual vehicles.On average,the unfiltered voice commands were successfully recognized in 29.34%of all scenarios,while the LMS reached up to 71.81%,and LMS-ICA hybrid improved the performance further to 73.03%.
基金supported by National Natural Science Foundation of China(No.61871318 and 61833013)Shaanxi Provincial Key Research and Development Project(No.2019GY-099).
文摘Refined composite multi-scale dispersion entropy(RCMDE),as a new and effective nonlinear dynamic method,has been applied in the field of medical diagnosis and fault diagnosis.In this paper,we first introduce RCMDE into the field of underwater acoustic signal processing for complexity feature extraction of ship radiated noise,and then propose a novel classification method for ship-radiated noise based on RCMDE and k-nearest neighbor(KNN),termed RCMDE-KNN.The results of a comparative experiment show that the proposed RCMDE-KNN classification method can effectively extract the complexity features of ship-radiated noise,and has better classification performance under one and two scales than the other three classification methods based on multi-scale permutation entropy(MPE)and KNN,multi-scale weighted-permutation entropy(MW-PE)and KNN,and multi-scale dispersion entropy(MDE)and KNN,termed MPE-KNN,MW-PE-KNN,and MDE-KNN.It is proved that the RCMDE-KNN classification method for ship-radiated noise is feasible and effective,and can obtain a very high recognition rate.
基金Supported by the China National Science and Technology Major Project(2016ZX05020005-001)
文摘Wavelet forced de-noising algorithm is suitable for denoising of unsteady drilling fluid pulse signal, including baseline drift rectification and two-stage de-noising processing of frame synchronization signal and instruction signal. Two-stage de-noising processing can reduce the impact of baseline drift and determine automatic peak detection threshold range for signal recognition by distinguishing different features of frame synchronization pulse and instruction pulse. Rising and falling edge relative protruding threshold is defined for peak detection in signal recognition, which can make full use of the degree of the signal peak change and detect peaks flexibly with rising and falling edge relative protruding threshold combination. A synchronous decoding method was designed to reduce position uncertainty of the frame synchronization pulse and eliminate the accumulative error of time base drift, which determines the first instruction pulse position according to position of the frame synchronization pulse and decodes subsequent instruction pulse by taking current instruction pulse as new bit synchronization pulse. Special tool software was developed to tune algorithm parameters, which has a decoding success rate of about 95% for the universal coded signals. For the special coded signals with check byte, decoding success rate using the automatic threshold adjustment algorithm is as high as 99%.
基金supported by the National Science and Technology Support Program(2011BAK12B00)the International Cooperation Project of the Department of Science and Technology of Sichuan Province(2009HH0005)the Project of the Department of Science and Technology of Sichuan Province(2015JY0235)
文摘Low frequency infrasonic waves are emitted during the formation and movement of debris flows, which are detectable in a radius of several kilometers, thereby to serve as the precondition for their remote monitoring.However, false message often arises from the simple mechanics of alarms under the ambient noise interference.To improve the accuracy of infrasound monitoring for early-warning against debris flows, it is necessary to analyze the monitor information to identify in them the infrasonic signals characteristic of debris flows.Therefore, a large amount of debris flow infrasound and ambient noises have been collected from different sources for analysis to sum up their frequency spectra, sound pressures, waveforms, time duration and other correlated characteristics so as to specify the key characteristic parameters for different sound sources in completing the development of the recognition system of debris flow infrasonic signals for identifying their possible existence in the monitor signals.The recognition performance of the system has been verified by simulating tests and long-term in-situ monitoring of debris flows in Jiangjia Gully,Dongchuan, China to be of high accuracy and applicability.The recognition system can provide the local government and residents with accurate precautionary information about debris flows in preparation for disaster mitigation and minimizing the loss of life and property.
基金The authors gratefully acknowledge the support of the National Natural Science Foundation of China(No.11574250).
文摘Underwater acoustic signal processing is one of the research hotspots in underwater acoustics.Noise reduction of underwater acoustic signals is the key to underwater acoustic signal processing.Owing to the complexity of marine environment and the particularity of underwater acoustic channel,noise reduction of underwater acoustic signals has always been a difficult challenge in the field of underwater acoustic signal processing.In order to solve the dilemma,we proposed a novel noise reduction technique for underwater acoustic signals based on complete ensemble empirical mode decomposition with adaptive noise(CEEMDAN),minimum mean square variance criterion(MMSVC) and least mean square adaptive filter(LMSAF).This noise reduction technique,named CEEMDAN-MMSVC-LMSAF,has three main advantages:(i) as an improved algorithm of empirical mode decomposition(EMD) and ensemble EMD(EEMD),CEEMDAN can better suppress mode mixing,and can avoid selecting the number of decomposition in variational mode decomposition(VMD);(ii) MMSVC can identify noisy intrinsic mode function(IMF),and can avoid selecting thresholds of different permutation entropies;(iii) for noise reduction of noisy IMFs,LMSAF overcomes the selection of deco mposition number and basis function for wavelet noise reduction.Firstly,CEEMDAN decomposes the original signal into IMFs,which can be divided into noisy IMFs and real IMFs.Then,MMSVC and LMSAF are used to detect identify noisy IMFs and remove noise components from noisy IMFs.Finally,both denoised noisy IMFs and real IMFs are reconstructed and the final denoised signal is obtained.Compared with other noise reduction techniques,the validity of CEEMDAN-MMSVC-LMSAF can be proved by the analysis of simulation signals and real underwater acoustic signals,which has the better noise reduction effect and has practical application value.CEEMDAN-MMSVC-LMSAF also provides a reliable basis for the detection,feature extraction,classification and recognition of underwater acoustic signals.
基金The Science and Technology R&D Fund Project of Shenzhen(No.JCYJ2017081765149850)
文摘To evaluate the influence of data set noise, the network in network(NIN) model is introduced and the negative effects of different types and proportions of noise on deep convolutional models are studied. Different types and proportions of data noise are added to two reference data sets, Cifar-10 and Cifar-100. Then, this data containing noise is used to train deep convolutional models and classify the validation data set. The experimental results show that the noise in the data set has obvious adverse effects on deep convolutional network classification models. The adverse effects of random noise are small, but the cross-category noise among categories can significantly reduce the recognition ability of the model. Therefore, a solution is proposed to improve the quality of the data sets that are mixed into a single noise category. The model trained with a data set containing noise is used to evaluate the current training data and reclassify the categories of the anomalies to form a new data set. Repeating the above steps can greatly reduce the noise ratio, so the influence of cross-category noise can be effectively avoided.
基金The 15th Plan National Defence Preven-tive Research Project (No.413030201)
文摘To deal with the nonlinear separable problem, the generalized noise clustering (GNC) algorithm is extended to a kernel generalized noise clustering (KGNC) model. Different from the fuzzy c-means (FCM) model and the GNC model which are based on Euclidean distance, the presented model is based on kernel-induced distance by using kernel method. By kernel method the input data are nonlinearly and implicitly mapped into a high-dimensional feature space, where the nonlinear pattern appears linear and the GNC algorithm is performed. It is unnecessary to calculate in high-dimensional feature space because the kernel function can do it just in input space. The effectiveness of the proposed algorithm is verified by experiments on three data sets. It is concluded that the KGNC algorithm has better clustering accuracy than FCM and GNC in clustering data sets containing noisy data.
文摘In this paper, a speech signal recovery algorithm is presented for a personalized voice command automatic recognition system in vehicle and restaurant environments. This novel algorithm is able to separate a mixed speech source from multiple speakers, detect presence/absence of speakers by tracking the higher magnitude portion of speech power spectrum and adaptively suppress noises. An automatic speech recognition (ASR) process to deal with the multi-speaker task is designed and implemented. Evaluation tests have been carried out by using the speech da- tabase NOIZEUS and the experimental results show that the proposed algorithm achieves impressive performance improvements.
文摘This paper describes a method for reducing sudden noise using noise detection and classification methods, and noise power estimation. Sudden noise detection and classification have been dealt with in our previous study. In this paper, GMM-based noise reduction is performed using the detection and classification results. As a result of classification, we can determine the kind of noise we are dealing with, but the power is unknown. In this paper, this problem is solved by combining an estimation of noise power with the noise reduction method. In our experiments, the proposed method achieved good performance for recognition of utterances overlapped by sudden noises.
基金Natural Science Foundation of Shanxi Province,China(No.2008011011)
文摘Based on the recognition of one-step singular correlation and the remedying methods obtained before,the correlation properties of the neighborhood pixels and the characteristics of image de-noising were analyzed.A kind of most relevant weighted filtering method based on one-step singular correlation recognition(OSSC-MRWF)was put forward.The simulation experiments were done and the comparison with some commonly used methods under salt-and-pepper noises was made.The results show that the proposed method can not only effectively recognize salt-and-pepper noises and mend up the noise points,but also protect the original information such as the edge details very well.The accuracy and performance indicators are further improved considerably.
文摘Multi-media overcomes the defects of traditional teaching means so foreign language teaching rapidly develops with such technology. It becomes a bottleneck to restrict intelligence learning software development. To solve the problem, this paper discuss basic knowledge in speech recognition and studies targeted corpus according to English pronunciation habit of Chinese people. Integrated with oral English learners' requirements with Chinese as native language, this paper applies DTW model-based speech recognition technology for Viterbi decoding speech, then it recognizes and scores through posterior probability. After experiment verification, English pronunciation recognition model in this paper is verified to be reasonable and credible and it can offer learners' timely, accurate and objective evaluation and feedback direction to correct pronunciation errors to improve oral English learning efficiency.
文摘The structure of any Bangla numerical character is more complex compared to English numerical character. Two pairs of numerical character in Bangla resembles to be closed and they are: “one and nine” and “five and six”. We found that, handwritten Bangla numerical character cannot be recognized using single machine learning algorithm or discrete wavelet transform (DWT). Above phenomenon motivated us to use combination of DWT, Fuzzy Inference System (FIS) and Principal Component Analysis (PCA) to recognize numerical characters of Bangla in handwritten format. The four lowest spectral components of a preprocessed image are taken using DWT, which is considered as the feature vector to recognize the digits in first phase. The feature vector is then applied to FIS and PCA separately. The combined method provides recognition accuracy of 95.8% whereas application of individual method gives less rate of accuracy. Instead of storing the images itself in a folder, if we can store the feature vector of images achieved from DWT in tabular form. The records of table can be applied in FIS, PCA or other object detection algorithm. Although the technique used in the paper can detect objects with moderate rate of accuracy but can save huge storage against a benchmark database of images. If a tradeoff is made between storage requirements and accuracy of recognition, the model of the paper is preferable compared to other present state-of-art. Another finding of the paper is that, the spectral components of images acquired by DWT only matched with FIS and PCA for classification but do not match properly with unsupervised (K-mean clustering) and supervised (support vector machine) learning.
文摘In this Letter, a method based on the effects of imperfect oscillators in lasers is proposed to distinguish targets in continuous wave tracking lidar. This technique is based on the fact that each lidar signal source has a specific influence on the phase noise that makes real targets from the false ones. A simulated signal is produced by complex circuits, modulators, memory, and signal oscillators. For example, a deception laser beam has an unequal and variable phase noise from a real target. Thus, the phase noise of transmitted and received signals does not have the same power levels and patterns. To consider the performance of the suggested method, the probability of detection(PD) is shown for various signal-to-noise ratios and signal-to-jammer ratios based on experimental outcomes.
文摘Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In this paper,letter data obtained from images of invoices are denoised using a modified autoencoder based deep learning method.A stacked denoising autoencoder(SDAE)is implemented with two hidden layers each in encoder network and decoder network.In order to capture the most salient features of training samples,a undercomplete autoencoder is designed with non-linear encoder and decoder function.This autoencoder is regularized for denoising application using a combined loss function which considers both mean square error and binary cross entropy.A dataset consisting of 59,119 letter images,which contains both English alphabets(upper and lower case)and numbers(0 to 9)is prepared from many scanned invoices images and windows true type(.ttf)files,are used for training the neural network.Performance is analyzed in terms of Signal to Noise Ratio(SNR),Peak Signal to Noise Ratio(PSNR),Structural Similarity Index(SSIM)and Universal Image Quality Index(UQI)and compared with other filtering techniques like Nonlocal Means filter,Anisotropic diffusion filter,Gaussian filters and Mean filters.Denoising performance of proposed SDAE is compared with existing SDAE with single loss function in terms of SNR and PSNR values.Results show the superior performance of proposed SDAE method.
基金The project is supported by National Education Ministry Doctor Foundation of China
文摘The features of the ship noises are analyzed by using the higher-order spectrum (HOS) after studying their distribution. The results show that the different ship noise has different ranges of the main frequency. The main frequencies of the first class ships are less than 120 Hz, while the second class ships drop in 130 Hz -- 320 Hz. The different relationship between w1 and w2 corresponds to different bispectrum graph. There are the same results in the trispectrum. The feature vector is consist of the wls which correspond to the maximum bispectrum B(wl, wl) and the maximum trispectrum B(wl, w1,wl) respectively, the al, w2 which correspond to the maximum bispectrum B(wl, w2).
基金supported by the Chinese Defense Advance Research Program of Basic Science and Technology(51303020307-8,41416040401)
文摘The recognition rate of the auditory periphery features decreases when the model is used to identify underwater targets in practice. To solve this problem, an improved method based on Gammatone filter bank is proposed. Firstly, after the reason of the decreasing of the recognition results is analyzed, the mechanism of multichannel data acquisition in acoustic engineering may narrow down signal frequency range, which leads to time-frequency features distortion. Secondly, the Gammatone filter bank is implemented to simulate frequency decom- position characteristics of human ear basilar membrane. Since the class information of the underwater noise signal is mostly contained in low frequency range, the auditory features of the conventional model are interpolated and the channel number of the filter bank and the central frequency of each frequency band are adjusted accordingly to obtain a 27-dimensional feature vector of the narrow-band target signal. The adjusted model may reflect the target's time- frequency feature more precisely. Finally, the performance of the auditory features is tested by a Neural Network classifier. The experiment results show that the modified auditory model is more effective than the conventional ones. The major information contained in broadband signals is reserved and the classification ability for real targets is further enhanced. The recog- nition results are increased from 82.59% to 88.80%. The modified auditory features effectively improve the recognition rate for underwater target radiated noise signals.