Noise reduction analysis of signals is essential for modern underwater acoustic detection systems.The traditional noise reduction techniques gradually lose efficacy because the target signal is masked by biological an...Noise reduction analysis of signals is essential for modern underwater acoustic detection systems.The traditional noise reduction techniques gradually lose efficacy because the target signal is masked by biological and natural noise in the marine environ-ment.The feature extraction method combining time-frequency spectrograms and deep learning can effectively achieve the separation of noise and target signals.A fully convolutional encoder-decoder neural network(FCEDN)is proposed to address the issue of noise reduc-tion in underwater acoustic signals.The time-domain waveform map of underwater acoustic signals is converted into a wavelet low-frequency analysis recording spectrogram during the denoising process to preserve as many underwater acoustic signal characteristics as possible.The FCEDN is built to learn the spectrogram mapping between noise and target signals that can be learned at each time level.The transposed convolution transforms are introduced,which can transform the spectrogram features of the signals into listenable audio files.After evaluating the systems on the ShipsEar Dataset,the proposed method can increase SNR and SI-SNR by 10.02 and 9.5dB,re-spectively.展开更多
Recently, convolutional neural networks (CNNs) have been utilized in medical imaging research field and have successfully shown their ability in image classification and detection. In this paper we used a CNN combined...Recently, convolutional neural networks (CNNs) have been utilized in medical imaging research field and have successfully shown their ability in image classification and detection. In this paper we used a CNN combined with a wavelet transform approach for classifying a dataset of 448 lung CT images into 4 categories, e.g. lung adenocarcinoma, lung squamous cell carcinoma, metastatic lung cancer, and normal. The key difference between the commonly-used CNNs and the presented method is that in this method, we adopt the use of redundant wavelet coefficients at level 1 as inputs to the CNN, instead of using original images. One of the main advantages of the proposed method is that it is not necessary to extract regions of interest from original images. The wavelet coefficients of the entire image are used as inputs to the CNN. We compare the classification performance of the proposed method to that of an existing CNN classifier and a CNN-based support vector machine classifier. The experimental results show that the proposed method outperforms the other two methods and achieve the highest overall accuracy of 91.9%. It demonstrates the potential for use in classification of lung diseases in CT images.展开更多
Computerized tomography (CT) scan is the only screening test recommended by doctors to look for lung cancer. Convolutional neural networks (CNNs) have recently proven their ability to successfully classify medical ima...Computerized tomography (CT) scan is the only screening test recommended by doctors to look for lung cancer. Convolutional neural networks (CNNs) have recently proven their ability to successfully classify medical images. Due to its strong compactness property, the Discrete Wavelet transform (DWT) has been commonly used in image feature extraction applications. This paper presents a novel technique for the classification of Lung cancer in Computerized Tomography (CT) scans using Wavelets to find discriminative features in the CT images and CNN to classify the extracted features. Experimental results prove that the proposed approach outperforms other commonly used methods and gives an overall accuracy of 99.5%.展开更多
The remaining useful life(RUL)estimation of bearings is critical for ensuring the reliability of mechanical systems.Owing to the rapid development of deep learning methods,a multitude of data-driven RUL estimation app...The remaining useful life(RUL)estimation of bearings is critical for ensuring the reliability of mechanical systems.Owing to the rapid development of deep learning methods,a multitude of data-driven RUL estimation approaches have been proposed recently.However,the following problems remain in existing methods:1)Most network models use raw data or statistical features as input,which renders it difficult to extract complex fault-related information hidden in signals;2)for current observations,the dependence between current states is emphasized,but their complex dependence on previous states is often disregarded;3)the output of neural networks is directly used as the estimated RUL in most studies,resulting in extremely volatile prediction results that lack robustness.Hence,a novel prognostics approach is proposed based on a time-frequency representation(TFR)subsequence,three-dimensional convolutional neural network(3DCNN),and Gaussian process regression(GPR).The approach primarily comprises two aspects:construction of a health indicator(HI)using the TFR-subsequence-3DCNN model,and RUL estimation based on the GPR model.The raw signals of the bearings are converted into TFR-subsequences by continuous wavelet transform and a dislocated overlapping strategy.Subsequently,the 3DCNN is applied to extract the hidden spatiotemporal features from the TFR-subsequences and construct HIs.Finally,the RUL of the bearings is estimated using the GPR model,which can also define the probability distribution of the potential function and prediction confidence.Experiments on the PRONOSTIA platform demonstrate the superiority of the proposed TFR-subsequence-3DCNN-GPR approach.The use of degradation-related spatiotemporal features in signals is proposed herein to achieve a highly accurate bearing RUL prediction with uncertainty quantification.展开更多
The El Niño-Southern Oscillation (ENSO) is a significant climate phenomenon with far-reaching impacts on global weather patterns, ecosystems, and economies. This study aims to enhance ENSO forecasting with the Ex...The El Niño-Southern Oscillation (ENSO) is a significant climate phenomenon with far-reaching impacts on global weather patterns, ecosystems, and economies. This study aims to enhance ENSO forecasting with the Extended Reconstruction Sea Surface Temperature v5 (ERSSTv5) climate model. The M-band discrete wavelet transforms (DWT) are utilized to capture multi-scale temporal and spatial features effectively. Long-short term memory (LSTM) autoencoders are also used to capture significant spatial and temporal patterns in sea surface temperature (SST) anomaly data. Deep learning techniques such as the convolutional neural networks (CNN) are used with non-image and image time series data. We also employ parallel computing in a various support vector regression (SVR) approximators to enhance accuracy. Preliminary results indicate that this hybrid model effectively identifies key precursors and patterns associated with El Niño events, surpassing traditional forecasting methods. Results of the hybrid model produce a correlation of 0.93 in 4-month lagged forecasting of the Oceanic Niño Index (ONI)—indicative of high success rate of the model. Future work will focus on evaluating the model’s performance using additional reanalysis datasets and other methods of deep learning to further refine its robustness and applicability. We propose wavelet-based deep learning models which have potential to shine a light on achieving United Nations’ 2030 Agenda for Sustainable Development’s goal 13: “Climate Action”, as an innovation with potential in improving time series image forecasting in all fields.展开更多
Convolutional neural networks(CNNs) have shown great potential for image super-resolution(SR).However,most existing CNNs only reconstruct images in the spatial domain,resulting in insufficient high-frequency details o...Convolutional neural networks(CNNs) have shown great potential for image super-resolution(SR).However,most existing CNNs only reconstruct images in the spatial domain,resulting in insufficient high-frequency details of reconstructed images.To address this issue,a channel attention based wavelet cascaded network for image super-resolution(CWSR) is proposed.Specifically,a second-order channel attention(SOCA) mechanism is incorporated into the network,and the covariance matrix normalization is utilized to explore interdependencies between channel-wise features.Then,to boost the quality of residual features,the non-local module is adopted to further improve the global information integration ability of the network.Finally,taking the image loss in the spatial and wavelet domains into account,a dual-constrained loss function is proposed to optimize the network.Experimental results illustrate that CWSR outperforms several state-of-the-art methods in terms of both visual quality and quantitative metrics.展开更多
Accurate and reliable crack segmentation is a challenge and meaningful task.In this article,aiming at the characteristics of cracks on the concrete images,the intensity frequency information of source images which is ...Accurate and reliable crack segmentation is a challenge and meaningful task.In this article,aiming at the characteristics of cracks on the concrete images,the intensity frequency information of source images which is obtained by Discrete Wavelet Transform(DWT)is fed into deep learning-based networks to enhance the ability of network on crack segmentation.To well integrate frequency information into network an effective and novel DWTA module based on the DWT and scSE attention mechanism is proposed.The semantic information of cracks is enhanced and the irrelevant information is suppressed by DWTA module.And the gap between frequency information and convolution information from network is balanced by DWTA module which can well fuse wavelet information into image segmentation network.The Unet-DWTA is proposed to preserved the information of crack boundary and thin crack in intermediate feature maps by adding DWTA module in the encoderdecoder structures.In decoder,diverse level feature maps are fused to capture the information of crack boundary and the abstract semantic information which is beneficial to crack pixel classification.The proposed method is verified on three classic datasets including CrackDataset,CrackForest,and DeepCrack datasets.Compared with the other crack methods,the proposed Unet-DWTA shows better performance based on the evaluation of the subjective analysis and objective metrics about image semantic segmentation.展开更多
Biometrics,which has become integrated with our daily lives,could fall prey to falsification attacks,leading to security concerns.In our paper,we use Transient Evoked Otoacoustic Emissions(TEOAE)that are generated by ...Biometrics,which has become integrated with our daily lives,could fall prey to falsification attacks,leading to security concerns.In our paper,we use Transient Evoked Otoacoustic Emissions(TEOAE)that are generated by the human cochlea in response to an external sound stimulus,as a biometric modality.TEOAE are robust to falsification attacks,as the uniqueness of an individual’s inner ear cannot be impersonated.In this study,we use both the raw 1D TEOAE signals,as well as the 2D time-frequency representation of the signal using Continuous Wavelet Transform(CWT).We use 1D and 2D Convolutional Neural Networks(CNN)for the former and latter,respectively,to derive the feature maps.The corresponding lower-dimensional feature maps are obtained using principal component analysis,which is then used as features to build classifiers using machine learning techniques for the task of person identification.T-SNE plots of these feature maps show that they discriminate well among the subjects.Among the various architectures explored,we achieve a best-performing accuracy of 98.95%and 100%using the feature maps of the 1D-CNN and 2D-CNN,respectively,with the latter performance being an improvement over all the earlier works.This performance makes the TEOAE based person identification systems deployable in real-world situations,along with the added advantage of robustness to falsification attacks.展开更多
Every year, hurricanes pose a serious threat to coastal communities, and forecasting their maximum intensities has been a crucial task for scientists. Computational methods have been used to forecast the intensities o...Every year, hurricanes pose a serious threat to coastal communities, and forecasting their maximum intensities has been a crucial task for scientists. Computational methods have been used to forecast the intensities of hurricanes across varying time horizons. However, as climate change has increased the volatility of the intensities of recent hurricanes, newer and adaptable methods must be devised. In this study, a framework is proposed to estimate the maximum intensity of tropical cyclones (TCs) in the Atlantic Ocean using a multi-input convolutional neural network (CNN). From the Atlantic hurricane seasons of 2000 through 2021, over 100 TCs that reached hurricane-level wind speeds are used. Novel algorithms are used to collect and preprocess both satellite image data and non-image data for these TCs. Namely, Discrete Wavelet Transforms (DWTs) are used to decompose individual bands of satellite image data, eliminating noise and extracting hidden frequency details before training. Validation tests indicate that this framework can estimate the maximum wind speed of TCs with a root mean square error of 15 knots. This framework provides preliminary predictions that can supplement current computational methods that would otherwise not be able to account for climate change. Future work can be done by forecasting with time constraints, and to provide estimations for more metrics such as pressure and precipitation.展开更多
基金supported by the National Natural Science Foundation of China(No.41906169)the PLA Academy of Military Sciences.
文摘Noise reduction analysis of signals is essential for modern underwater acoustic detection systems.The traditional noise reduction techniques gradually lose efficacy because the target signal is masked by biological and natural noise in the marine environ-ment.The feature extraction method combining time-frequency spectrograms and deep learning can effectively achieve the separation of noise and target signals.A fully convolutional encoder-decoder neural network(FCEDN)is proposed to address the issue of noise reduc-tion in underwater acoustic signals.The time-domain waveform map of underwater acoustic signals is converted into a wavelet low-frequency analysis recording spectrogram during the denoising process to preserve as many underwater acoustic signal characteristics as possible.The FCEDN is built to learn the spectrogram mapping between noise and target signals that can be learned at each time level.The transposed convolution transforms are introduced,which can transform the spectrogram features of the signals into listenable audio files.After evaluating the systems on the ShipsEar Dataset,the proposed method can increase SNR and SI-SNR by 10.02 and 9.5dB,re-spectively.
文摘Recently, convolutional neural networks (CNNs) have been utilized in medical imaging research field and have successfully shown their ability in image classification and detection. In this paper we used a CNN combined with a wavelet transform approach for classifying a dataset of 448 lung CT images into 4 categories, e.g. lung adenocarcinoma, lung squamous cell carcinoma, metastatic lung cancer, and normal. The key difference between the commonly-used CNNs and the presented method is that in this method, we adopt the use of redundant wavelet coefficients at level 1 as inputs to the CNN, instead of using original images. One of the main advantages of the proposed method is that it is not necessary to extract regions of interest from original images. The wavelet coefficients of the entire image are used as inputs to the CNN. We compare the classification performance of the proposed method to that of an existing CNN classifier and a CNN-based support vector machine classifier. The experimental results show that the proposed method outperforms the other two methods and achieve the highest overall accuracy of 91.9%. It demonstrates the potential for use in classification of lung diseases in CT images.
文摘Computerized tomography (CT) scan is the only screening test recommended by doctors to look for lung cancer. Convolutional neural networks (CNNs) have recently proven their ability to successfully classify medical images. Due to its strong compactness property, the Discrete Wavelet transform (DWT) has been commonly used in image feature extraction applications. This paper presents a novel technique for the classification of Lung cancer in Computerized Tomography (CT) scans using Wavelets to find discriminative features in the CT images and CNN to classify the extracted features. Experimental results prove that the proposed approach outperforms other commonly used methods and gives an overall accuracy of 99.5%.
基金Supported by National Key Research and Development Project of China(Grant No.2020YFB2007700)State Key Laboratory of Tribology Initiative Research Program(Grant No.SKLT2020D21)+2 种基金National Natural Science Foundation of China(Grant No.51975309)Shaanxi Provincial Natural Science Foundation of China(Grant No.2019JQ-712)Young Talent Fund of University Association for Science and Technology in Shaanxi(Grant No.20170511).
文摘The remaining useful life(RUL)estimation of bearings is critical for ensuring the reliability of mechanical systems.Owing to the rapid development of deep learning methods,a multitude of data-driven RUL estimation approaches have been proposed recently.However,the following problems remain in existing methods:1)Most network models use raw data or statistical features as input,which renders it difficult to extract complex fault-related information hidden in signals;2)for current observations,the dependence between current states is emphasized,but their complex dependence on previous states is often disregarded;3)the output of neural networks is directly used as the estimated RUL in most studies,resulting in extremely volatile prediction results that lack robustness.Hence,a novel prognostics approach is proposed based on a time-frequency representation(TFR)subsequence,three-dimensional convolutional neural network(3DCNN),and Gaussian process regression(GPR).The approach primarily comprises two aspects:construction of a health indicator(HI)using the TFR-subsequence-3DCNN model,and RUL estimation based on the GPR model.The raw signals of the bearings are converted into TFR-subsequences by continuous wavelet transform and a dislocated overlapping strategy.Subsequently,the 3DCNN is applied to extract the hidden spatiotemporal features from the TFR-subsequences and construct HIs.Finally,the RUL of the bearings is estimated using the GPR model,which can also define the probability distribution of the potential function and prediction confidence.Experiments on the PRONOSTIA platform demonstrate the superiority of the proposed TFR-subsequence-3DCNN-GPR approach.The use of degradation-related spatiotemporal features in signals is proposed herein to achieve a highly accurate bearing RUL prediction with uncertainty quantification.
文摘The El Niño-Southern Oscillation (ENSO) is a significant climate phenomenon with far-reaching impacts on global weather patterns, ecosystems, and economies. This study aims to enhance ENSO forecasting with the Extended Reconstruction Sea Surface Temperature v5 (ERSSTv5) climate model. The M-band discrete wavelet transforms (DWT) are utilized to capture multi-scale temporal and spatial features effectively. Long-short term memory (LSTM) autoencoders are also used to capture significant spatial and temporal patterns in sea surface temperature (SST) anomaly data. Deep learning techniques such as the convolutional neural networks (CNN) are used with non-image and image time series data. We also employ parallel computing in a various support vector regression (SVR) approximators to enhance accuracy. Preliminary results indicate that this hybrid model effectively identifies key precursors and patterns associated with El Niño events, surpassing traditional forecasting methods. Results of the hybrid model produce a correlation of 0.93 in 4-month lagged forecasting of the Oceanic Niño Index (ONI)—indicative of high success rate of the model. Future work will focus on evaluating the model’s performance using additional reanalysis datasets and other methods of deep learning to further refine its robustness and applicability. We propose wavelet-based deep learning models which have potential to shine a light on achieving United Nations’ 2030 Agenda for Sustainable Development’s goal 13: “Climate Action”, as an innovation with potential in improving time series image forecasting in all fields.
基金Supported by the National Natural Science Foundation of China(No.61901183)Fundamental Research Funds for the Central Universities(No.ZQN921)+4 种基金Natural Science Foundation of Fujian Province Science and Technology Department(No.2021H6037)Key Project of Quanzhou Science and Technology Plan(No.2021C008R)Natural Science Foundation of Fujian Province(No.2019J01010561)Education and Scientific Research Project for Young and Middle-aged Teachers of Fujian Province 2019(No.JAT191080)Science and Technology Bureau of Quanzhou(No.2017G046)。
文摘Convolutional neural networks(CNNs) have shown great potential for image super-resolution(SR).However,most existing CNNs only reconstruct images in the spatial domain,resulting in insufficient high-frequency details of reconstructed images.To address this issue,a channel attention based wavelet cascaded network for image super-resolution(CWSR) is proposed.Specifically,a second-order channel attention(SOCA) mechanism is incorporated into the network,and the covariance matrix normalization is utilized to explore interdependencies between channel-wise features.Then,to boost the quality of residual features,the non-local module is adopted to further improve the global information integration ability of the network.Finally,taking the image loss in the spatial and wavelet domains into account,a dual-constrained loss function is proposed to optimize the network.Experimental results illustrate that CWSR outperforms several state-of-the-art methods in terms of both visual quality and quantitative metrics.
基金National Natural Science Foundation of China under Grant 61972267National Natural Science Foundation of Hebei Province under Grant F2018210148University Science Research Project of Hebei Province under Grant ZD2021334。
文摘Accurate and reliable crack segmentation is a challenge and meaningful task.In this article,aiming at the characteristics of cracks on the concrete images,the intensity frequency information of source images which is obtained by Discrete Wavelet Transform(DWT)is fed into deep learning-based networks to enhance the ability of network on crack segmentation.To well integrate frequency information into network an effective and novel DWTA module based on the DWT and scSE attention mechanism is proposed.The semantic information of cracks is enhanced and the irrelevant information is suppressed by DWTA module.And the gap between frequency information and convolution information from network is balanced by DWTA module which can well fuse wavelet information into image segmentation network.The Unet-DWTA is proposed to preserved the information of crack boundary and thin crack in intermediate feature maps by adding DWTA module in the encoderdecoder structures.In decoder,diverse level feature maps are fused to capture the information of crack boundary and the abstract semantic information which is beneficial to crack pixel classification.The proposed method is verified on three classic datasets including CrackDataset,CrackForest,and DeepCrack datasets.Compared with the other crack methods,the proposed Unet-DWTA shows better performance based on the evaluation of the subjective analysis and objective metrics about image semantic segmentation.
基金The authors would like to thank the Biometrics Security Laboratory of the University of Toronto for providing the Transient Evoked Otoacoustic Emissions(TEOAE)dataset.
文摘Biometrics,which has become integrated with our daily lives,could fall prey to falsification attacks,leading to security concerns.In our paper,we use Transient Evoked Otoacoustic Emissions(TEOAE)that are generated by the human cochlea in response to an external sound stimulus,as a biometric modality.TEOAE are robust to falsification attacks,as the uniqueness of an individual’s inner ear cannot be impersonated.In this study,we use both the raw 1D TEOAE signals,as well as the 2D time-frequency representation of the signal using Continuous Wavelet Transform(CWT).We use 1D and 2D Convolutional Neural Networks(CNN)for the former and latter,respectively,to derive the feature maps.The corresponding lower-dimensional feature maps are obtained using principal component analysis,which is then used as features to build classifiers using machine learning techniques for the task of person identification.T-SNE plots of these feature maps show that they discriminate well among the subjects.Among the various architectures explored,we achieve a best-performing accuracy of 98.95%and 100%using the feature maps of the 1D-CNN and 2D-CNN,respectively,with the latter performance being an improvement over all the earlier works.This performance makes the TEOAE based person identification systems deployable in real-world situations,along with the added advantage of robustness to falsification attacks.
文摘Every year, hurricanes pose a serious threat to coastal communities, and forecasting their maximum intensities has been a crucial task for scientists. Computational methods have been used to forecast the intensities of hurricanes across varying time horizons. However, as climate change has increased the volatility of the intensities of recent hurricanes, newer and adaptable methods must be devised. In this study, a framework is proposed to estimate the maximum intensity of tropical cyclones (TCs) in the Atlantic Ocean using a multi-input convolutional neural network (CNN). From the Atlantic hurricane seasons of 2000 through 2021, over 100 TCs that reached hurricane-level wind speeds are used. Novel algorithms are used to collect and preprocess both satellite image data and non-image data for these TCs. Namely, Discrete Wavelet Transforms (DWTs) are used to decompose individual bands of satellite image data, eliminating noise and extracting hidden frequency details before training. Validation tests indicate that this framework can estimate the maximum wind speed of TCs with a root mean square error of 15 knots. This framework provides preliminary predictions that can supplement current computational methods that would otherwise not be able to account for climate change. Future work can be done by forecasting with time constraints, and to provide estimations for more metrics such as pressure and precipitation.