Electrocardiogram(ECG)signal is one of the noninvasive physiological measurement techniques commonly usedin cardiac diagnosis.However,in real scenarios,the ECGsignal is susceptible to various noise erosion,which affec...Electrocardiogram(ECG)signal is one of the noninvasive physiological measurement techniques commonly usedin cardiac diagnosis.However,in real scenarios,the ECGsignal is susceptible to various noise erosion,which affectsthe subsequent pathological analysis.Therefore,the effective removal of the noise from ECG signals has becomea top priority in cardiac diagnostic research.Aiming at the problem of incomplete signal shape retention andlow signal-to-noise ratio(SNR)after denoising,a novel ECG denoising network,named attention-based residualdense shrinkage network(ARDSN),is proposed in this paper.Firstly,the shallow ECG characteristics are extractedby a shallow feature extraction network(SFEN).Then,the residual dense shrinkage attention block(RDSAB)isused for adaptive noise suppression.Finally,feature fusion representation(FFR)is performed on the hierarchicalfeatures extracted by a series of RDSABs to reconstruct the de-noised ECG signal.Experiments on the MIT-BIHarrhythmia database and MIT-BIH noise stress test database indicate that the proposed scheme can effectively resistthe interference of different sources of noise on the ECG signal.展开更多
Masking-based and spectrum mapping-based methods are the two main algorithms of speech enhancement with deep neural network(DNN).But the mapping-based methods only utilizes the phase of noisy speech,which limits the u...Masking-based and spectrum mapping-based methods are the two main algorithms of speech enhancement with deep neural network(DNN).But the mapping-based methods only utilizes the phase of noisy speech,which limits the upper bound of speech enhancement performance.Maskingbased methods need to accurately estimate the masking which is still the key problem.Combining the advantages of above two types of methods,this paper proposes the speech enhancement algorithm MM-RDN(maskingmapping residual dense network)based on masking-mapping(MM)and residual dense network(RDN).Using the logarithmic power spectrogram(LPS)of consecutive frames,MM estimates the ideal ratio masking(IRM)matrix of consecutive frames.RDN can make full use of feature maps of all layers.Meanwhile,using the global residual learning to combine the shallow features and deep features,RDN obtains the global dense features from the LPS,thereby improves estimated accuracy of the IRM matrix.Simulations show that the proposed method achieves attractive speech enhancement performance in various acoustic environments.Specifically,in the untrained acoustic test with limited priors,e.g.,unmatched signal-to-noise ratio(SNR)and unmatched noise category,MM-RDN can still outperform the existing convolutional recurrent network(CRN)method in themeasures of perceptual evaluation of speech quality(PESQ)and other evaluation indexes.It indicates that the proposed algorithm is more generalized in untrained conditions.展开更多
Generative adversarial networks(GANs)are paid more attention to dealing with the end-to-end speech enhancement in recent years.Various GANbased enhancement methods are presented to improve the quality of reconstructed...Generative adversarial networks(GANs)are paid more attention to dealing with the end-to-end speech enhancement in recent years.Various GANbased enhancement methods are presented to improve the quality of reconstructed speech.However,the performance of these GAN-based methods is worse than those of masking-based methods.To tackle this problem,we propose speech enhancement method with a residual dense generative adversarial network(RDGAN)contributing to map the log-power spectrum(LPS)of degraded speech to the clean one.In detail,a residual dense block(RDB)architecture is designed to better estimate the LPS of clean speech,which can extract rich local features of LPS through densely connected convolution layers.Meanwhile,sequential RDB connections are incorporated on various scales of LPS.It significantly increases the feature learning flexibility and robustness in the time-frequency domain.Simulations show that the proposed method achieves attractive speech enhancement performance in various acoustic environments.Specifically,in the untrained acoustic test with limited priors,e.g.,unmatched signal-to-noise ratio(SNR)and unmatched noise category,RDGAN can still outperform the existing GAN-based methods and masking-based method in the measures of PESQ and other evaluation indexes.It indicates that our method is more generalized in untrained conditions.展开更多
Recent applications of convolutional neural networks(CNNs)in single image super-resolution(SISR)have achieved unprecedented performance.However,existing CNN-based SISR network structure design consider mostly only cha...Recent applications of convolutional neural networks(CNNs)in single image super-resolution(SISR)have achieved unprecedented performance.However,existing CNN-based SISR network structure design consider mostly only channel or spatial information,and cannot make full use of both channel and spatial information to improve SISR performance further.The present work addresses this problem by proposing a mixed attention densely residual network architecture that can make full and simultaneous use of both channel and spatial information.Specifically,we propose a residual in dense network structure composed of dense connections between multiple dense residual groups to form a very deep network.This structure allows each dense residual group to apply a local residual skip connection and enables the cascading of multiple residual blocks to reuse previous features.A mixed attention module is inserted into each dense residual group,to enable the algorithm to fuse channel attention with laplacian spatial attention effectively,and thereby more adaptively focus on valuable feature learning.The qualitative and quantitative results of extensive experiments have demonstrate that the proposed method has a comparable performance with other stateof-the-art methods.展开更多
In this study,an underwater image enhancement method based on multi-scale adversarial network was proposed to solve the problem of detail blur and color distortion in underwater images.Firstly,the local features of ea...In this study,an underwater image enhancement method based on multi-scale adversarial network was proposed to solve the problem of detail blur and color distortion in underwater images.Firstly,the local features of each layer were enhanced into the global features by the proposed residual dense block,which ensured that the generated images retain more details.Secondly,a multi-scale structure was adopted to extract multi-scale semantic features of the original images.Finally,the features obtained from the dual channels were fused by an adaptive fusion module to further optimize the features.The discriminant network adopted the structure of the Markov discriminator.In addition,by constructing mean square error,structural similarity,and perceived color loss function,the generated image is consistent with the reference image in structure,color,and content.The experimental results showed that the enhanced underwater image deblurring effect of the proposed algorithm was good and the problem of underwater image color bias was effectively improved.In both subjective and objective evaluation indexes,the experimental results of the proposed algorithm are better than those of the comparison algorithm.展开更多
In the realm of low-level vision tasks,such as image deraining and dehazing,restoring images distorted by adverse weather conditions remains a significant challenge.The emergence of abundant computational resources ha...In the realm of low-level vision tasks,such as image deraining and dehazing,restoring images distorted by adverse weather conditions remains a significant challenge.The emergence of abundant computational resources has driven the dominance of deep Convolutional Neural Networks(CNNs),supplanting traditional methods reliant on prior knowledge.However,the evolution of CNN architectures has tended towards increasing complexity,utilizing intricate structures to enhance performance,often at the expense of computational efficiency.In response,we propose the Selective Kernel Dense Residual M-shaped Network(SKDRMNet),a flexible solution adept at balancing computational efficiency with network accuracy.A key innovation is the incorporation of an M-shaped hierarchical structure,derived from the U-Net framework as M-Network(M-Net),within which the Selective Kernel Dense Residual Module(SDRM)is introduced to reinforce multi-scale semantic feature maps.Our methodology employs two sampling techniques-bilinear and pixel unshuffled and utilizes a multi-scale feature fusion approach to distil more robust spatial feature map information.During the reconstruction phase,feature maps of varying resolutions are seamlessly integrated,and the extracted features are effectively merged using the Selective Kernel Fusion Module(SKFM).Empirical results demonstrate the comprehensive superiority of SKDRMNet across both synthetic and real rain and haze datasets.展开更多
Based on the orthotropic elastic theory of rock masses, the X-ray method was used to measure the distribution of macro-residual strain energy density along a depth profile,using core samples taken from 47 large-apertu...Based on the orthotropic elastic theory of rock masses, the X-ray method was used to measure the distribution of macro-residual strain energy density along a depth profile,using core samples taken from 47 large-aperture deep boreholes in four regions of Southwest China: the Longmenshan, Anninghe, Honghe, and Xianshuihe fault zones.Then, the vertical gradients of the macro-residual strain energy density and the macroresidual strain energy contained in high-energy cuboid block segments along each fault zone were determined. The results demonstrate that the macro-residual strain energy stored at shallow levels in the rock mass in these fault zones may be partly responsible for generating many large earthquakes and may explain why the large earthquakes in this region are typically shallow.展开更多
Many networks are designed to stack a large number of residual blocks,deepen the network and improve network performance through short residual connec-tion,long residual connection,and dense connection.However,without...Many networks are designed to stack a large number of residual blocks,deepen the network and improve network performance through short residual connec-tion,long residual connection,and dense connection.However,without consider-ing different contributions of different depth features to the network,these de-signs have the problem of evaluating the importance of different depth features.To solve this problem,this paper proposes an adaptive densely residual net-work(ADRNet)for the single image super resolution.ADRN realizes the evalua-tion of distributions of different depth features and learns more representative features.An adaptive densely residual block(ADRB)was designed,combining 3 residual blocks(RB)and dense connection was added.It learned the attention score of each dense connection through adaptive dense connections,and the at-tention score reflected the importance of the features of each RB.To further en-hance the performance of ADRB,a multi-direction attention block(MDAB)was introduced to obtain multidirectional context information.Through comparative experiments,it is proved that theproposed ADRNet is superior to the existing methods.Through ablation experiments,it is proved that evaluating features of different depths helps to improve network performance.展开更多
This work focuses on a novel lightweight machine learning approach to the task of plant disease classification,posing as a core component of a larger grow-light smart monitoring system.To the extent of our knowledge,t...This work focuses on a novel lightweight machine learning approach to the task of plant disease classification,posing as a core component of a larger grow-light smart monitoring system.To the extent of our knowledge,this work is the first to implement lightweight convolutional neural network architectures leveraging down-scaled versions of inception blocks,residual connections,and dense residual connections applied without pre-training to the PlantVillage dataset.The novel contributions of this work include the proposal of a smart monitor-ing framework outline;responsible for detection and classification of ailments via the devised lightweight net-works as well as interfacing with LED grow-light fixtures to optimize environmental parameters and lighting control for the growth of plants in a greenhouse system.Lightweight adaptation of dense residual connections achieved the best balance of minimizing model parameters and maximizing performance metrics with accuracy,precision,recall,and F1-scores of 96.75%,97.62%,97.59%,and 97.58%respectively,while consisting of only 228,479 model parameters.These results are further compared against various full-scale state-of-the-art model architectures trained on the PlantVillage dataset,of which the proposed down-scaled lightweight models were capable of performing equally to,if not better than many large-scale counterparts with drastically less com-putational requirements.展开更多
To address the problems of lack of high-frequency information and texture details and unstable training in superresolution generative adversarial net-works,this paper optimizes the generator and discriminator based on...To address the problems of lack of high-frequency information and texture details and unstable training in superresolution generative adversarial net-works,this paper optimizes the generator and discriminator based on the SRGAN model.First,the residual dense block is used as the basic structural unit of the gen-erator to improve the network’s feature extraction capability.Second,enhanced lightweight coordinate attention is incorporated to help the network more precisely concentrate on high-frequency location information,thereby allowing the gener-ator to produce more realistic image reconstruction results.Then,we propose a symmetric and efficient pyramidal segmentation attention discriminator network in which the attention mechanism is capable of derivingfiner-grained multiscale spatial information and creating long-term dependencies between multiscale chan-nel attentions,thus enhancing the discriminative ability of the network.Finally,a Charbonnier loss function and a gradient variance loss function with improved robustness are used to better realize the image’s texture structure and enhance the model’s stability.Thefindings from the experiments reveal that the reconstructed image quality enhances the average peak signal-to-noise ratio(PSNR)by 1.59 dB and the structural similarity index(SSIM)by 0.045 when compared to SRGAN on the three test sets.Compared with the state-of-the-art methods,the reconstructed images have a clearer texture structure,richer high-frequency details,and better visual effects.展开更多
Complex nature of underwater environment poses biggest challenge towards image acquisition and transmission of underwater images.This paper proposes an integrated approach which consists of a non-learning enhancement ...Complex nature of underwater environment poses biggest challenge towards image acquisition and transmission of underwater images.This paper proposes an integrated approach which consists of a non-learning enhancement method with deep Convolutional Neural Networks(CNN)for compression and reconstruction of the image.The proposed method does color and contrast correction for image enhancement.The enhanced images are down-sampled using 9-layer CNN followed by Discrete Wavelet Transform(DWT).The decompression is done by using Inverse DWT.Further,the sub-pixel up-sampled image is de-blurred using a three-layer CNN.Residual Dense CNN(RD-CNN)is used to improve the quality of the reconstructed image after deblurring.The quality of the reconstructed images is measured using Peak Signal to Noise Ratio(PSNR)and Structural Similarity Index Metric(SSIM).The proposed model provides better image enhancement,compression,and reconstruction quality than the existing state-of-the-art methods and Super Resolution CNN(SRCNN)respectively.展开更多
The superresolution(SR)method based on generative adversarial networks(GANs)cannot adequately capture enough diversity from training data,resulting in misalignment between input low resolution(LR)images and output hig...The superresolution(SR)method based on generative adversarial networks(GANs)cannot adequately capture enough diversity from training data,resulting in misalignment between input low resolution(LR)images and output high resolution(HR)images.GAN training has difficulty converging.Based on this,an advanced GAN-based image SR reconstructionmethod is presented.First,the dense connection residual block and attention mechanism are integrated into the GAN generator to improve high-frequency feature extraction.Meanwhile,an added discriminator is added into the GAN discriminant network,which forms a dual discriminator to ensure that the process of training is stable.Second,the more robust Charbonnier loss is used instead of the mean square error(MSE)loss to compare similarities between the obtained image and actual image,and the total variation(TV)loss is employed to smooth the training results.Finally,the experimental results indicate that global structures can be better reconstructed using the method of this paper and texture details of images compared with other SOTA methods.The peak signal-to-noise ratio(PSNR)values by the method of this paper are improved by an average of 2.24 dB,and the structural similarity index measure(SSIM)values are improved by an average of 0.07.展开更多
基金the National Natural Science Foundation of China under Grant 62172059 and 62072055Hunan Provincial Natural Science Foundations of China under Grant 2022JJ50318 and 2022JJ30621Scientific Research Fund of Hunan Provincial Education Department of China under Grant 22A0200 and 20K098。
文摘Electrocardiogram(ECG)signal is one of the noninvasive physiological measurement techniques commonly usedin cardiac diagnosis.However,in real scenarios,the ECGsignal is susceptible to various noise erosion,which affectsthe subsequent pathological analysis.Therefore,the effective removal of the noise from ECG signals has becomea top priority in cardiac diagnostic research.Aiming at the problem of incomplete signal shape retention andlow signal-to-noise ratio(SNR)after denoising,a novel ECG denoising network,named attention-based residualdense shrinkage network(ARDSN),is proposed in this paper.Firstly,the shallow ECG characteristics are extractedby a shallow feature extraction network(SFEN).Then,the residual dense shrinkage attention block(RDSAB)isused for adaptive noise suppression.Finally,feature fusion representation(FFR)is performed on the hierarchicalfeatures extracted by a series of RDSABs to reconstruct the de-noised ECG signal.Experiments on the MIT-BIHarrhythmia database and MIT-BIH noise stress test database indicate that the proposed scheme can effectively resistthe interference of different sources of noise on the ECG signal.
基金supported by the National Key Research and Development Program of China under Grant 2020YFC2004003 and Grant 2020YFC2004002the National Nature Science Foundation of China(NSFC)under Grant No.61571106.
文摘Masking-based and spectrum mapping-based methods are the two main algorithms of speech enhancement with deep neural network(DNN).But the mapping-based methods only utilizes the phase of noisy speech,which limits the upper bound of speech enhancement performance.Maskingbased methods need to accurately estimate the masking which is still the key problem.Combining the advantages of above two types of methods,this paper proposes the speech enhancement algorithm MM-RDN(maskingmapping residual dense network)based on masking-mapping(MM)and residual dense network(RDN).Using the logarithmic power spectrogram(LPS)of consecutive frames,MM estimates the ideal ratio masking(IRM)matrix of consecutive frames.RDN can make full use of feature maps of all layers.Meanwhile,using the global residual learning to combine the shallow features and deep features,RDN obtains the global dense features from the LPS,thereby improves estimated accuracy of the IRM matrix.Simulations show that the proposed method achieves attractive speech enhancement performance in various acoustic environments.Specifically,in the untrained acoustic test with limited priors,e.g.,unmatched signal-to-noise ratio(SNR)and unmatched noise category,MM-RDN can still outperform the existing convolutional recurrent network(CRN)method in themeasures of perceptual evaluation of speech quality(PESQ)and other evaluation indexes.It indicates that the proposed algorithm is more generalized in untrained conditions.
基金This work is supported by the National Key Research and Development Program of China under Grant 2020YFC2004003 and Grant 2020YFC2004002the National Nature Science Foundation of China(NSFC)under Grant No.61571106。
文摘Generative adversarial networks(GANs)are paid more attention to dealing with the end-to-end speech enhancement in recent years.Various GANbased enhancement methods are presented to improve the quality of reconstructed speech.However,the performance of these GAN-based methods is worse than those of masking-based methods.To tackle this problem,we propose speech enhancement method with a residual dense generative adversarial network(RDGAN)contributing to map the log-power spectrum(LPS)of degraded speech to the clean one.In detail,a residual dense block(RDB)architecture is designed to better estimate the LPS of clean speech,which can extract rich local features of LPS through densely connected convolution layers.Meanwhile,sequential RDB connections are incorporated on various scales of LPS.It significantly increases the feature learning flexibility and robustness in the time-frequency domain.Simulations show that the proposed method achieves attractive speech enhancement performance in various acoustic environments.Specifically,in the untrained acoustic test with limited priors,e.g.,unmatched signal-to-noise ratio(SNR)and unmatched noise category,RDGAN can still outperform the existing GAN-based methods and masking-based method in the measures of PESQ and other evaluation indexes.It indicates that our method is more generalized in untrained conditions.
基金This work was supported in part by the Natural Science Foundation of China under Grant 62063004 and 61762033in part by the Hainan Provincial Natural Science Foundation of China under Grant 2019RC018 and 619QN246by the Postdoctoral Science Foundation under Grant 2020TQ0293.
文摘Recent applications of convolutional neural networks(CNNs)in single image super-resolution(SISR)have achieved unprecedented performance.However,existing CNN-based SISR network structure design consider mostly only channel or spatial information,and cannot make full use of both channel and spatial information to improve SISR performance further.The present work addresses this problem by proposing a mixed attention densely residual network architecture that can make full and simultaneous use of both channel and spatial information.Specifically,we propose a residual in dense network structure composed of dense connections between multiple dense residual groups to form a very deep network.This structure allows each dense residual group to apply a local residual skip connection and enables the cascading of multiple residual blocks to reuse previous features.A mixed attention module is inserted into each dense residual group,to enable the algorithm to fuse channel attention with laplacian spatial attention effectively,and thereby more adaptively focus on valuable feature learning.The qualitative and quantitative results of extensive experiments have demonstrate that the proposed method has a comparable performance with other stateof-the-art methods.
文摘In this study,an underwater image enhancement method based on multi-scale adversarial network was proposed to solve the problem of detail blur and color distortion in underwater images.Firstly,the local features of each layer were enhanced into the global features by the proposed residual dense block,which ensured that the generated images retain more details.Secondly,a multi-scale structure was adopted to extract multi-scale semantic features of the original images.Finally,the features obtained from the dual channels were fused by an adaptive fusion module to further optimize the features.The discriminant network adopted the structure of the Markov discriminator.In addition,by constructing mean square error,structural similarity,and perceived color loss function,the generated image is consistent with the reference image in structure,color,and content.The experimental results showed that the enhanced underwater image deblurring effect of the proposed algorithm was good and the problem of underwater image color bias was effectively improved.In both subjective and objective evaluation indexes,the experimental results of the proposed algorithm are better than those of the comparison algorithm.
文摘In the realm of low-level vision tasks,such as image deraining and dehazing,restoring images distorted by adverse weather conditions remains a significant challenge.The emergence of abundant computational resources has driven the dominance of deep Convolutional Neural Networks(CNNs),supplanting traditional methods reliant on prior knowledge.However,the evolution of CNN architectures has tended towards increasing complexity,utilizing intricate structures to enhance performance,often at the expense of computational efficiency.In response,we propose the Selective Kernel Dense Residual M-shaped Network(SKDRMNet),a flexible solution adept at balancing computational efficiency with network accuracy.A key innovation is the incorporation of an M-shaped hierarchical structure,derived from the U-Net framework as M-Network(M-Net),within which the Selective Kernel Dense Residual Module(SDRM)is introduced to reinforce multi-scale semantic feature maps.Our methodology employs two sampling techniques-bilinear and pixel unshuffled and utilizes a multi-scale feature fusion approach to distil more robust spatial feature map information.During the reconstruction phase,feature maps of varying resolutions are seamlessly integrated,and the extracted features are effectively merged using the Selective Kernel Fusion Module(SKFM).Empirical results demonstrate the comprehensive superiority of SKDRMNet across both synthetic and real rain and haze datasets.
基金supported by the Joint Seismology Science Foundation(85012,850708,863017,88138,91046)Old Professor Science Foundation(201041)
文摘Based on the orthotropic elastic theory of rock masses, the X-ray method was used to measure the distribution of macro-residual strain energy density along a depth profile,using core samples taken from 47 large-aperture deep boreholes in four regions of Southwest China: the Longmenshan, Anninghe, Honghe, and Xianshuihe fault zones.Then, the vertical gradients of the macro-residual strain energy density and the macroresidual strain energy contained in high-energy cuboid block segments along each fault zone were determined. The results demonstrate that the macro-residual strain energy stored at shallow levels in the rock mass in these fault zones may be partly responsible for generating many large earthquakes and may explain why the large earthquakes in this region are typically shallow.
文摘Many networks are designed to stack a large number of residual blocks,deepen the network and improve network performance through short residual connec-tion,long residual connection,and dense connection.However,without consider-ing different contributions of different depth features to the network,these de-signs have the problem of evaluating the importance of different depth features.To solve this problem,this paper proposes an adaptive densely residual net-work(ADRNet)for the single image super resolution.ADRN realizes the evalua-tion of distributions of different depth features and learns more representative features.An adaptive densely residual block(ADRB)was designed,combining 3 residual blocks(RB)and dense connection was added.It learned the attention score of each dense connection through adaptive dense connections,and the at-tention score reflected the importance of the features of each RB.To further en-hance the performance of ADRB,a multi-direction attention block(MDAB)was introduced to obtain multidirectional context information.Through comparative experiments,it is proved that theproposed ADRNet is superior to the existing methods.Through ablation experiments,it is proved that evaluating features of different depths helps to improve network performance.
文摘This work focuses on a novel lightweight machine learning approach to the task of plant disease classification,posing as a core component of a larger grow-light smart monitoring system.To the extent of our knowledge,this work is the first to implement lightweight convolutional neural network architectures leveraging down-scaled versions of inception blocks,residual connections,and dense residual connections applied without pre-training to the PlantVillage dataset.The novel contributions of this work include the proposal of a smart monitor-ing framework outline;responsible for detection and classification of ailments via the devised lightweight net-works as well as interfacing with LED grow-light fixtures to optimize environmental parameters and lighting control for the growth of plants in a greenhouse system.Lightweight adaptation of dense residual connections achieved the best balance of minimizing model parameters and maximizing performance metrics with accuracy,precision,recall,and F1-scores of 96.75%,97.62%,97.59%,and 97.58%respectively,while consisting of only 228,479 model parameters.These results are further compared against various full-scale state-of-the-art model architectures trained on the PlantVillage dataset,of which the proposed down-scaled lightweight models were capable of performing equally to,if not better than many large-scale counterparts with drastically less com-putational requirements.
基金This work was supported in part by the Basic Scientific Research Project of Liaoning Provincial Department of Education under Grant Nos.LJKQZ2021152 and LJ2020JCL007in part by the National Science Foundation of China(NSFC)under Grant No.61602226in part by the PhD Startup Foundation of Liaoning Technical University of China under Grant Nos.18-1021.
文摘To address the problems of lack of high-frequency information and texture details and unstable training in superresolution generative adversarial net-works,this paper optimizes the generator and discriminator based on the SRGAN model.First,the residual dense block is used as the basic structural unit of the gen-erator to improve the network’s feature extraction capability.Second,enhanced lightweight coordinate attention is incorporated to help the network more precisely concentrate on high-frequency location information,thereby allowing the gener-ator to produce more realistic image reconstruction results.Then,we propose a symmetric and efficient pyramidal segmentation attention discriminator network in which the attention mechanism is capable of derivingfiner-grained multiscale spatial information and creating long-term dependencies between multiscale chan-nel attentions,thus enhancing the discriminative ability of the network.Finally,a Charbonnier loss function and a gradient variance loss function with improved robustness are used to better realize the image’s texture structure and enhance the model’s stability.Thefindings from the experiments reveal that the reconstructed image quality enhances the average peak signal-to-noise ratio(PSNR)by 1.59 dB and the structural similarity index(SSIM)by 0.045 when compared to SRGAN on the three test sets.Compared with the state-of-the-art methods,the reconstructed images have a clearer texture structure,richer high-frequency details,and better visual effects.
文摘Complex nature of underwater environment poses biggest challenge towards image acquisition and transmission of underwater images.This paper proposes an integrated approach which consists of a non-learning enhancement method with deep Convolutional Neural Networks(CNN)for compression and reconstruction of the image.The proposed method does color and contrast correction for image enhancement.The enhanced images are down-sampled using 9-layer CNN followed by Discrete Wavelet Transform(DWT).The decompression is done by using Inverse DWT.Further,the sub-pixel up-sampled image is de-blurred using a three-layer CNN.Residual Dense CNN(RD-CNN)is used to improve the quality of the reconstructed image after deblurring.The quality of the reconstructed images is measured using Peak Signal to Noise Ratio(PSNR)and Structural Similarity Index Metric(SSIM).The proposed model provides better image enhancement,compression,and reconstruction quality than the existing state-of-the-art methods and Super Resolution CNN(SRCNN)respectively.
基金supported in part by the Basic Scientific Research Project of Liaoning Provincial Department of Education under Grant No.LJKQZ2021152in part by the National Science Foundation of China (NSFC)under Grant No.61602226in part by the PhD Startup Foundation of Liaoning Technical University of China under Grant No.18-1021.
文摘The superresolution(SR)method based on generative adversarial networks(GANs)cannot adequately capture enough diversity from training data,resulting in misalignment between input low resolution(LR)images and output high resolution(HR)images.GAN training has difficulty converging.Based on this,an advanced GAN-based image SR reconstructionmethod is presented.First,the dense connection residual block and attention mechanism are integrated into the GAN generator to improve high-frequency feature extraction.Meanwhile,an added discriminator is added into the GAN discriminant network,which forms a dual discriminator to ensure that the process of training is stable.Second,the more robust Charbonnier loss is used instead of the mean square error(MSE)loss to compare similarities between the obtained image and actual image,and the total variation(TV)loss is employed to smooth the training results.Finally,the experimental results indicate that global structures can be better reconstructed using the method of this paper and texture details of images compared with other SOTA methods.The peak signal-to-noise ratio(PSNR)values by the method of this paper are improved by an average of 2.24 dB,and the structural similarity index measure(SSIM)values are improved by an average of 0.07.