Recently, semantic segmentation has been widely applied toimage processing, scene understanding, and many others. Especially, indeep learning-based semantic segmentation, the U-Net with convolutionalencoder-decoder ar...Recently, semantic segmentation has been widely applied toimage processing, scene understanding, and many others. Especially, indeep learning-based semantic segmentation, the U-Net with convolutionalencoder-decoder architecture is a representative model which is proposed forimage segmentation in the biomedical field. It used max pooling operationfor reducing the size of image and making noise robust. However, instead ofreducing the complexity of the model, max pooling has the disadvantageof omitting some information about the image in reducing it. So, thispaper used two diagonal elements of down-sampling operation instead ofit. We think that the down-sampling feature maps have more informationintrinsically than max pooling feature maps because of keeping the Nyquisttheorem and extracting the latent information from them. In addition,this paper used two other diagonal elements for the skip connection. Indecoding, we used Subpixel Convolution rather than transposed convolutionto efficiently decode the encoded feature maps. Including all the ideas, thispaper proposed the new encoder-decoder model called Down-Sampling andSubpixel Convolution U-Net (DSSC-UNet). To prove the better performanceof the proposed model, this paper measured the performance of the UNetand DSSC-UNet on the Cityscapes. As a result, DSSC-UNet achieved89.6% Mean Intersection OverUnion (Mean-IoU) andU-Net achieved 85.6%Mean-IoU, confirming that DSSC-UNet achieved better performance.展开更多
A new scheme combining a scalable transcoder with space time block codes (STBC) for an orthogonal frequency division multiplexing (OFDM) system is proposed for robust video transmission in dispersive fading channe...A new scheme combining a scalable transcoder with space time block codes (STBC) for an orthogonal frequency division multiplexing (OFDM) system is proposed for robust video transmission in dispersive fading channels. The target application for such a scalable transcoder is to provide successful access to the pre-encoded high quality video MPEG-2 from mobile wireless terminals. In the scalable transcoder, besides outputting the MPEG-4 fine granular scalability (FGS) bitstream, both the size of video frames and the bit rate are reduced. And an array processing algorithm of layer interference suppression is used at the receiver which makes the system structure provide different levels of protection to different layers. Furthermore, by considering the important level of scalable bitstream, the different bitstreams can be given different level protection by the system structure and channel coding. With the proposed system, the concurrent large diversity gain characteristic of STBC and alleviation of the frequency-selective fading effect of OFDM can be achieved. The simulation results show that the proposed schemes integrating scalable transcoding can provide a basic quality of video transmission and outperform the conventional single layer transcoding transmitted under the random and bursty error channel conditions.展开更多
A number of conventional interpolation techniques have been proposed. However, it seems that there do not exist good criteria for the design of optimal linear interpolators. Also, such an interpolator can hardly provi...A number of conventional interpolation techniques have been proposed. However, it seems that there do not exist good criteria for the design of optimal linear interpolators. Also, such an interpolator can hardly provide a satisfactory solution for interpolating noisy images. In this paper, the novelty of this research is that a universal approach is proposed to design an image interpolator with any one image smoothing filter, thereby not only interpolating a down-sampled image but also preserving the characteristics of the performing filtering.展开更多
In view of the limited bandwidth of underwater video image transmission,a low bit rate underwater video compression coding method is proposed.Based on the preprocessing process of wavelet transform and coefficient dow...In view of the limited bandwidth of underwater video image transmission,a low bit rate underwater video compression coding method is proposed.Based on the preprocessing process of wavelet transform and coefficient down-sampling,the visual redundancy of underwater image is removed and the computational coefficients and coding bits are reduced.At the same time,combined with multi-level wavelet decomposition,inter frame motion compensation,entropy coding and other methods,according to the characteristics of different types of frame image data,reduce the number of calculations and improve the coding efficiency.The experimental results show that the reconstructed image quality can meet the visual requirements,and the average compression ratio of underwater video can meet the requirements of underwater acoustic channel transmission rate.展开更多
Poor visibility in bad weather, such as haze and fog, is a major problem for many applications of computer vision. Thus, haze removal is highly required for receiving high performance of the vision algorithm. In this ...Poor visibility in bad weather, such as haze and fog, is a major problem for many applications of computer vision. Thus, haze removal is highly required for receiving high performance of the vision algorithm. In this paper, we propose a new fast dehazing method for real-time image and video processing. The transmission map estimated by an improved guided filtering scheme is smooth and respect with depth information of the underlying image. Results demonstrate that the proposed method achieves good dehazeing effect as well as real-time performance. The proposed algorithm, due to its speed and ability to improve visibility, may be used with advantages as pre-processing in many systems ranging from surveillance, intelligent vehicles, to remote sensing.展开更多
As a huge number of satellites revolve around the earth,a great probability exists to observe and determine the change phenomena on the earth through the analysis of satellite images on a real-time basis.Therefore,cla...As a huge number of satellites revolve around the earth,a great probability exists to observe and determine the change phenomena on the earth through the analysis of satellite images on a real-time basis.Therefore,classifying satellite images plays strong assistance in remote sensing communities for predicting tropical cyclones.In this article,a classification approach is proposed using Deep Convolutional Neural Network(DCNN),comprising numerous layers,which extract the features through a downsampling process for classifying satellite cloud images.DCNN is trained marvelously on cloud images with an impressive amount of prediction accuracy.Delivery time decreases for testing images,whereas prediction accuracy increases using an appropriate deep convolutional network with a huge number of training dataset instances.The satellite images are taken from the Meteorological&Oceanographic Satellite Data Archival Centre,the organization is responsible for availing satellite cloud images of India and its subcontinent.The proposed cloud image classification shows 94% prediction accuracy with the DCNN framework.展开更多
文摘Recently, semantic segmentation has been widely applied toimage processing, scene understanding, and many others. Especially, indeep learning-based semantic segmentation, the U-Net with convolutionalencoder-decoder architecture is a representative model which is proposed forimage segmentation in the biomedical field. It used max pooling operationfor reducing the size of image and making noise robust. However, instead ofreducing the complexity of the model, max pooling has the disadvantageof omitting some information about the image in reducing it. So, thispaper used two diagonal elements of down-sampling operation instead ofit. We think that the down-sampling feature maps have more informationintrinsically than max pooling feature maps because of keeping the Nyquisttheorem and extracting the latent information from them. In addition,this paper used two other diagonal elements for the skip connection. Indecoding, we used Subpixel Convolution rather than transposed convolutionto efficiently decode the encoded feature maps. Including all the ideas, thispaper proposed the new encoder-decoder model called Down-Sampling andSubpixel Convolution U-Net (DSSC-UNet). To prove the better performanceof the proposed model, this paper measured the performance of the UNetand DSSC-UNet on the Cityscapes. As a result, DSSC-UNet achieved89.6% Mean Intersection OverUnion (Mean-IoU) andU-Net achieved 85.6%Mean-IoU, confirming that DSSC-UNet achieved better performance.
文摘A new scheme combining a scalable transcoder with space time block codes (STBC) for an orthogonal frequency division multiplexing (OFDM) system is proposed for robust video transmission in dispersive fading channels. The target application for such a scalable transcoder is to provide successful access to the pre-encoded high quality video MPEG-2 from mobile wireless terminals. In the scalable transcoder, besides outputting the MPEG-4 fine granular scalability (FGS) bitstream, both the size of video frames and the bit rate are reduced. And an array processing algorithm of layer interference suppression is used at the receiver which makes the system structure provide different levels of protection to different layers. Furthermore, by considering the important level of scalable bitstream, the different bitstreams can be given different level protection by the system structure and channel coding. With the proposed system, the concurrent large diversity gain characteristic of STBC and alleviation of the frequency-selective fading effect of OFDM can be achieved. The simulation results show that the proposed schemes integrating scalable transcoding can provide a basic quality of video transmission and outperform the conventional single layer transcoding transmitted under the random and bursty error channel conditions.
文摘A number of conventional interpolation techniques have been proposed. However, it seems that there do not exist good criteria for the design of optimal linear interpolators. Also, such an interpolator can hardly provide a satisfactory solution for interpolating noisy images. In this paper, the novelty of this research is that a universal approach is proposed to design an image interpolator with any one image smoothing filter, thereby not only interpolating a down-sampled image but also preserving the characteristics of the performing filtering.
文摘In view of the limited bandwidth of underwater video image transmission,a low bit rate underwater video compression coding method is proposed.Based on the preprocessing process of wavelet transform and coefficient down-sampling,the visual redundancy of underwater image is removed and the computational coefficients and coding bits are reduced.At the same time,combined with multi-level wavelet decomposition,inter frame motion compensation,entropy coding and other methods,according to the characteristics of different types of frame image data,reduce the number of calculations and improve the coding efficiency.The experimental results show that the reconstructed image quality can meet the visual requirements,and the average compression ratio of underwater video can meet the requirements of underwater acoustic channel transmission rate.
文摘Poor visibility in bad weather, such as haze and fog, is a major problem for many applications of computer vision. Thus, haze removal is highly required for receiving high performance of the vision algorithm. In this paper, we propose a new fast dehazing method for real-time image and video processing. The transmission map estimated by an improved guided filtering scheme is smooth and respect with depth information of the underlying image. Results demonstrate that the proposed method achieves good dehazeing effect as well as real-time performance. The proposed algorithm, due to its speed and ability to improve visibility, may be used with advantages as pre-processing in many systems ranging from surveillance, intelligent vehicles, to remote sensing.
文摘As a huge number of satellites revolve around the earth,a great probability exists to observe and determine the change phenomena on the earth through the analysis of satellite images on a real-time basis.Therefore,classifying satellite images plays strong assistance in remote sensing communities for predicting tropical cyclones.In this article,a classification approach is proposed using Deep Convolutional Neural Network(DCNN),comprising numerous layers,which extract the features through a downsampling process for classifying satellite cloud images.DCNN is trained marvelously on cloud images with an impressive amount of prediction accuracy.Delivery time decreases for testing images,whereas prediction accuracy increases using an appropriate deep convolutional network with a huge number of training dataset instances.The satellite images are taken from the Meteorological&Oceanographic Satellite Data Archival Centre,the organization is responsible for availing satellite cloud images of India and its subcontinent.The proposed cloud image classification shows 94% prediction accuracy with the DCNN framework.