The numerous photos captured by low-price Internet of Things(IoT)sensors are frequently affected by meteorological factors,especially rainfall.It causes varying sizes of white streaks on the image,destroying the image...The numerous photos captured by low-price Internet of Things(IoT)sensors are frequently affected by meteorological factors,especially rainfall.It causes varying sizes of white streaks on the image,destroying the image texture and ruining the performance of the outdoor computer vision system.Existing methods utilise training with pairs of images,which is difficult to cover all scenes and leads to domain gaps.In addition,the network structures adopt deep learning to map rain images to rain-free images,failing to use prior knowledge effectively.To solve these problems,we introduce a single image derain model in edge computing that combines prior knowledge of rain patterns with the learning capability of the neural network.Specifically,the algorithm first uses Residue Channel Prior to filter out the rainfall textural features then it uses the Feature Fusion Module to fuse the original image with the background feature information.This results in a pre-processed image which is fed into Half Instance Net(HINet)to recover a high-quality rain-free image with a clear and accurate structure,and the model does not rely on any rainfall assumptions.Experimental results on synthetic and real-world datasets show that the average peak signal-to-noise ratio of the model decreases by 0.37 dB on the synthetic dataset and increases by 0.43 dB on the real-world dataset,demonstrating that a combined model reduces the gap between synthetic data and natural rain scenes,improves the generalization ability of the derain network,and alleviates the overfitting problem.展开更多
Visual degradation of captured images caused by rainy streaks under rainy weather can adversely affect the performance of many open-air vision systems.Hence,it is necessary to address the problem of eliminating rain s...Visual degradation of captured images caused by rainy streaks under rainy weather can adversely affect the performance of many open-air vision systems.Hence,it is necessary to address the problem of eliminating rain streaks from the individual rainy image.In this work,a deep convolution neural network(CNN)based method is introduced,called Rain-Removal Net(R2N),to solve the single image de-raining issue.Firstly,we decomposed the rainy image into its high-frequency detail layer and lowfrequency base layer.Then,we used the high-frequency detail layer to input the carefully designed CNN architecture to learn the mapping between it and its corresponding derained high-frequency detail layer.The CNN architecture consists of four convolution layers and four deconvolution layers,as well as three skip connections.The experiments on synthetic and real-world rainy images show that the performance of our architecture outperforms the compared state-of-the-art de-raining models with respects to the quality of de-rained images and computing efficiency.展开更多
Outdoor haze has adverse impact on outdoor image quality,including contrast loss and poor visibility.In this paper,a novel dehazing algorithm based on the decomposition strategy is proposed.It combines the advantages ...Outdoor haze has adverse impact on outdoor image quality,including contrast loss and poor visibility.In this paper,a novel dehazing algorithm based on the decomposition strategy is proposed.It combines the advantages of the two-dimensional variational mode decomposition(2DVMD)algorithm and dark channel prior.The original hazy image is adaptively decom-posed into low-frequency and high-frequency images according to the image frequency band by using the 2DVMD algorithm.The low-frequency image is dehazed by using the improved dark channel prior,and then fused with the high-frequency image.Furthermore,we optimize the atmospheric light and transmit-tance estimation method to obtain a defogging effect with richer details and stronger contrast.The proposed algorithm is com-pared with the existing advanced algorithms.Experiment results show that the proposed algorithm has better performance in comparison with the state-of-the-art algorithms.展开更多
In this letter, we present a novel integrated feature that incorporates traditional parameters, and adopt a parallel cascading fashion network Haze Net for enhancing image quality. Our unified feature is a complete in...In this letter, we present a novel integrated feature that incorporates traditional parameters, and adopt a parallel cascading fashion network Haze Net for enhancing image quality. Our unified feature is a complete integration, and its role is to directly describe the effects of haze. In Haze Net, we design two separate structures including backbone and auxiliary networks to extract feature map. Backbone network is responsible for extracting high-level feature map, and low-level feature learned by the auxiliary network can be interpreted as fine-grained feature. After cascading two features with different accuracy, final performance can be effectively improved. Extensive experimental results on both synthetic datasets and real-world images prove the superiority of the proposed method, and demonstrate more favorable performance compared with the existing state-of-art methods.展开更多
There is a steadily growing range of applications that can benefit from facial reconstruction techniques,leading to an increasing demand for reconstruction of high-quality 3D face models.While it is an important expre...There is a steadily growing range of applications that can benefit from facial reconstruction techniques,leading to an increasing demand for reconstruction of high-quality 3D face models.While it is an important expressive part of the human face,the nose has received less attention than other expressive regions in the face reconstruction literature.When applying existing reconstruction methods to facial images,the reconstructed nose models are often inconsistent with the desired shape and expression.In this paper,we propose a coarse-to-fine 3D nose reconstruction and correction pipeline to build a nose model from a single image,where 3D and 2D nose curve correspondences are adaptively updated and refined.We first correct the reconstruction result coarsely using constraints of 3D-2D sparse landmark correspondences,and then heuristically update a dense 3D-2D curve correspondence based on the coarsely corrected result.A final refinement step is performed to correct the shape based on the updated 3D-2D dense curve constraints.Experimental results show the advantages of our method for 3D nose reconstruction over existing methods.展开更多
The quality of photos is highly susceptible to severe weather such as heavy rain;it can also degrade the performance of various visual tasks like object detection.Rain removal is a challenging problem because rain str...The quality of photos is highly susceptible to severe weather such as heavy rain;it can also degrade the performance of various visual tasks like object detection.Rain removal is a challenging problem because rain streaks have different appearances even in one image.Regions where rain accumulates appear foggy or misty,while rain streaks can be clearly seen in areas where rain is less heavy.We propose removing various rain effects in pictures using a hybrid multiscale loss guided multiple feature fusion de-raining network(MSGMFFNet).Specially,to deal with rain streaks,our method generates a rain streak attention map,while preprocessing uses gamma correction and contrast enhancement to enhanced images to address the problem of rain accumulation.Using these tools,the model can restore a result with abundant details.Furthermore,a hybrid multiscale loss combining L1 loss and edge loss is used to guide the training process to pay attention to edge and content information.Comprehensive experiments conducted on both synthetic and real-world datasets demonstrate the effectiveness of our method.展开更多
Cities are in constant change and city managers aim to keep an updated digital model of the city for city governance. There are a lot of images uploaded daily on image sharing platforms (as “Flickr”, “Twitter”, et...Cities are in constant change and city managers aim to keep an updated digital model of the city for city governance. There are a lot of images uploaded daily on image sharing platforms (as “Flickr”, “Twitter”, etc.). These images feature a rough localization and no orientation information. Nevertheless, they can help to populate an active collaborative database of street images usable to maintain a city 3D model, but their localization and orientation need to be known. Based on these images, we propose the Data Gathering system for image Pose Estimation (DGPE) that helps to find the pose (position and orientation) of the camera used to shoot them with better accuracy than the sole GPS localization that may be embedded in the image header. DGPE uses both visual and semantic information, existing in a single image processed by a fully automatic chain composed of three main layers: Data retrieval and preprocessing layer, Features extraction layer, Decision Making layer. In this article, we present the whole system details and compare its detection results with a state of the art method. Finally, we show the obtained localization, and often orientation results, combining both semantic and visual information processing on 47 images. Our multilayer system succeeds in 26% of our test cases in finding a better localization and orientation of the original photo. This is achieved by using only the image content and associated metadata. The use of semantic information found on social media such as comments, hash tags, etc. has doubled the success rate to 59%. It has reduced the search area and thus made the visual search more accurate.展开更多
The thermal front in the oceanic system is believed to have a significant effect on biological activity.During an era of climate change,changes in heat regulation between the atmosphere and oceanic interior can alter ...The thermal front in the oceanic system is believed to have a significant effect on biological activity.During an era of climate change,changes in heat regulation between the atmosphere and oceanic interior can alter the characteristics of this important feature.Using the simulation results of the 3D Regional Ocean Modelling System(ROMS),we identified the location of thermal fronts and determined their dynamic variability in the area between the southern Andaman Sea and northern Malacca Strait.The Single Image Edge Detection(SIED)algorithm was used to detect the thermal front from model-derived temperature.Results show that a thermal front occurred every year from 2002 to 2012 with the temperature gradient at the location of the front was 0.3°C/km.Compared to the years affected by El Ni?o and negative Indian Ocean Dipole(IOD),the normal years(e.g.,May 2003)show the presence of the thermal front at every selected depth(10,25,50,and 75 m),whereas El Ni?o and negative IOD during 2010 show the presence of the thermal front only at depth of 75 m due to greater warming,leading to the thermocline deepening and enhanced stratification.During May 2003,the thermal front was separated by cooler SST in the southern Andaman Sea and warmer SST in the northern Malacca Strait.The higher SST in the northern Malacca Strait was believed due to the besieged Malacca Strait,which trapped the heat and make it difficult to release while higher chlorophyll a in Malacca Strait is due to the freshwater conduit from nearby rivers(Klang,Langat,Perak,and Selangor).Furthermore,compared to the southern Andaman Sea,the chlorophyll a in the northern Malacca Strait is easier to reach the surface area due to the shallower thermocline,which allows nutrients in the area to reach the surface faster.展开更多
Single image super resolution(SISR)is an important research content in the field of computer vision and image processing.With the rapid development of deep neural networks,different image super-resolution models have ...Single image super resolution(SISR)is an important research content in the field of computer vision and image processing.With the rapid development of deep neural networks,different image super-resolution models have emerged.Compared to some traditional SISR methods,deep learning-based methods can complete the super-resolution tasks through a single image.In addition,compared with the SISR methods using traditional convolutional neural networks,SISR based on generative adversarial networks(GAN)has achieved the most advanced visual performance.In this review,we first explore the challenges faced by SISR and introduce some common datasets and evaluation metrics.Then,we review the improved network structures and loss functions of GAN-based perceptual SISR.Subsequently,the advantages and disadvantages of different networks are analyzed by multiple comparative experiments.Finally,we summarize the paper and look forward to the future development trends of GAN-based perceptual SISR.展开更多
Although there has been a great breakthrough in the accuracy and speed of super-resolution(SR)reconstruction of a single image by using a convolutional neural network,an important problem remains unresolved:how to res...Although there has been a great breakthrough in the accuracy and speed of super-resolution(SR)reconstruction of a single image by using a convolutional neural network,an important problem remains unresolved:how to restore finer texture details during image super-resolution reconstruction?This paper proposes an Enhanced Laplacian Pyramid Generative Adversarial Network(ELSRGAN),based on the Laplacian pyramid to capture the high-frequency details of the image.By combining Laplacian pyramids and generative adversarial networks,progressive reconstruction of super-resolution images can be made,making model applications more flexible.In order to solve the problem of gradient disappearance,we introduce the Residual-in-Residual Dense Block(RRDB)as the basic network unit.Network capacity benefits more from dense connections,is able to capture more visual features with better reconstruction effects,and removes BN layers to increase calculation speed and reduce calculation complexity.In addition,a loss of content driven by perceived similarity is used instead of content loss driven by spatial similarity,thereby enhancing the visual effect of the super-resolution image,making it more consistent with human visual perception.Extensive qualitative and quantitative evaluation of the baseline datasets shows that the proposed algorithm has higher mean-sort-score(MSS)than any state-of-the-art method and has better visual perception.展开更多
Obtaining clear images of underwater scenes with descriptive details is an arduous task.Conventional imaging techniques fail to provide clear cut features and attributes that ultimately result in object recognition er...Obtaining clear images of underwater scenes with descriptive details is an arduous task.Conventional imaging techniques fail to provide clear cut features and attributes that ultimately result in object recognition errors.Consequently,a need for a system that produces clear images for underwater image study has been necessitated.To overcome problems in resolution and to make better use of the Super-Resolution(SR)method,this paper introduces a novel method that has been derived from the Alpha Generative Adversarial Network(AlphaGAN)model,named Alpha Super Resolution Generative Adversarial Network(AlphaSRGAN).The model put forth in this paper helps in enhancing the quality of underwater imagery and yields images with greater resolution and more concise details.Images undergo pre-processing before they are fed into a generator network that optimizes and reforms the structure of the network while enhancing the stability of the network that acts as the generator.After the images are processed by the generator network,they are passed through an adversarial method for training models.The dataset used in this paper to learn Single Image Super Resolution(SISR)is the USR 248 dataset.Training supervision is performed by an unprejudiced function that simultaneously scrutinizes and improves the image quality.Appraisal of images is done with reference to factors like local style information,global content and color.The dataset USR 248 which has a huge collection of images has been used for the study is composed of three collections of images—high(640×480)and low(80×60,160×120,and 320×240).Paired instances of different sizes—2×,4×and 8×—are also present in the dataset.Parameters like Mean Opinion Score(MOS),Peak Signal-to-Noise Ratio(PSNR),Structural Similarity(SSIM)and Underwater Image Quality Measure(UIQM)scores have been compared to validate the improved efficiency of our model when compared to existing works.展开更多
Background This study presents a neural hand reconstruction method for monocular 3D hand pose and shape estimation.Methods Alternate to directly representing hand with 3D data,a novel UV position map is used to repres...Background This study presents a neural hand reconstruction method for monocular 3D hand pose and shape estimation.Methods Alternate to directly representing hand with 3D data,a novel UV position map is used to represent a hand pose and shape with 2D data that maps 3D hand surface points to 2D image space.Furthermore,an encoder-decoder neural network is proposed to infer such UV position map from a single image.To train this network with inadequate ground truth training pairs,we propose a novel MANOReg module that employs MANO model as a prior shape to constrain high dimensional space of the UV position map.Results The quantitative and qualitative experiments demonstrate the effectiveness of our UV position map representation and MANOReg module.展开更多
Single pixel imaging is a novel imaging technique,and it becomes a focus of research in recent years due to its advantages such as high lateral resolution and high robustness to noise.Imaging speed is one of the criti...Single pixel imaging is a novel imaging technique,and it becomes a focus of research in recent years due to its advantages such as high lateral resolution and high robustness to noise.Imaging speed is one of the critical shortcomings,which limits the further development and applications of this technique.In this paper,we focus on the issues of imaging efficiency of a single pixel imaging system.We propose semi-continuous wavelet transform(SCWT)protocol and introduce the protocol into the single pixel imaging system.The proposed protocol is something between continuous wavelet transform and discrete wavelet transform,which allows the usage of those smooth(usually non-orthogonal,and they have advantages in representing smooth signals compressively,which can improve the imaging speed of single pixel imaging)wavelets and with limited numbers of measurements.The proposed imaging scheme is studied,and verified by simulations and experiments.Furthermore,a comparison between our proposed scheme and existing imaging schemes are given.According to the results,the proposed SCWT scheme is proved to be effective in reconstructing a image compressively.展开更多
The exosome-mediated response can promote or restrain the diseases by regulating the intracellular pathways,making the exosome become an effective marker for diagnosis and therapeutic control at the single-cell level....The exosome-mediated response can promote or restrain the diseases by regulating the intracellular pathways,making the exosome become an effective marker for diagnosis and therapeutic control at the single-cell level.However,real-time analysis is hard to be achieved with traditional approaches because the exosomes usually need to be enriched by ultracentrifugation for a measurable signal-to-noise ratio.Recently developed label-free single-molecule imaging approaches may become an real-time quantitative tool for the analysis of single exosomes and related secretion behaviors of single living cells owing to their extreme sensitivity.展开更多
In modern society,information is becoming increasingly interconnected through networks,and the rapid development of information technology has caused people to pay more attention to the encryption and the protection o...In modern society,information is becoming increasingly interconnected through networks,and the rapid development of information technology has caused people to pay more attention to the encryption and the protection of information.Image encryption technology is a key technology for ensuring the security performance of images.We extracted single channel RGB component images from a color image using MATLAB programs,encrypted and decrypted the color images by randomly disrupting rows,columns and regions of the image.Combined with histograms and the visual judgments of encryption images,it is shown that the information of the original image cannot be obtained from the encryption image easily.The results show that the color-image encryptions with the algorithm we used have good effect and fast operation speed.Thus this algorithm has certain practical value.展开更多
Due to highly underdetermined nature of Single Image Super-Resolution(SISR)problem,deep learning neural networks are required to be more deeper to solve the problem effectively.One of deep neural networks successful i...Due to highly underdetermined nature of Single Image Super-Resolution(SISR)problem,deep learning neural networks are required to be more deeper to solve the problem effectively.One of deep neural networks successful in the Super-Resolution(SR)problem is ResNet which can render the capability of deeper networks with the help of skip connections.However,zero padding(ZP)scheme in the network restricts benefits of skip connections in SRResNet and its performance as the ratio of the number of pure input data to that of zero padded data increases.In this paper.we consider the ResNet with Partial Convolution based Padding(PCP)instead of ZP to solve SR problem.Since training of deep neural networks using patch images is advantageous in many aspects such as the number of training image data and network complexities,patch image based SR performance is compared with single full image based one.The experimental results show that patch based SRResNet SR results are better than single full image based ones and the performance of deep SRResNet with PCP is better than the one with ZP.展开更多
According to the operational conditions of an aviation module reticule,a measurement mode is proposed,which is based on an industrial photogrammetry system,with matching by a measuring pen.Meanwhile,the factors affect...According to the operational conditions of an aviation module reticule,a measurement mode is proposed,which is based on an industrial photogrammetry system,with matching by a measuring pen.Meanwhile,the factors affecting the accuracy of the measurement have been analyzed and verified by examples.The analysis is described as follows:①Along the optical axis of the camera,the error is larger than the ones in other directions using the“single camera+measuring pen”mode;②By avoiding the error along the optical axis of the camera,the accuracy of the“single camera+measuring pen”mode is better than 0.1 mm when the measuring pen is moving parallel to the optical axis.展开更多
Many networks are designed to stack a large number of residual blocks,deepen the network and improve network performance through short residual connec-tion,long residual connection,and dense connection.However,without...Many networks are designed to stack a large number of residual blocks,deepen the network and improve network performance through short residual connec-tion,long residual connection,and dense connection.However,without consider-ing different contributions of different depth features to the network,these de-signs have the problem of evaluating the importance of different depth features.To solve this problem,this paper proposes an adaptive densely residual net-work(ADRNet)for the single image super resolution.ADRN realizes the evalua-tion of distributions of different depth features and learns more representative features.An adaptive densely residual block(ADRB)was designed,combining 3 residual blocks(RB)and dense connection was added.It learned the attention score of each dense connection through adaptive dense connections,and the at-tention score reflected the importance of the features of each RB.To further en-hance the performance of ADRB,a multi-direction attention block(MDAB)was introduced to obtain multidirectional context information.Through comparative experiments,it is proved that theproposed ADRNet is superior to the existing methods.Through ablation experiments,it is proved that evaluating features of different depths helps to improve network performance.展开更多
Significance, difficult, international developing actuality and our completed works for single molecules imaging in living cell with optical method are described respectively. Additionally we give out some suggestions...Significance, difficult, international developing actuality and our completed works for single molecules imaging in living cell with optical method are described respectively. Additionally we give out some suggestions for the technology development further.展开更多
In single-pixel imaging or computational ghost imaging,the measurement matrix has a great impact on the performance of the imaging system,because it involves modulation of the optical signal and image reconstruction.T...In single-pixel imaging or computational ghost imaging,the measurement matrix has a great impact on the performance of the imaging system,because it involves modulation of the optical signal and image reconstruction.The measurement matrix reported in the existing literatures is first binarized and then loaded onto the digital micro-mirror device(DMD)for optical modulation,that is,each pixel can only be modulated into on-off states.In this paper,we propose a digital grayscale modulation method for more efficient compressive sampling.On the basis of this,we demonstrate a single photon compressive imaging system.A control and counting circuit,based on field-programmable gate array(FPGA),is developed to control DMD to conduct digital grayscale modulation and count single-photon pulse output from the photomultiplier tube(PMT)simultaneously.The experimental results show that the imaging reconstruction quality can be improved by increasing the sparsity ratio properly and compressive sampling ratio(SR)of these gray-scale matrices.However,when the compressive SR and sparsity ratio are increased appropriately to a certain value,the reconstruction quality is usually saturated,and the imaging reconstruction quality of the digital grayscale modulation is better than that of binary modulation.展开更多
基金supported by the National Natural Science Foundation of China under Grant no.41975183,and Grant no.41875184 and Supported by a grant from State Key Laboratory of Resources and Environmental Information System.
文摘The numerous photos captured by low-price Internet of Things(IoT)sensors are frequently affected by meteorological factors,especially rainfall.It causes varying sizes of white streaks on the image,destroying the image texture and ruining the performance of the outdoor computer vision system.Existing methods utilise training with pairs of images,which is difficult to cover all scenes and leads to domain gaps.In addition,the network structures adopt deep learning to map rain images to rain-free images,failing to use prior knowledge effectively.To solve these problems,we introduce a single image derain model in edge computing that combines prior knowledge of rain patterns with the learning capability of the neural network.Specifically,the algorithm first uses Residue Channel Prior to filter out the rainfall textural features then it uses the Feature Fusion Module to fuse the original image with the background feature information.This results in a pre-processed image which is fed into Half Instance Net(HINet)to recover a high-quality rain-free image with a clear and accurate structure,and the model does not rely on any rainfall assumptions.Experimental results on synthetic and real-world datasets show that the average peak signal-to-noise ratio of the model decreases by 0.37 dB on the synthetic dataset and increases by 0.43 dB on the real-world dataset,demonstrating that a combined model reduces the gap between synthetic data and natural rain scenes,improves the generalization ability of the derain network,and alleviates the overfitting problem.
基金This work was supported by the National Natural Science Foundation of China(Grant No.61673222)Jiangsu Universities Natural Science Research Project(Grant No.13KJA510001)Major Program of the National Social Science Fund of China(Grant No.17ZDA092).
文摘Visual degradation of captured images caused by rainy streaks under rainy weather can adversely affect the performance of many open-air vision systems.Hence,it is necessary to address the problem of eliminating rain streaks from the individual rainy image.In this work,a deep convolution neural network(CNN)based method is introduced,called Rain-Removal Net(R2N),to solve the single image de-raining issue.Firstly,we decomposed the rainy image into its high-frequency detail layer and lowfrequency base layer.Then,we used the high-frequency detail layer to input the carefully designed CNN architecture to learn the mapping between it and its corresponding derained high-frequency detail layer.The CNN architecture consists of four convolution layers and four deconvolution layers,as well as three skip connections.The experiments on synthetic and real-world rainy images show that the performance of our architecture outperforms the compared state-of-the-art de-raining models with respects to the quality of de-rained images and computing efficiency.
基金supported by the National Defense Technology Advance Research Project of China(004040204).
文摘Outdoor haze has adverse impact on outdoor image quality,including contrast loss and poor visibility.In this paper,a novel dehazing algorithm based on the decomposition strategy is proposed.It combines the advantages of the two-dimensional variational mode decomposition(2DVMD)algorithm and dark channel prior.The original hazy image is adaptively decom-posed into low-frequency and high-frequency images according to the image frequency band by using the 2DVMD algorithm.The low-frequency image is dehazed by using the improved dark channel prior,and then fused with the high-frequency image.Furthermore,we optimize the atmospheric light and transmit-tance estimation method to obtain a defogging effect with richer details and stronger contrast.The proposed algorithm is com-pared with the existing advanced algorithms.Experiment results show that the proposed algorithm has better performance in comparison with the state-of-the-art algorithms.
基金supported by the National Natural Science Foundation of China (No.61561030)the Gansu Provincial F inance Department (No.214138)。
文摘In this letter, we present a novel integrated feature that incorporates traditional parameters, and adopt a parallel cascading fashion network Haze Net for enhancing image quality. Our unified feature is a complete integration, and its role is to directly describe the effects of haze. In Haze Net, we design two separate structures including backbone and auxiliary networks to extract feature map. Backbone network is responsible for extracting high-level feature map, and low-level feature learned by the auxiliary network can be interpreted as fine-grained feature. After cascading two features with different accuracy, final performance can be effectively improved. Extensive experimental results on both synthetic datasets and real-world images prove the superiority of the proposed method, and demonstrate more favorable performance compared with the existing state-of-art methods.
基金supported by the National Natural Science Foundation of China(Grant Nos.61972342,61602402,and 61902334)Zhejiang Provincial Basic Public Welfare Research(Grant No.LGG19F020001)+1 种基金Shenzhen Fundamental Research(General Project)(Grant No.JCYJ20190814112007258)the Royal Society(Grant No.IES\R1\180126).
文摘There is a steadily growing range of applications that can benefit from facial reconstruction techniques,leading to an increasing demand for reconstruction of high-quality 3D face models.While it is an important expressive part of the human face,the nose has received less attention than other expressive regions in the face reconstruction literature.When applying existing reconstruction methods to facial images,the reconstructed nose models are often inconsistent with the desired shape and expression.In this paper,we propose a coarse-to-fine 3D nose reconstruction and correction pipeline to build a nose model from a single image,where 3D and 2D nose curve correspondences are adaptively updated and refined.We first correct the reconstruction result coarsely using constraints of 3D-2D sparse landmark correspondences,and then heuristically update a dense 3D-2D curve correspondence based on the coarsely corrected result.A final refinement step is performed to correct the shape based on the updated 3D-2D dense curve constraints.Experimental results show the advantages of our method for 3D nose reconstruction over existing methods.
基金This work was supported in part by the National Key R&D Program of China under No.2017YFB1003000the National Natural Science Foundation of China under No.61872047 and No.61720106007+2 种基金the Beijing Nova Program under No.Z201100006820124the Beijing Natural Science Foundation(L191004)the 111 Project(B18008).
文摘The quality of photos is highly susceptible to severe weather such as heavy rain;it can also degrade the performance of various visual tasks like object detection.Rain removal is a challenging problem because rain streaks have different appearances even in one image.Regions where rain accumulates appear foggy or misty,while rain streaks can be clearly seen in areas where rain is less heavy.We propose removing various rain effects in pictures using a hybrid multiscale loss guided multiple feature fusion de-raining network(MSGMFFNet).Specially,to deal with rain streaks,our method generates a rain streak attention map,while preprocessing uses gamma correction and contrast enhancement to enhanced images to address the problem of rain accumulation.Using these tools,the model can restore a result with abundant details.Furthermore,a hybrid multiscale loss combining L1 loss and edge loss is used to guide the training process to pay attention to edge and content information.Comprehensive experiments conducted on both synthetic and real-world datasets demonstrate the effectiveness of our method.
文摘Cities are in constant change and city managers aim to keep an updated digital model of the city for city governance. There are a lot of images uploaded daily on image sharing platforms (as “Flickr”, “Twitter”, etc.). These images feature a rough localization and no orientation information. Nevertheless, they can help to populate an active collaborative database of street images usable to maintain a city 3D model, but their localization and orientation need to be known. Based on these images, we propose the Data Gathering system for image Pose Estimation (DGPE) that helps to find the pose (position and orientation) of the camera used to shoot them with better accuracy than the sole GPS localization that may be embedded in the image header. DGPE uses both visual and semantic information, existing in a single image processed by a fully automatic chain composed of three main layers: Data retrieval and preprocessing layer, Features extraction layer, Decision Making layer. In this article, we present the whole system details and compare its detection results with a state of the art method. Finally, we show the obtained localization, and often orientation results, combining both semantic and visual information processing on 47 images. Our multilayer system succeeds in 26% of our test cases in finding a better localization and orientation of the original photo. This is achieved by using only the image content and associated metadata. The use of semantic information found on social media such as comments, hash tags, etc. has doubled the success rate to 59%. It has reduced the search area and thus made the visual search more accurate.
基金the Higher Education Ministry research grant,under the Long-Term Research Grant Scheme(No.LRGS/1/2020/UMT/01/1/2)the Universiti Malaysia Terengganu Scholarship(BUMT)。
文摘The thermal front in the oceanic system is believed to have a significant effect on biological activity.During an era of climate change,changes in heat regulation between the atmosphere and oceanic interior can alter the characteristics of this important feature.Using the simulation results of the 3D Regional Ocean Modelling System(ROMS),we identified the location of thermal fronts and determined their dynamic variability in the area between the southern Andaman Sea and northern Malacca Strait.The Single Image Edge Detection(SIED)algorithm was used to detect the thermal front from model-derived temperature.Results show that a thermal front occurred every year from 2002 to 2012 with the temperature gradient at the location of the front was 0.3°C/km.Compared to the years affected by El Ni?o and negative Indian Ocean Dipole(IOD),the normal years(e.g.,May 2003)show the presence of the thermal front at every selected depth(10,25,50,and 75 m),whereas El Ni?o and negative IOD during 2010 show the presence of the thermal front only at depth of 75 m due to greater warming,leading to the thermocline deepening and enhanced stratification.During May 2003,the thermal front was separated by cooler SST in the southern Andaman Sea and warmer SST in the northern Malacca Strait.The higher SST in the northern Malacca Strait was believed due to the besieged Malacca Strait,which trapped the heat and make it difficult to release while higher chlorophyll a in Malacca Strait is due to the freshwater conduit from nearby rivers(Klang,Langat,Perak,and Selangor).Furthermore,compared to the southern Andaman Sea,the chlorophyll a in the northern Malacca Strait is easier to reach the surface area due to the shallower thermocline,which allows nutrients in the area to reach the surface faster.
基金The authors are highly thankful to the Development Research Center of Guangxi Relatively Sparse-populated Minorities(ID:GXRKJSZ201901)to the Natural Science Foundation of Guangxi Province(No.2018GXNSFAA281164)This research was financially supported by the project of outstanding thousand young teachers’training in higher education institutions of Guangxi,Guangxi Colleges and Universities Key Laboratory Breeding Base of System Control and Information Processing.
文摘Single image super resolution(SISR)is an important research content in the field of computer vision and image processing.With the rapid development of deep neural networks,different image super-resolution models have emerged.Compared to some traditional SISR methods,deep learning-based methods can complete the super-resolution tasks through a single image.In addition,compared with the SISR methods using traditional convolutional neural networks,SISR based on generative adversarial networks(GAN)has achieved the most advanced visual performance.In this review,we first explore the challenges faced by SISR and introduce some common datasets and evaluation metrics.Then,we review the improved network structures and loss functions of GAN-based perceptual SISR.Subsequently,the advantages and disadvantages of different networks are analyzed by multiple comparative experiments.Finally,we summarize the paper and look forward to the future development trends of GAN-based perceptual SISR.
基金This work was supported in part by the National Science Foundation of China under Grant 61572526.
文摘Although there has been a great breakthrough in the accuracy and speed of super-resolution(SR)reconstruction of a single image by using a convolutional neural network,an important problem remains unresolved:how to restore finer texture details during image super-resolution reconstruction?This paper proposes an Enhanced Laplacian Pyramid Generative Adversarial Network(ELSRGAN),based on the Laplacian pyramid to capture the high-frequency details of the image.By combining Laplacian pyramids and generative adversarial networks,progressive reconstruction of super-resolution images can be made,making model applications more flexible.In order to solve the problem of gradient disappearance,we introduce the Residual-in-Residual Dense Block(RRDB)as the basic network unit.Network capacity benefits more from dense connections,is able to capture more visual features with better reconstruction effects,and removes BN layers to increase calculation speed and reduce calculation complexity.In addition,a loss of content driven by perceived similarity is used instead of content loss driven by spatial similarity,thereby enhancing the visual effect of the super-resolution image,making it more consistent with human visual perception.Extensive qualitative and quantitative evaluation of the baseline datasets shows that the proposed algorithm has higher mean-sort-score(MSS)than any state-of-the-art method and has better visual perception.
文摘Obtaining clear images of underwater scenes with descriptive details is an arduous task.Conventional imaging techniques fail to provide clear cut features and attributes that ultimately result in object recognition errors.Consequently,a need for a system that produces clear images for underwater image study has been necessitated.To overcome problems in resolution and to make better use of the Super-Resolution(SR)method,this paper introduces a novel method that has been derived from the Alpha Generative Adversarial Network(AlphaGAN)model,named Alpha Super Resolution Generative Adversarial Network(AlphaSRGAN).The model put forth in this paper helps in enhancing the quality of underwater imagery and yields images with greater resolution and more concise details.Images undergo pre-processing before they are fed into a generator network that optimizes and reforms the structure of the network while enhancing the stability of the network that acts as the generator.After the images are processed by the generator network,they are passed through an adversarial method for training models.The dataset used in this paper to learn Single Image Super Resolution(SISR)is the USR 248 dataset.Training supervision is performed by an unprejudiced function that simultaneously scrutinizes and improves the image quality.Appraisal of images is done with reference to factors like local style information,global content and color.The dataset USR 248 which has a huge collection of images has been used for the study is composed of three collections of images—high(640×480)and low(80×60,160×120,and 320×240).Paired instances of different sizes—2×,4×and 8×—are also present in the dataset.Parameters like Mean Opinion Score(MOS),Peak Signal-to-Noise Ratio(PSNR),Structural Similarity(SSIM)and Underwater Image Quality Measure(UIQM)scores have been compared to validate the improved efficiency of our model when compared to existing works.
文摘Background This study presents a neural hand reconstruction method for monocular 3D hand pose and shape estimation.Methods Alternate to directly representing hand with 3D data,a novel UV position map is used to represent a hand pose and shape with 2D data that maps 3D hand surface points to 2D image space.Furthermore,an encoder-decoder neural network is proposed to infer such UV position map from a single image.To train this network with inadequate ground truth training pairs,we propose a novel MANOReg module that employs MANO model as a prior shape to constrain high dimensional space of the UV position map.Results The quantitative and qualitative experiments demonstrate the effectiveness of our UV position map representation and MANOReg module.
基金the Natural Science Foundation of Jilin Province,China(Grand No.YDZJ202101ZYTS030)。
文摘Single pixel imaging is a novel imaging technique,and it becomes a focus of research in recent years due to its advantages such as high lateral resolution and high robustness to noise.Imaging speed is one of the critical shortcomings,which limits the further development and applications of this technique.In this paper,we focus on the issues of imaging efficiency of a single pixel imaging system.We propose semi-continuous wavelet transform(SCWT)protocol and introduce the protocol into the single pixel imaging system.The proposed protocol is something between continuous wavelet transform and discrete wavelet transform,which allows the usage of those smooth(usually non-orthogonal,and they have advantages in representing smooth signals compressively,which can improve the imaging speed of single pixel imaging)wavelets and with limited numbers of measurements.The proposed imaging scheme is studied,and verified by simulations and experiments.Furthermore,a comparison between our proposed scheme and existing imaging schemes are given.According to the results,the proposed SCWT scheme is proved to be effective in reconstructing a image compressively.
基金National Institute of General Medical Sciences of the National Institutes of Health Grant[Grant No.R01GM107165].
文摘The exosome-mediated response can promote or restrain the diseases by regulating the intracellular pathways,making the exosome become an effective marker for diagnosis and therapeutic control at the single-cell level.However,real-time analysis is hard to be achieved with traditional approaches because the exosomes usually need to be enriched by ultracentrifugation for a measurable signal-to-noise ratio.Recently developed label-free single-molecule imaging approaches may become an real-time quantitative tool for the analysis of single exosomes and related secretion behaviors of single living cells owing to their extreme sensitivity.
基金National Natural Science Foundation of China(No.11865013)Horizontal Project of Shangrao Normal University,China(No.K8000219T)+1 种基金Industrial Science and Technology Project in Shangrao of Jiangxi Province,China(No.17A005)Doctoral Scientific Research Foundation of Shangrao Normal University,China(No.6000108)。
文摘In modern society,information is becoming increasingly interconnected through networks,and the rapid development of information technology has caused people to pay more attention to the encryption and the protection of information.Image encryption technology is a key technology for ensuring the security performance of images.We extracted single channel RGB component images from a color image using MATLAB programs,encrypted and decrypted the color images by randomly disrupting rows,columns and regions of the image.Combined with histograms and the visual judgments of encryption images,it is shown that the information of the original image cannot be obtained from the encryption image easily.The results show that the color-image encryptions with the algorithm we used have good effect and fast operation speed.Thus this algorithm has certain practical value.
文摘Due to highly underdetermined nature of Single Image Super-Resolution(SISR)problem,deep learning neural networks are required to be more deeper to solve the problem effectively.One of deep neural networks successful in the Super-Resolution(SR)problem is ResNet which can render the capability of deeper networks with the help of skip connections.However,zero padding(ZP)scheme in the network restricts benefits of skip connections in SRResNet and its performance as the ratio of the number of pure input data to that of zero padded data increases.In this paper.we consider the ResNet with Partial Convolution based Padding(PCP)instead of ZP to solve SR problem.Since training of deep neural networks using patch images is advantageous in many aspects such as the number of training image data and network complexities,patch image based SR performance is compared with single full image based one.The experimental results show that patch based SRResNet SR results are better than single full image based ones and the performance of deep SRResNet with PCP is better than the one with ZP.
文摘According to the operational conditions of an aviation module reticule,a measurement mode is proposed,which is based on an industrial photogrammetry system,with matching by a measuring pen.Meanwhile,the factors affecting the accuracy of the measurement have been analyzed and verified by examples.The analysis is described as follows:①Along the optical axis of the camera,the error is larger than the ones in other directions using the“single camera+measuring pen”mode;②By avoiding the error along the optical axis of the camera,the accuracy of the“single camera+measuring pen”mode is better than 0.1 mm when the measuring pen is moving parallel to the optical axis.
文摘Many networks are designed to stack a large number of residual blocks,deepen the network and improve network performance through short residual connec-tion,long residual connection,and dense connection.However,without consider-ing different contributions of different depth features to the network,these de-signs have the problem of evaluating the importance of different depth features.To solve this problem,this paper proposes an adaptive densely residual net-work(ADRNet)for the single image super resolution.ADRN realizes the evalua-tion of distributions of different depth features and learns more representative features.An adaptive densely residual block(ADRB)was designed,combining 3 residual blocks(RB)and dense connection was added.It learned the attention score of each dense connection through adaptive dense connections,and the at-tention score reflected the importance of the features of each RB.To further en-hance the performance of ADRB,a multi-direction attention block(MDAB)was introduced to obtain multidirectional context information.Through comparative experiments,it is proved that theproposed ADRNet is superior to the existing methods.Through ablation experiments,it is proved that evaluating features of different depths helps to improve network performance.
文摘Significance, difficult, international developing actuality and our completed works for single molecules imaging in living cell with optical method are described respectively. Additionally we give out some suggestions for the technology development further.
基金This work was supported in part by the National Natural Science Foundation of China(Grants Nos.61865010 and 61565012)in part by the China Postdoctoral Science Foundation(Grant No.2015T80691)+1 种基金in part by the Science and Technology Plan Project of Jiangxi Province(Grant No.20151BBE50092)in part by the Funding Scheme to Outstanding Young Talents of Jiangxi Province(Grant No.20171BCB23007).
文摘In single-pixel imaging or computational ghost imaging,the measurement matrix has a great impact on the performance of the imaging system,because it involves modulation of the optical signal and image reconstruction.The measurement matrix reported in the existing literatures is first binarized and then loaded onto the digital micro-mirror device(DMD)for optical modulation,that is,each pixel can only be modulated into on-off states.In this paper,we propose a digital grayscale modulation method for more efficient compressive sampling.On the basis of this,we demonstrate a single photon compressive imaging system.A control and counting circuit,based on field-programmable gate array(FPGA),is developed to control DMD to conduct digital grayscale modulation and count single-photon pulse output from the photomultiplier tube(PMT)simultaneously.The experimental results show that the imaging reconstruction quality can be improved by increasing the sparsity ratio properly and compressive sampling ratio(SR)of these gray-scale matrices.However,when the compressive SR and sparsity ratio are increased appropriately to a certain value,the reconstruction quality is usually saturated,and the imaging reconstruction quality of the digital grayscale modulation is better than that of binary modulation.