文章分析了语音合成技术的要点,基于语音合成提出了一种视觉的语音合成算法L2W(Lip to Wav),并将其应用到身份认证当中。在GRID英文唇语数据集上的实验验证,证明了L2W的准确率能够达到78.85%,比相关算法有4.55%的提升。通过L2W合成的语...文章分析了语音合成技术的要点,基于语音合成提出了一种视觉的语音合成算法L2W(Lip to Wav),并将其应用到身份认证当中。在GRID英文唇语数据集上的实验验证,证明了L2W的准确率能够达到78.85%,比相关算法有4.55%的提升。通过L2W合成的语音与原声源的频谱距离实现基于视觉语音合成的身份认证技术。展开更多
An image fusion method combining complex contourlet transform(CCT) with nonnegative matrix factorization(NMF) is proposed in this paper.After two images are decomposed by CCT,NMF is applied to their highand low-freque...An image fusion method combining complex contourlet transform(CCT) with nonnegative matrix factorization(NMF) is proposed in this paper.After two images are decomposed by CCT,NMF is applied to their highand low-frequency components,respectively,and finally an image is synthesized.Subjective-visual-quality of the image fusion result is compared with those of the image fusion methods based on NMF and the combination of wavelet /contourlet /nonsubsampled contourlet with NMF.The experimental results are evaluated quantitatively,and the running time is also contrasted.It is shown that the proposed image fusion method can gain larger information entropy,standard deviation and mean gradient,which means that it can better integrate featured information from all source images,avoid background noise and promote space clearness in the fusion image effectively.展开更多
Since there is lack of methodology to assess the performance of defogging algorithm and the existing assessment methods have some limitations,three new methods for assessing the defogging algorithm were proposed.One w...Since there is lack of methodology to assess the performance of defogging algorithm and the existing assessment methods have some limitations,three new methods for assessing the defogging algorithm were proposed.One was using synthetic foggy image simulated by image degradation model to assess the defogging algorithm in full-reference way.In this method,the absolute difference was computed between the synthetic image with and without fog.The other two were computing the fog density of gray level image or constructing assessment system of color image from human visual perception to assess the defogging algorithm in no-reference way.For these methods,an assessment function was defined to evaluate algorithm performance from the function value.Using the defogging algorithm comparison,the experimental results demonstrate the effectiveness and reliability of the proposed methods.展开更多
Three dimensional digitization of human head is desired in many applications. In this paper, an information fusion based scheme is presented to obtain 3-D information of human head. Structured light technology is empl...Three dimensional digitization of human head is desired in many applications. In this paper, an information fusion based scheme is presented to obtain 3-D information of human head. Structured light technology is employed to measure depth. For the special reflection areas,in which the structured light stripe can not be detected directly, the shape of the structured light stripe can be calculated from the corresponding contour. By fusing the information of structured light and the contours, the problem of reflectance influence is solved, and the whole shape of head,including hair area, can be obtained. Some good results are obtained.展开更多
The paper aims to challenge non-GPS navigation problems by using visual sensors and geo-referenced images. An area-based method is proposed to estimate full navigation parameters(FNPs), including attitude, altitude an...The paper aims to challenge non-GPS navigation problems by using visual sensors and geo-referenced images. An area-based method is proposed to estimate full navigation parameters(FNPs), including attitude, altitude and horizontal position, for unmanned aerial vehicle(UAV) navigation. Our method is composed of three main modules: geometric transfer function, local normalized sobel energy image(LNSEI) based objective function and simplex-simulated annealing(SSA) based optimization algorithm. The adoption of relatively rich scene information and LNSEI, makes it possible to yield a solution robustly even in the presence of very noisy cases, such as multi-modal and/or multi-temporal images that differ in the type of visual sensor, season, illumination, weather, and so on, and also to handle the sparsely textured regions where features are barely detected or matched. Simulation experiments using many synthetic images clearly support noise resistance and estimation accuracy, and experimental results using 2367 real images show the maximum estimation error of 5.16(meter) for horizontal position, 9.72(meter) for altitude and 0.82(degree) for attitude.展开更多
文摘文章分析了语音合成技术的要点,基于语音合成提出了一种视觉的语音合成算法L2W(Lip to Wav),并将其应用到身份认证当中。在GRID英文唇语数据集上的实验验证,证明了L2W的准确率能够达到78.85%,比相关算法有4.55%的提升。通过L2W合成的语音与原声源的频谱距离实现基于视觉语音合成的身份认证技术。
基金Supported by National Natural Science Foundation of China (No. 60872065)
文摘An image fusion method combining complex contourlet transform(CCT) with nonnegative matrix factorization(NMF) is proposed in this paper.After two images are decomposed by CCT,NMF is applied to their highand low-frequency components,respectively,and finally an image is synthesized.Subjective-visual-quality of the image fusion result is compared with those of the image fusion methods based on NMF and the combination of wavelet /contourlet /nonsubsampled contourlet with NMF.The experimental results are evaluated quantitatively,and the running time is also contrasted.It is shown that the proposed image fusion method can gain larger information entropy,standard deviation and mean gradient,which means that it can better integrate featured information from all source images,avoid background noise and promote space clearness in the fusion image effectively.
基金Projects(91220301,61175064,61273314)supported by the National Natural Science Foundation of ChinaProject(126648)supported by the Postdoctoral Science Foundation of Central South University,ChinaProject(2012170301)supported by the New Teacher Fund for School of Information Science and Engineering,Central South University,China
文摘Since there is lack of methodology to assess the performance of defogging algorithm and the existing assessment methods have some limitations,three new methods for assessing the defogging algorithm were proposed.One was using synthetic foggy image simulated by image degradation model to assess the defogging algorithm in full-reference way.In this method,the absolute difference was computed between the synthetic image with and without fog.The other two were computing the fog density of gray level image or constructing assessment system of color image from human visual perception to assess the defogging algorithm in no-reference way.For these methods,an assessment function was defined to evaluate algorithm performance from the function value.Using the defogging algorithm comparison,the experimental results demonstrate the effectiveness and reliability of the proposed methods.
基金Supported by the National Natural Science Foundation of China(69775022) and 863 Programme of China(863-306-ZT04-06-3)
文摘Three dimensional digitization of human head is desired in many applications. In this paper, an information fusion based scheme is presented to obtain 3-D information of human head. Structured light technology is employed to measure depth. For the special reflection areas,in which the structured light stripe can not be detected directly, the shape of the structured light stripe can be calculated from the corresponding contour. By fusing the information of structured light and the contours, the problem of reflectance influence is solved, and the whole shape of head,including hair area, can be obtained. Some good results are obtained.
文摘The paper aims to challenge non-GPS navigation problems by using visual sensors and geo-referenced images. An area-based method is proposed to estimate full navigation parameters(FNPs), including attitude, altitude and horizontal position, for unmanned aerial vehicle(UAV) navigation. Our method is composed of three main modules: geometric transfer function, local normalized sobel energy image(LNSEI) based objective function and simplex-simulated annealing(SSA) based optimization algorithm. The adoption of relatively rich scene information and LNSEI, makes it possible to yield a solution robustly even in the presence of very noisy cases, such as multi-modal and/or multi-temporal images that differ in the type of visual sensor, season, illumination, weather, and so on, and also to handle the sparsely textured regions where features are barely detected or matched. Simulation experiments using many synthetic images clearly support noise resistance and estimation accuracy, and experimental results using 2367 real images show the maximum estimation error of 5.16(meter) for horizontal position, 9.72(meter) for altitude and 0.82(degree) for attitude.