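Abstract: High efficiency video coding (HEVC) has high computational complexity, and fast encoding algorithms cannot guarantee that encoding finishes within a given time budget. A random-forest-based complexity control method for HEVC is therefore proposed. First, three random forest classifiers with different prediction accuracies are trained to provide multiple encoding configurations for each coding tree unit (CTU). Next, an average depth-complexity model is built to allocate complexity to each CTU. Finally, the CTU encoding configuration is chosen from the flatness, average depth, bits, and CTU-level accumulated encoding error, which completes the complexity control. Experimental results show that the proposed algorithm achieves high complexity control accuracy and outperforms current complexity control methods in rate-distortion performance.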
Funding: Funded by the Zhejiang Provincial Education Ministry (No. 2004884) and the Scientific Research Start-up Foundation of Ningbo University (No. 2004037).
Abstract: In this paper, an approach to predicting the volume of randomly shaped particles from their two-dimensional (2-D) digital images is explored. The gray-scale image of the particles is first converted to its binary counterpart using a backlighting technique. The silhouette of each particle is thus obtained, and informative features such as particle area, centroid, and shape-related descriptors are collected. Several dimensionless parameters are defined and used as regressor variables in a multiple linear regression model to predict particle volume. The regression coefficients are found by fitting to a randomly selected sample of 501 particles ranging in size from 4.75 mm to 25 mm. The model is tested against a different aggregate sample with similar statistical properties, and the error of the model-predicted batch volume is within ±2%.
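The regression step described in this abstract can be illustrated with a minimal sketch. The abstract does not list the dimensionless parameters or the fitted coefficients, so the feature rows below are purely hypothetical placeholders; the code only shows an ordinary least-squares fit with an intercept, solved through the normal equations, and is not the authors' implementation.

```python
def fit_linear(features, targets):
    """Ordinary least squares with an intercept, via the normal equations.

    features: list of rows, one row of regressor values per particle
              (hypothetical dimensionless shape parameters).
    targets:  list of measured volumes, one per particle.
    Returns [intercept, coef_1, coef_2, ...].
    """
    rows = [[1.0] + list(r) for r in features]  # prepend the intercept column
    n = len(rows[0])
    # Build A = X^T X and b = X^T y.
    a = [[sum(r[i] * r[j] for r in rows) for j in range(n)] for i in range(n)]
    b = [sum(r[i] * t for r, t in zip(rows, targets)) for i in range(n)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda k: abs(a[k][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for k in range(n):
            if k != col:
                factor = a[k][col] / a[col][col]
                a[k] = [x - factor * p for x, p in zip(a[k], a[col])]
                b[k] -= factor * b[col]
    return [b[i] / a[i][i] for i in range(n)]


def predict(coefficients, feature_row):
    """Predicted volume for one particle from its regressor values."""
    return coefficients[0] + sum(c * x for c, x in zip(coefficients[1:], feature_row))


# Synthetic data generated from volume = 1 + 2*x1 + 3*x2 (illustrative only).
sample_features = [[1, 2], [2, 1], [3, 4], [4, 3]]
sample_volumes = [9, 8, 19, 18]
coefficients = fit_linear(sample_features, sample_volumes)
```

With real data, the rows would hold the dimensionless parameters measured from each silhouette, and the batch volume would be the sum of `predict` over all particles in the batch.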
Funding: Supported by the Scientific Research Start-up Foundation of Ningbo University (No. 2004037) and the Zhejiang Provincial Foundation for Returned Overseas Students and Scholars (No. 2004884).
Abstract: In many image analysis and processing problems, discriminating the size and shape of each individual object in an aggregate pile projected in an image is an important task. It is relatively easy to distinguish these features among objects that are already separated from each other. The problem becomes more complex and challenging if the objects touch and/or overlap. This letter presents an algorithm that can be used to separate touching and overlapping objects within a 2-D image. The approach first converts the gray-scale image to its corresponding binary image and then to a 3-D topographic image using erosion operations. A template (or mask) is engineered to search the topographic surface for the saddle point, from which the segmenting orientation is determined, followed by the desired separating operation. The algorithm is tested on a real image, and the result is satisfactory and encouraging.
Funding: Supported by the National Natural Science Foundation of China (Nos. 60832003, 60902096, 61171163, 61071120) and the Scientific Research Foundation of Graduate School of Ningbo University.
Abstract: Most Image Quality Assessment (IQA) metrics consist of two processes. In the first process, a quality map of the image is measured locally. In the second process, the final quality score is derived from the quality map using a pooling strategy. Effective and significant progress has been made on the first process, while the second has usually been done in simple ways. In the pooling process, the optimal perceptual pooling weights should be determined and computed according to the Human Visual System (HVS). A reliable spatial pooling model based on the HVS is therefore an important issue worthy of study. In this paper, a new Visual Perceptual Pooling Strategy (VPPS) for IQA is presented based on the contrast sensitivity and luminance sensitivity of the HVS. Experimental results with the LIVE database show that the visual perceptual weights obtained by the proposed pooling strategy can effectively and significantly improve the performance of IQA metrics based on Mean Structural SIMilarity (MSSIM) or Phase Quantization Code (PQC). The proposed VPPS thus shows promising results for improving the performance of existing IQA metrics.
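The two-process structure described in this abstract, a local quality map followed by pooling, can be sketched as a weighted average. The abstract does not give the VPPS weight formula, so the weights below are hypothetical inputs rather than values derived from contrast or luminance sensitivity; the sketch only shows how per-location weights change the pooled score relative to plain averaging.

```python
def pool_quality(quality_map, weights):
    """Weighted spatial pooling of a local quality map.

    quality_map: flat list of local quality values (for example, local SSIM).
    weights:     flat list of non-negative perceptual weights, same length.
                 How the weights are computed is NOT specified here; they are
                 assumed to come from some visual sensitivity model.
    """
    if len(quality_map) != len(weights):
        raise ValueError("quality_map and weights must have the same length")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    return sum(q * w for q, w in zip(quality_map, weights)) / total


local_quality = [0.5, 1.0]
uniform_score = pool_quality(local_quality, [1.0, 1.0])   # plain mean pooling
weighted_score = pool_quality(local_quality, [1.0, 3.0])  # second region counts more
```

Uniform weights reduce the pooling to the simple mean that the abstract describes as the common practice, which makes the effect of any perceptual weighting easy to compare against.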
Funding: Supported by the National Natural Science Foundation of China (60672073), the Program for New Century Excellent Talents in University (NCET-06-0537), the Natural Science Foundation of Ningbo (2008A610016), and the K.C. Wong Magna Fund in Ningbo University.
Abstract: Color inconsistency between views is an important problem to be solved in multi-view video systems. A multi-view video color correction method using dynamic programming is proposed. Three-dimensional histograms are constructed with sequential conditional probability in the HSI color space. Dynamic programming is then used to find the best color mapping relation, defined as the minimum-cost path between the target image histogram and the source image histogram. Finally, a video tracking technique is applied to correct the multi-view video. Experimental results show that the proposed method obtains better subjective and objective performance in color correction.
Funding: Supported by the Natural Science Foundation of Zhejiang Province (Y1101240), the Zhejiang Scientific and Technical Key Innovation Team (2010R50009), the Natural Science Foundation of Ningbo (2011A610200, 2011A610197), and the Student Research and Innovation Training Program of Zhejiang Province (New-shoot Talents Project 2011R-405054) (A00162100400).
Abstract: Due to the limited Depth Of Field (DOF) of a microscope, regions that are not within the DOF are blurred after imaging. For micro-image fusion, the most important step is therefore to identify the blurred regions within each micro-image so as to remove their undesirable impact on the fused image. In this paper, a fusion algorithm based on a novel region growing method is proposed for micro-image fusion. The local sharpness of each micro-image is judged block by block. Blocks whose sharpness is lower than an adaptive threshold are used as seeds, and the sharpness of the neighbors of each seed is evaluated again during region growing until the blurred regions are identified completely. As the block size decreases, the region segmentation becomes more and more accurate. Finally, the micro-images are fused with pixel-wise fusion rules. The experimental results show that the proposed algorithm benefits from the novel region segmentation and obtains fused micro-images with higher sharpness than some popular image fusion methods.
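The seed-and-grow step described in this abstract can be sketched as a breadth-first region growing over a grid of per-block sharpness values. The abstract does not say how sharpness is measured, how the adaptive threshold is computed, or which neighborhood is used, so the fixed thresholds `seed_threshold` and `grow_threshold` and the 4-connected neighborhood below are assumptions made only for illustration.

```python
from collections import deque


def grow_blurred_regions(sharpness, seed_threshold, grow_threshold):
    """Return the set of (row, col) blocks judged to be blurred.

    sharpness:      2-D list of per-block sharpness values.
    seed_threshold: blocks below this value start as seeds (assumed fixed here).
    grow_threshold: neighbors below this looser value join the region (assumed).
    """
    rows, cols = len(sharpness), len(sharpness[0])
    blurred = {
        (r, c)
        for r in range(rows)
        for c in range(cols)
        if sharpness[r][c] < seed_threshold
    }
    queue = deque(blurred)
    while queue:
        r, c = queue.popleft()
        # Re-evaluate the 4-connected neighbors of every block in the region.
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and (nr, nc) not in blurred and sharpness[nr][nc] < grow_threshold:
                blurred.add((nr, nc))
                queue.append((nr, nc))
    return blurred


block_sharpness = [
    [9, 9, 9],
    [9, 2, 4],
    [9, 9, 9],
]
blurred_blocks = grow_blurred_regions(block_sharpness, seed_threshold=3, grow_threshold=5)
```

In this toy grid the center block seeds the region and its right-hand neighbor joins during growing, while the sharp border blocks stay outside. Repeating the procedure with smaller blocks would correspond to the progressive refinement the abstract mentions.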
Funding: Supported by the National Natural Science Foundation of China (Nos. 60672073, 60872094), the Program for New Century Excellent Talents in University (NCET-06-0537), and the Natural Science Foundation of Ningbo (No. 2007A610037).
Abstract: A semi-reference image quality assessment metric based on similarity measurement is proposed for the synthesized virtual viewpoint image (VVI) in a free-viewpoint television (FTV) system. The key point of the proposed metric is that it uses the resemblant information between the VVI and its neighbor view images for quality assessment, so that the metric can easily be extended to multi-semi-reference image quality assessment. The proposed metric first extracts impact factors from image features, then combines an image synthesis technique with similarity functions, in which disparity information is taken into account for registering the resemblant regions. Experiments are divided into three phases. Phase Ⅰ verifies the validity of the proposed metric using impaired images and the original reference. The experimental results show agreement between the evaluation scores and the bio-characteristics of the human visual system. Phase Ⅱ shows accordance with Phase Ⅰ when the neighbor view is taken as the reference. The proposed metric can thus serve as a full-reference metric to evaluate image quality even when the original reference is absent. Phase Ⅲ is then performed to evaluate the quality of the VVI. The evaluation scores in the experimental results are able to reflect the quality of the VVI.
Funding: Supported by the National Natural Science Foundation of China (Nos. 61171163, 61271270, 61271021, 61111140392) and the National Science and Technology Support Program (2012BAH67F01).
Abstract: To establish a stereoscopic image quality assessment method that is consistent with human visual perception, we propose an objective stereoscopic image quality assessment method. It takes into account the strong correlation and high degree of structure among the pixels of an image. The method contains two models. The first is the synthetic quality assessment of the left and right view images, which is based on human visual characteristics: we use the Singular Value Decomposition (SVD), which can represent the degree of distortion, and combine the qualities of the left and right images according to the characteristics of binocular superposition. The second is the stereoscopic perception quality assessment: exploiting the strong stability of an image's singular values, we calculate the distance between the singular values and the structural characteristic similarity of the absolute difference maps, and use the statistical value of the global error to evaluate stereoscopic perception. Finally, we combine the two models to describe the stereoscopic image quality. Experimental results show that the correlation coefficients between the proposed assessment method and human subjective perception are above 0.93, and the mean square errors are all less than 6.2, under JPEG and JP2K compression, Gaussian blurring, Gaussian white noise, H.264 coding distortion, and hybrid cross distortion. This indicates that the proposed objective stereoscopic method is consistent with human visual properties and is usable in practice.
Funding: Supported by the National Natural Science Foundation of China (No. 60472100), the Natural Science Foundation of Zhejiang Province (No. Y105577), and the Key Project of Chinese Ministry of Education (No. 206059).
Abstract: Ray-space based arbitrary viewpoint rendering, which requires no complex object segmentation or model construction, is the main technology for realizing a Free Viewpoint Video (FVV) system for complex scenes. Ray-space interpolation and compression are two key techniques for this solution. In this paper, the correlation among multiple epipolar lines in ray-space data is analyzed, and a new method of ray-space interpolation with multi-epipolar-line matching is proposed. Compared with the pixel-based and block-based matching interpolation methods, the proposed method achieves a higher Peak Signal to Noise Ratio (PSNR) in interpolating ray-space data and rendering arbitrary viewpoint images.
Funding: Supported by the National Natural Science Foundation of China (Nos. 60672073, 60872094), the Program for New Century Excellent Talents in University (NCET-06-0537), the Key Project of Chinese Ministry of Education (No. 206059), the Scientific Research Fund of Zhejiang Provincial Education Department (No. 20070962), and the Natural Science Foundation of Ningbo (No. 2008A610016).
Abstract: Color inconsistency between views is an important problem to be solved in multi-view video applications, such as free viewpoint television and other three-dimensional video systems. In this paper, a coding-oriented multi-view video color correction method is proposed that is combined with multi-view video coding. We first separate the foreground and background in the first Group of Pictures (GOP) by using the SKIP coding mode. Then, by transferring the means and standard deviations of the backgrounds, color correction is performed for each frame in the GOP, and multi-view video coding is performed and used to renew the backgrounds. Experimental results show that the proposed method obtains better performance in both color correction and multi-view video coding.
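The transfer of means and standard deviations mentioned in this abstract can be sketched for a single color channel. The abstract does not state the color space, the exact transfer formula, or how the SKIP-mode background mask is applied, so the linear mapping below (shift to the reference mean, scale by the ratio of standard deviations) and the function names are assumptions, and the background pixel lists are taken as already extracted.

```python
from statistics import fmean, pstdev


def correct_channel(pixels, source_background, reference_background):
    """Map one channel of the view being corrected toward the reference view.

    pixels:               all channel values of the frame to correct.
    source_background:    background values of that same view.
    reference_background: background values of the reference view.
    The statistics are estimated on the backgrounds only, then applied to
    every pixel of the frame (an assumed reading of the abstract).
    """
    source_mean = fmean(source_background)
    source_std = pstdev(source_background)
    reference_mean = fmean(reference_background)
    reference_std = pstdev(reference_background)
    # Fall back to a pure mean shift when the source background is flat.
    gain = reference_std / source_std if source_std > 0 else 1.0
    return [(p - source_mean) * gain + reference_mean for p in pixels]


source_bg = [10.0, 20.0, 30.0]
reference_bg = [20.0, 40.0, 60.0]
corrected = correct_channel([10.0, 20.0, 30.0], source_bg, reference_bg)
```

A full implementation would run this per channel and per frame in the GOP, and would clip the results to the valid pixel range, which this sketch leaves out.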
Funding: Supported by the Natural Science Foundation of China (Nos. 60472100, 60672073), the Program for New Century Excellent Talents in University (No. NCET-06-0537), and the Key Project of Chinese Ministry of Education (No. 206059).
Abstract: This paper presents a free viewpoint video (FVV) system based on a ray-space interpolation method. The new algorithm matches individual pixels in corresponding scanline pairs by using a dynamic programming technique. A sparse intermediate-view disparity map is first projected from the matched pixels, and the holes (occluded pixels) are filled in by propagating the disparity of neighboring background pixels. After dense view images are interpolated, an arbitrary virtual view image can easily be rendered from the dense ray-space converted from these view images. The proposed method is evaluated on the Middlebury data set and compared with other methods. Experimental results show that it obtains intermediate views of better quality and significantly reduces the computational complexity.
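Abstract: Objective: Studies show that the just noticeable distortion (JND) threshold of an image depends mainly on the luminance adaptation of the visual system, contrast masking, pattern masking, and image structure. To better study the influence of image structure on the JND threshold, a model based on sparse representation is proposed that separates structural from non-structural information, and it is applied to JND threshold estimation for natural images so that the JND threshold model agrees better with the human visual system. Methods: First, an overcomplete visual dictionary is obtained with the K-means singular value decomposition (K-SVD) algorithm. The input natural image is then sparsely represented and reconstructed with this overcomplete dictionary, yielding the structural layer and the non-structural layer of the image. For these two layers, a structural-layer JND estimation model based on luminance adaptation and contrast masking and a non-structural-layer JND estimation model based on luminance contrast and information uncertainty are designed. Finally, a nonlinear additive model that can characterize the masking effect is used to fuse the JND estimation models of the two components. Results: The proposed JND estimation model uses sparse representation to separate the structural and non-structural information of a natural image and then computes each component with a JND model suited to its characteristics, which is highly consistent with the mechanism of visual perception. Experimental results show that the proposed JND model effectively predicts the JND threshold of natural images, and the peak signal-to-noise ratio (PSNR) of the contaminated image is 3 to 5 dB higher than that of the three JND models used for comparison. Conclusion: Compared with existing models, the proposed model is more consistent with subjective human visual perception and predicts the JND threshold of natural images more effectively.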