Light field 3D display technology is considered a revolutionary technology to address the critical visual fatigue issues in the existing 3D displays.Tabletop light field 3D display provides a brand-new display form th...Light field 3D display technology is considered a revolutionary technology to address the critical visual fatigue issues in the existing 3D displays.Tabletop light field 3D display provides a brand-new display form that satisfies multi-user shared viewing and collaborative works,and it is poised to become a potential alternative to the traditional wall and portable display forms.However,a large radial viewing angle and correct radial perspective and parallax are still out of reach for most current tabletop light field 3D displays due to the limited amount of spatial information.To address the viewing angle and perspective issues,a novel integral imaging-based tabletop light field 3D display with a simple flat-panel structure is proposed and developed by applying a compound lens array,two spliced 8K liquid crystal display panels,and a light shaping diffuser screen.The compound lens array is designed to be composed of multiple three-piece compound lens units by employing a reverse design scheme,which greatly extends the radial viewing angle in the case of a limited amount of spatial information and balances other important 3D display parameters.The proposed display has a radial viewing angle of 68.7°in a large display size of 43.5 inches,which is larger than the conventional tabletop light field 3D displays.The radial perspective and parallax are correct,and high-resolution 3D images can be reproduced in large radial viewing positions.We envision that this proposed display opens up possibility for redefining the display forms of consumer electronics.展开更多
Tabletop integral imaging display with a more realistic and immersive experience has always been a hot spot in three-dimensional imaging technology,widely used in biomedical imaging and visualization to enhance medica...Tabletop integral imaging display with a more realistic and immersive experience has always been a hot spot in three-dimensional imaging technology,widely used in biomedical imaging and visualization to enhance medical diagnosis.However,the traditional structural characteristics of integral imaging display inevitably introduce the flipping effect outside the effective viewing angle.Here,a full-parallax tabletop integral imaging display without the flipping effect based on space-multiplexed voxel screen and compound lens array is demonstrated,and two holographic functional screens with different parameters are optically designed and fabricated.To eliminate the flipping effect in the reconstruction process,the space-multiplexed voxel screen consisting of a projector array and the holographic functional screen is presented to constrain light beams passing through the corresponding lens.To greatly promote imaging quality within the viewing area,the aspherical structure of the compound lens is optimized to balance the aberrations.It cooperates with the holographic functional screen to modulate the light field spatial distribution.Compared with the simulation results,the distortion rate of the imaging display is reduced to less than 9%from more than 30%.In the experiment,the floating high-quality reconstructed three-dimensional image without the flipping effect can be observed with the correct 3D perception at 96°×96°viewing angle,where 44,100 viewpoints are employed.展开更多
With the continuous calls for energy conservation and emission reduction in recent years,more and more people choose walking as their travel mode.The improvement of the quality of street space will directly affect peo...With the continuous calls for energy conservation and emission reduction in recent years,more and more people choose walking as their travel mode.The improvement of the quality of street space will directly affect people's willingness to walk.By sorting out relevant research on street quality measurement,extracting quality keywords with high frequency of reference as impact factors,and using street view image data from different eras,semantic segmentation technology,factor analysis,and questionnaire survey methods,this paper evaluates the street quality of Jingshan East Street,Dongcheng District,Beijing,further explores the impact of different factors on street quality,and analyzes possible ways to improve it.展开更多
In many ultrafast imaging applications, the reduced field-of-view(r FOV) technique is often used to enhance the spatial resolution and field inhomogeneity immunity of the images. The stationary-phase characteristic ...In many ultrafast imaging applications, the reduced field-of-view(r FOV) technique is often used to enhance the spatial resolution and field inhomogeneity immunity of the images. The stationary-phase characteristic of the spatiotemporallyencoded(SPEN) method offers an inherent applicability to r FOV imaging. In this study, a flexible r FOV imaging method is presented and the superiority of the SPEN approach in r FOV imaging is demonstrated. The proposed method is validated with phantom and in vivo rat experiments, including cardiac imaging and contrast-enhanced perfusion imaging. For comparison, the echo planar imaging(EPI) experiments with orthogonal RF excitation are also performed. The results show that the signal-to-noise ratios of the images acquired by the proposed method can be higher than those obtained with the r FOV EPI. Moreover, the proposed method shows better performance in the cardiac imaging and perfusion imaging of rat kidney, and it can scan one or more regions of interest(ROIs) with high spatial resolution in a single shot. It might be a favorable solution to ultrafast imaging applications in cases with severe susceptibility heterogeneities, such as cardiac imaging and perfusion imaging. Furthermore, it might be promising in applications with separate ROIs, such as mammary and limb imaging.展开更多
We study the influence of limited-view scanning on the depth imaging of photoacoustic tomography. The situation, in which absorbers are located at different depths with respect to the limited-view scanning trajectory,...We study the influence of limited-view scanning on the depth imaging of photoacoustic tomography. The situation, in which absorbers are located at different depths with respect to the limited-view scanning trajectory, is called depth imaging and is investigated in this paper. The results show that limited-view scanning causes the reconstructed intensity of deep absorbers to be weaker than that of shallow ones and that deep absorbers will be invisible if the scanning range is too small. The concept of effective scanning angle is proposed to analyse that phenomenon. We find that an effective scanning angle can well predict the relationship between scanning angle and the intensity ratio of absorbers. In addition, limited-view scanning is employed to improve image quality.展开更多
In this paper a millimeter-wave (MMW) squint indirect holographic method is presented, which is suitable for imaging with a large field-of-view. The proposed system employs the squint operation mode to remove the ba...In this paper a millimeter-wave (MMW) squint indirect holographic method is presented, which is suitable for imaging with a large field-of-view. The proposed system employs the squint operation mode to remove the background and twin- image interferences, which achieves a similar effect to off-axis holography but leaves out the large-aperture quasi-optical component. The translational scanning manner enables a large field of view and ensures the image uniformity, which is difficult to realize in off-axis holography. In addition, a corresponding imaging algorithm for the presented scheme is developed to reconstruct the image from the recorded hologram. Some imaging results on typical objects, obtained with electromagnetic simulation, demonstrate good performance of the imaging scheme and validate the effectiveness of the image reconstruction algorithm.展开更多
Rapid and accurate identification of potential structural deficiencies is a crucial task in evaluating seismic vulnerability of large building inventories in a region. In the case of multi-story structures, abrupt ver...Rapid and accurate identification of potential structural deficiencies is a crucial task in evaluating seismic vulnerability of large building inventories in a region. In the case of multi-story structures, abrupt vertical variations of story stiffness are known to significantly increase the likelihood of collapse during moderate or severe earthquakes. Identifying and retrofitting buildings with such irregularities—generally termed as soft-story buildings—is, therefore, vital in earthquake preparedness and loss mitigation efforts. Soft-story building identification through conventional means is a labor-intensive and time-consuming process. In this study, an automated procedure was devised based on deep learning techniques for identifying soft-story buildings from street-view images at a regional scale. A database containing a large number of building images and a semi-automated image labeling approach that effectively annotates new database entries was developed for developing the deep learning model. Extensive computational experiments were carried out to examine the effectiveness of the proposed procedure, and to gain insights into automated soft-story building identification.展开更多
This paper presents a robust image feature that can be used to automatically establish match correspondences between aerial images of suburban areas with large view variations. Unlike most commonly used invariant imag...This paper presents a robust image feature that can be used to automatically establish match correspondences between aerial images of suburban areas with large view variations. Unlike most commonly used invariant image features, this feature is view variant. The geometrical structure of the feature allows predicting its visual appearance according to the observer’s view. This feature is named 2EC (2 Edges and a Corner) as it utilizes two line segments or edges and their intersection or corner. These lines are constrained to correspond to the boundaries of rooftops. The description of each feature includes the two edges’ length, their intersection, orientation, and the image patch surrounded by a parallelogram that is constructed with the two edges. Potential match candidates are obtained by comparing features, while accounting for the geometrical changes that are expected due to large view variation. Once the putative matches are obtained, the outliers are filtered out using a projective matrix optimization method. Based on the results of the optimization process, a second round of matching is conducted within a more confined search space that leads to a more accurate match establishment. We demonstrate how establishing match correspondences using these features lead to computing more accurate camera parameters and fundamental matrix and therefore more accurate image registration and 3D reconstruction.展开更多
A new method of view synthesis is proposed based on Delaunay triangulation. The first step of this method is making the Delaunay triangulation of 2 reference images. Secondly, matching the image points using the epipo...A new method of view synthesis is proposed based on Delaunay triangulation. The first step of this method is making the Delaunay triangulation of 2 reference images. Secondly, matching the image points using the epipolar geometry constraint. Finally, constructing the third view according to pixel transferring under the trilinear constraint. The method gets rid of the classic time consuming dense matching technique and takes advantage of Delaunay triangulation. So it can not only save the computation time but also enhance the quality of the synthesized view. The significance of this method is that it can be used directly in the fields of video coding, image compressing and virtual reality.展开更多
The view prediction is an important step in stereo/multiview video coding, wherein, disparity estil mation (DE) is a key and difficult operation. DE algorithms usually require enormous computing power. A fast DE alg...The view prediction is an important step in stereo/multiview video coding, wherein, disparity estil mation (DE) is a key and difficult operation. DE algorithms usually require enormous computing power. A fast DE algorithm based on Delaunay triangulation (DT) is proposed. First, a flexible and content adaptive DT mesh is established on a target frame by an iterative split-merge algorithm. Second, DE on DT nodes are performed in a three-stage algorithm, which gives the majority of nodes a good estimate of the disparity vectors (DV), by removing unreliable nodes due to occlusion, and forcing the minority of 'problematic nodes' to be searched again, within their umbrella-shaped polygon, to the best. Finally, the target view is predicted by using affine transformation. Experimental results show that the proposed algorithm can give a satisfactory DE with less computational cost.展开更多
Launched on December 28,2016,the Super View-1 satellite has operated for over 70 days at an altitude of 530km.The initial results of in-orbit commissioning show that the images from the Super View-1 satellite are clea...Launched on December 28,2016,the Super View-1 satellite has operated for over 70 days at an altitude of 530km.The initial results of in-orbit commissioning show that the images from the Super View-1 satellite are clear with radiation resolution reaching 11 bits pixel.The geospatial positioning accuracy without ground control pointing is5-8 m,the elevation relative accuracy is1 m.The maximum single scene can be60 km×70 km,enabling some 900。展开更多
With the development of the compressive sensing theory, the image reconstruction from the projections viewed in limited angles is one of the hot problems in the research of computed tomography technology. This paper d...With the development of the compressive sensing theory, the image reconstruction from the projections viewed in limited angles is one of the hot problems in the research of computed tomography technology. This paper develops an iterative algorithm for image reconstruction, which can fit the most cases. This method gives an image reconstruction flow with the difference image vector, which is based on the concept that the difference image vector between the reconstructed and the reference image is sparse enough. Then the l1-norm minimization method is used to reconstruct the difference vector to recover the image for flat subjects in limited angles. The algorithm has been tested with a thin planar phantom and a real object in limited-view projection data. Moreover, all the studies showed the satisfactory results in accuracy at a rather high reconstruction speed.展开更多
A semi-reference image quality assessment metric based on similarity measurement for synthesized virtual viewpoint image (VVI) in free-viewpoint television system (FFV) is proposed in this paper. The key point of ...A semi-reference image quality assessment metric based on similarity measurement for synthesized virtual viewpoint image (VVI) in free-viewpoint television system (FFV) is proposed in this paper. The key point of the proposed metric is taking resemblant information between VVI and its neighbor view images for quality assessment to make our metric to be extended to multi-semi-reference image quality assessment easily. The proposed metric first extracts impact factors from image features, then combines an image synthesis technique and similarity functions, in which, disparity information are taken into account for registering the resemblant regions. Experiments are divided into three phases. Phase I is to verify the validation of the proposed metric by taking impaired images and original reference into account. The experimental results show the agreement between evaluation scores and bio-characteristic of human visual system. Phase II shows the accordance with Phase I by taking neighbor view as reference. The proposed metric can be taken as a full reference one to evaluate the image quality even though the original reference is absent. Phase III is then performed to evaluate the quality of WI. Evaluation scores in the experimental results are able to evaluate the quality of VVI.展开更多
For the pre-acquired serial images from camera lengthways motion, a view synthesis algorithm based on epipolar geometry constraint is proposed in this paper. It uses the whole matching and maintaining order characters...For the pre-acquired serial images from camera lengthways motion, a view synthesis algorithm based on epipolar geometry constraint is proposed in this paper. It uses the whole matching and maintaining order characters of the epipolar line, Fourier transform and dynamic programming matching theories, thus truly synthesizing the destination image of current viewpoint. Through the combination of Fourier transform, epipolar geometry constraint and dynamic programming matching, the circumference distortion problem resulting from conventional view synthesis approaches is effectively avoided. The detailed implementation steps of this algorithm are given, and some running instances are presented to illustrate the results.展开更多
Background In this study, we propose view interpolation networks to reproduce changes in the brightness of an object′s surface depending on the viewing direction, which is important for reproducing the material appea...Background In this study, we propose view interpolation networks to reproduce changes in the brightness of an object′s surface depending on the viewing direction, which is important for reproducing the material appearance of a real object. Method We used an original and modified version of U-Net for image transformation. The networks were trained to generate images from the intermediate viewpoints of four cameras placed at the corners of a square. We conducted an experiment using with three different combinations of methods and training data formats. Result We determined that inputting the coordinates of the viewpoints together with the four camera images and using images from random viewpoints as the training data produces the best results.展开更多
The growing demand for current and precise geographic information that pertains to urban areas has given rise to a significant interest in digital surface models that exhibit a high level of detail. Traditional method...The growing demand for current and precise geographic information that pertains to urban areas has given rise to a significant interest in digital surface models that exhibit a high level of detail. Traditional methods for creating digital surface models are insufficient to reflect the details of earth’s features. These models only represent three-dimensional objects in a single texture and fail to offer a realistic depiction of the real world. Furthermore, the need for current and precise geographic information regarding urban areas has been increasing significantly. This study proposes a new technique to address this problem, which involves integrating remote sensing, Geographic Information Systems (GIS), and Architecture Environment software environments to generate a detailed three-dimensional model. The processing of this study starts with: 1) Downloading high-resolution satellite imagery; 2) Collecting ground truth datasets from fieldwork; 3) Imaging nose removing; 4) Generating a Two-dimensional Model to create a digital surface model in GIS using the extracted building outlines; 5) Converting the model into multi-patch layers to construct a 3D model for each object separately. The results show that the 3D model obtained through this method is highly detailed and effective for various applications, including environmental studies, urban development, expansion planning, and shape understanding tasks.展开更多
基金We are grateful for financial supports from National Key R&D Program of China(Grant No.2021YFB2802300)the National Natural Science Foundation of China(Grant Nos.62105014,62105016,and 62020106010)。
文摘Light field 3D display technology is considered a revolutionary technology to address the critical visual fatigue issues in the existing 3D displays.Tabletop light field 3D display provides a brand-new display form that satisfies multi-user shared viewing and collaborative works,and it is poised to become a potential alternative to the traditional wall and portable display forms.However,a large radial viewing angle and correct radial perspective and parallax are still out of reach for most current tabletop light field 3D displays due to the limited amount of spatial information.To address the viewing angle and perspective issues,a novel integral imaging-based tabletop light field 3D display with a simple flat-panel structure is proposed and developed by applying a compound lens array,two spliced 8K liquid crystal display panels,and a light shaping diffuser screen.The compound lens array is designed to be composed of multiple three-piece compound lens units by employing a reverse design scheme,which greatly extends the radial viewing angle in the case of a limited amount of spatial information and balances other important 3D display parameters.The proposed display has a radial viewing angle of 68.7°in a large display size of 43.5 inches,which is larger than the conventional tabletop light field 3D displays.The radial perspective and parallax are correct,and high-resolution 3D images can be reproduced in large radial viewing positions.We envision that this proposed display opens up possibility for redefining the display forms of consumer electronics.
基金The Basic Research Fund of Central-Level Nonprofit Scientific Research Institutes(No.TKS20220304)The Key Research and Development Projects of Guangxi Science and Technology Department(No.2021AB05087).
文摘Tabletop integral imaging display with a more realistic and immersive experience has always been a hot spot in three-dimensional imaging technology,widely used in biomedical imaging and visualization to enhance medical diagnosis.However,the traditional structural characteristics of integral imaging display inevitably introduce the flipping effect outside the effective viewing angle.Here,a full-parallax tabletop integral imaging display without the flipping effect based on space-multiplexed voxel screen and compound lens array is demonstrated,and two holographic functional screens with different parameters are optically designed and fabricated.To eliminate the flipping effect in the reconstruction process,the space-multiplexed voxel screen consisting of a projector array and the holographic functional screen is presented to constrain light beams passing through the corresponding lens.To greatly promote imaging quality within the viewing area,the aspherical structure of the compound lens is optimized to balance the aberrations.It cooperates with the holographic functional screen to modulate the light field spatial distribution.Compared with the simulation results,the distortion rate of the imaging display is reduced to less than 9%from more than 30%.In the experiment,the floating high-quality reconstructed three-dimensional image without the flipping effect can be observed with the correct 3D perception at 96°×96°viewing angle,where 44,100 viewpoints are employed.
文摘With the continuous calls for energy conservation and emission reduction in recent years,more and more people choose walking as their travel mode.The improvement of the quality of street space will directly affect people's willingness to walk.By sorting out relevant research on street quality measurement,extracting quality keywords with high frequency of reference as impact factors,and using street view image data from different eras,semantic segmentation technology,factor analysis,and questionnaire survey methods,this paper evaluates the street quality of Jingshan East Street,Dongcheng District,Beijing,further explores the impact of different factors on street quality,and analyzes possible ways to improve it.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.11474236,81171331,and U1232212)
文摘In many ultrafast imaging applications, the reduced field-of-view(r FOV) technique is often used to enhance the spatial resolution and field inhomogeneity immunity of the images. The stationary-phase characteristic of the spatiotemporallyencoded(SPEN) method offers an inherent applicability to r FOV imaging. In this study, a flexible r FOV imaging method is presented and the superiority of the SPEN approach in r FOV imaging is demonstrated. The proposed method is validated with phantom and in vivo rat experiments, including cardiac imaging and contrast-enhanced perfusion imaging. For comparison, the echo planar imaging(EPI) experiments with orthogonal RF excitation are also performed. The results show that the signal-to-noise ratios of the images acquired by the proposed method can be higher than those obtained with the r FOV EPI. Moreover, the proposed method shows better performance in the cardiac imaging and perfusion imaging of rat kidney, and it can scan one or more regions of interest(ROIs) with high spatial resolution in a single shot. It might be a favorable solution to ultrafast imaging applications in cases with severe susceptibility heterogeneities, such as cardiac imaging and perfusion imaging. Furthermore, it might be promising in applications with separate ROIs, such as mammary and limb imaging.
基金Project supported by the National Basic Research Program of China(Grant No.2012CB921504)the National Natural Science Foundation of China(Grant Nos.10874088,10904069,and 11028408)the Natural Science Foundation of Jiangsu Province,China(Grant No.SBK201021985)
文摘We study the influence of limited-view scanning on the depth imaging of photoacoustic tomography. The situation, in which absorbers are located at different depths with respect to the limited-view scanning trajectory, is called depth imaging and is investigated in this paper. The results show that limited-view scanning causes the reconstructed intensity of deep absorbers to be weaker than that of shallow ones and that deep absorbers will be invisible if the scanning range is too small. The concept of effective scanning angle is proposed to analyse that phenomenon. We find that an effective scanning angle can well predict the relationship between scanning angle and the intensity ratio of absorbers. In addition, limited-view scanning is employed to improve image quality.
基金Project supported by the National Natural Science Foundation of China (Grant Nos.11174280,60990323,and 60990320)the Knowledge Innovation Program of the Chinese Academy of Sciences (Grant No.YYYJ-1123)
文摘In this paper a millimeter-wave (MMW) squint indirect holographic method is presented, which is suitable for imaging with a large field-of-view. The proposed system employs the squint operation mode to remove the background and twin- image interferences, which achieves a similar effect to off-axis holography but leaves out the large-aperture quasi-optical component. The translational scanning manner enables a large field of view and ensures the image uniformity, which is difficult to realize in off-axis holography. In addition, a corresponding imaging algorithm for the presented scheme is developed to reconstruct the image from the recorded hologram. Some imaging results on typical objects, obtained with electromagnetic simulation, demonstrate good performance of the imaging scheme and validate the effectiveness of the image reconstruction algorithm.
基金supported by the US National Science Foundation under Grant No. 1612843. NHERI Design Safe (Rathje et al., 2017)Texas Advanced Computing Center (TACC)。
文摘Rapid and accurate identification of potential structural deficiencies is a crucial task in evaluating seismic vulnerability of large building inventories in a region. In the case of multi-story structures, abrupt vertical variations of story stiffness are known to significantly increase the likelihood of collapse during moderate or severe earthquakes. Identifying and retrofitting buildings with such irregularities—generally termed as soft-story buildings—is, therefore, vital in earthquake preparedness and loss mitigation efforts. Soft-story building identification through conventional means is a labor-intensive and time-consuming process. In this study, an automated procedure was devised based on deep learning techniques for identifying soft-story buildings from street-view images at a regional scale. A database containing a large number of building images and a semi-automated image labeling approach that effectively annotates new database entries was developed for developing the deep learning model. Extensive computational experiments were carried out to examine the effectiveness of the proposed procedure, and to gain insights into automated soft-story building identification.
文摘This paper presents a robust image feature that can be used to automatically establish match correspondences between aerial images of suburban areas with large view variations. Unlike most commonly used invariant image features, this feature is view variant. The geometrical structure of the feature allows predicting its visual appearance according to the observer’s view. This feature is named 2EC (2 Edges and a Corner) as it utilizes two line segments or edges and their intersection or corner. These lines are constrained to correspond to the boundaries of rooftops. The description of each feature includes the two edges’ length, their intersection, orientation, and the image patch surrounded by a parallelogram that is constructed with the two edges. Potential match candidates are obtained by comparing features, while accounting for the geometrical changes that are expected due to large view variation. Once the putative matches are obtained, the outliers are filtered out using a projective matrix optimization method. Based on the results of the optimization process, a second round of matching is conducted within a more confined search space that leads to a more accurate match establishment. We demonstrate how establishing match correspondences using these features lead to computing more accurate camera parameters and fundamental matrix and therefore more accurate image registration and 3D reconstruction.
文摘A new method of view synthesis is proposed based on Delaunay triangulation. The first step of this method is making the Delaunay triangulation of 2 reference images. Secondly, matching the image points using the epipolar geometry constraint. Finally, constructing the third view according to pixel transferring under the trilinear constraint. The method gets rid of the classic time consuming dense matching technique and takes advantage of Delaunay triangulation. So it can not only save the computation time but also enhance the quality of the synthesized view. The significance of this method is that it can be used directly in the fields of video coding, image compressing and virtual reality.
基金supported by the National Natural Science Foundation of China (60472083 60872141)
文摘The view prediction is an important step in stereo/multiview video coding, wherein, disparity estil mation (DE) is a key and difficult operation. DE algorithms usually require enormous computing power. A fast DE algorithm based on Delaunay triangulation (DT) is proposed. First, a flexible and content adaptive DT mesh is established on a target frame by an iterative split-merge algorithm. Second, DE on DT nodes are performed in a three-stage algorithm, which gives the majority of nodes a good estimate of the disparity vectors (DV), by removing unreliable nodes due to occlusion, and forcing the minority of 'problematic nodes' to be searched again, within their umbrella-shaped polygon, to the best. Finally, the target view is predicted by using affine transformation. Experimental results show that the proposed algorithm can give a satisfactory DE with less computational cost.
文摘Launched on December 28,2016,the Super View-1 satellite has operated for over 70 days at an altitude of 530km.The initial results of in-orbit commissioning show that the images from the Super View-1 satellite are clear with radiation resolution reaching 11 bits pixel.The geospatial positioning accuracy without ground control pointing is5-8 m,the elevation relative accuracy is1 m.The maximum single scene can be60 km×70 km,enabling some 900。
基金Project supported by the National Basic Research Program of China(Grant No.2006CB7057005)the National High Technology Research and Development Program of China(Grant No.2009AA012200)the National Natural Science Foundation of China (Grant No.60672104)
文摘With the development of the compressive sensing theory, the image reconstruction from the projections viewed in limited angles is one of the hot problems in the research of computed tomography technology. This paper develops an iterative algorithm for image reconstruction, which can fit the most cases. This method gives an image reconstruction flow with the difference image vector, which is based on the concept that the difference image vector between the reconstructed and the reference image is sparse enough. Then the l1-norm minimization method is used to reconstruct the difference vector to recover the image for flat subjects in limited angles. The algorithm has been tested with a thin planar phantom and a real object in limited-view projection data. Moreover, all the studies showed the satisfactory results in accuracy at a rather high reconstruction speed.
基金Supported by the National Natural Science Foundation of China (No. 60672073,60872094)the Program for New Century Excellent Talents in University (NCET-06-0537)the Natural Science Foundation of Ningbo (No. 2007A610037).
文摘A semi-reference image quality assessment metric based on similarity measurement for synthesized virtual viewpoint image (VVI) in free-viewpoint television system (FFV) is proposed in this paper. The key point of the proposed metric is taking resemblant information between VVI and its neighbor view images for quality assessment to make our metric to be extended to multi-semi-reference image quality assessment easily. The proposed metric first extracts impact factors from image features, then combines an image synthesis technique and similarity functions, in which, disparity information are taken into account for registering the resemblant regions. Experiments are divided into three phases. Phase I is to verify the validation of the proposed metric by taking impaired images and original reference into account. The experimental results show the agreement between evaluation scores and bio-characteristic of human visual system. Phase II shows the accordance with Phase I by taking neighbor view as reference. The proposed metric can be taken as a full reference one to evaluate the image quality even though the original reference is absent. Phase III is then performed to evaluate the quality of WI. Evaluation scores in the experimental results are able to evaluate the quality of VVI.
文摘For the pre-acquired serial images from camera lengthways motion, a view synthesis algorithm based on epipolar geometry constraint is proposed in this paper. It uses the whole matching and maintaining order characters of the epipolar line, Fourier transform and dynamic programming matching theories, thus truly synthesizing the destination image of current viewpoint. Through the combination of Fourier transform, epipolar geometry constraint and dynamic programming matching, the circumference distortion problem resulting from conventional view synthesis approaches is effectively avoided. The detailed implementation steps of this algorithm are given, and some running instances are presented to illustrate the results.
文摘Background In this study, we propose view interpolation networks to reproduce changes in the brightness of an object′s surface depending on the viewing direction, which is important for reproducing the material appearance of a real object. Method We used an original and modified version of U-Net for image transformation. The networks were trained to generate images from the intermediate viewpoints of four cameras placed at the corners of a square. We conducted an experiment using with three different combinations of methods and training data formats. Result We determined that inputting the coordinates of the viewpoints together with the four camera images and using images from random viewpoints as the training data produces the best results.
文摘The growing demand for current and precise geographic information that pertains to urban areas has given rise to a significant interest in digital surface models that exhibit a high level of detail. Traditional methods for creating digital surface models are insufficient to reflect the details of earth’s features. These models only represent three-dimensional objects in a single texture and fail to offer a realistic depiction of the real world. Furthermore, the need for current and precise geographic information regarding urban areas has been increasing significantly. This study proposes a new technique to address this problem, which involves integrating remote sensing, Geographic Information Systems (GIS), and Architecture Environment software environments to generate a detailed three-dimensional model. The processing of this study starts with: 1) Downloading high-resolution satellite imagery; 2) Collecting ground truth datasets from fieldwork; 3) Imaging nose removing; 4) Generating a Two-dimensional Model to create a digital surface model in GIS using the extracted building outlines; 5) Converting the model into multi-patch layers to construct a 3D model for each object separately. The results show that the 3D model obtained through this method is highly detailed and effective for various applications, including environmental studies, urban development, expansion planning, and shape understanding tasks.