Historical architecture is an important carrier of cultural and historical heritage in a country and region,and its protection and restoration work plays a crucial role in the inheritance of cultural heritage.However,...Historical architecture is an important carrier of cultural and historical heritage in a country and region,and its protection and restoration work plays a crucial role in the inheritance of cultural heritage.However,the damage and destruction of buildings urgently need to be repaired due to the ancient age of historical buildings and the influence of natural environment and human factors.Therefore,an artificial intelligence repair technology based on three-dimensional(3D)point cloud(PC)reconstruction and generative adversarial networks(GANs)was proposed to improve the precision and efficiency of repair work.First,in-depth research on the principles and algorithms of 3D PC data processing and GANs should be conducted.Second,a digital restoration frameworkwas constructed by combining these two artificial intelligence technologies to achieve precise and efficient restoration of historical buildings through continuous adversarial learning processes.The experimental results showed that the errors in the restoration of palace buildings,defense walls,pagodas,altars,temples,and mausoleums were 0.17,0.12,0.13,0.11,and 0.09,respectively.The technique can significantly reduce the error while maintaining the high-precision repair effect.This technology with artificial intelligence as the core has excellent accuracy and stability in the digital restoration.It provides a new technical means for the digital restoration of historical buildings and has important practical significance for the protection of cultural heritage.展开更多
It is a research subject in computer vision to 3D reconstruction of an object represented by a single 2D line drawing. Previous works on 3D reconstruction from 2D line drawings focus on objects with lines, plane, view...It is a research subject in computer vision to 3D reconstruction of an object represented by a single 2D line drawing. Previous works on 3D reconstruction from 2D line drawings focus on objects with lines, plane, view, and so on. This paper mainly studies the 3D reconstruction from 2D line drawings. Besides, a new approach is proposed: it is that for the research of the point coordinates of 2D line drawings, so as to achieve the object reconstruction by the reconstruction of point coordinates. The reconstruction process includes: (1) the collection of point coordinates (X,Y) of 2D line drawings; (2) the derivation of mathematical formula about the reconstruction of the point of 2D line drawings, and calculating the corresponding point of the 3D coordinates; (3) the regeneration of 3D graphics with 3D points; (4) analyze error by the proportional of parallel of axonometric projection, in order to prove the accuracy of the method.展开更多
Background Three-dimensional(3D)building models with unambiguous roof plane geometry parameters,roof structure units,and linked topology provide essential data for many applications related to human activities in urba...Background Three-dimensional(3D)building models with unambiguous roof plane geometry parameters,roof structure units,and linked topology provide essential data for many applications related to human activities in urban environments.The task of 3D reconstruction from point clouds is still in the development phase,especially the recognition and interpretation of roof topological structures.Methods This study proposes a novel visual perception-based approach to automatically decompose and reconstruct building point clouds into meaningful and simple parametric structures,while the associated mutual relationships between the roof plane geometry and roof structure units are expressed by a hierarchical topology tree.First,a roof plane extraction is performed by a multi-label graph cut energy optimization framework and a roof structure graph(RSG)model is then constructed to describe the roof topological geometry with common adjacency,symmetry,and convexity rules.Moreover,a progressive roof decomposition and refinement are performed,generating a hierarchical representation of the 3D roof structure models.Finally,a visual plane fitted residual or area constraint process is adopted to generate the RSG model with different levels of details.Results Two airborne laser scanning datasets with different point densities and roof styles were tested,and the performance evaluation metrics were obtained by International Society for Photogrammetry and Remote Sensing,achieving a correctness and accuracy of 97.7%and 0.29m,respectively.Conclusions The standardized assessment results demonstrate the effectiveness and robustness of the proposed approach,showing its ability to generate a variety of structural models,even with missing data.展开更多
In order to find better simplicity measurements for 3D object recognition, a new set of local regularities is developed and tested in a stepwise 3D reconstruction method, including localized minimizing standard deviat...In order to find better simplicity measurements for 3D object recognition, a new set of local regularities is developed and tested in a stepwise 3D reconstruction method, including localized minimizing standard deviation of angles(L-MSDA), localized minimizing standard deviation of segment magnitudes(L-MSDSM), localized minimum standard deviation of areas of child faces (L-MSDAF), localized minimum sum of segment magnitudes of common edges (L-MSSM), and localized minimum sum of areas of child face (L-MSAF). Based on their effectiveness measurements in terms of form and size distortions, it is found that when two local regularities: L-MSDA and L-MSDSM are combined together, they can produce better performance. In addition, the best weightings for them to work together are identified as 10% for L-MSDSM and 90% for L-MSDA. The test results show that the combined usage of L-MSDA and L-MSDSM with identified weightings has a potential to be applied in other optimization based 3D recognition methods to improve their efficacy and robustness.展开更多
Repetitive structures of a building share features in terms of geometries and appearance and,therefore,the 3D information for these structures can be transferred from one specification to another for the purpose of 3D...Repetitive structures of a building share features in terms of geometries and appearance and,therefore,the 3D information for these structures can be transferred from one specification to another for the purpose of 3D modeling and reconstruction once they are identified as repetitive structures.In this paper,a novel approach is proposed for the detection of the repetitive structures specified by the polygons of a building’s footprints.Instead of directly operating on the polygon in 2D space,the polygon is converted into a bend angle function representation in 1D space,whereby an extrusion is represented as a closed polygon intersected by the x-axis and located above it,while an intrusion is represented as a closed polygon below the x-axis.In this way,a polygon of a footprint is decomposed into a number of extrusions and intrusions which can in turn be processed.The task of detecting any repetitive structures specified in a building’s footprints then becomes the task of clustering the intersected polygons in the bend angle function space.The extrusions/intrusions which can be placed in the same clusters can be regarded as repetitive structures.Experiments show that this proposed approach can detect repetitive structures with different sizes,orientations and complexities.展开更多
With the rapid popularization of mobile devices and the wide application of various sensors,scene perception methods applied to mobile devices occupy an important position in location-based services such as navigation...With the rapid popularization of mobile devices and the wide application of various sensors,scene perception methods applied to mobile devices occupy an important position in location-based services such as navigation and augmented reality(AR).The development of deep learning technologies has greatly improved the visual perception ability of machines to scenes.The basic framework of scene visual perception,related technologies and the specific process applied to AR navigation are introduced,and future technology development is proposed.An application(APP)is designed to improve the application effect of AR navigation.The APP includes three modules:navigation map generation,cloud navigation algorithm,and client design.The navigation map generation tool works offline.The cloud saves the navigation map and provides navigation algorithms for the terminal.The terminal realizes local real-time positioning and AR path rendering.展开更多
This article presents a passive navigation method of terrain contour matching by reconstructing the 3-D terrain from the image sequence(acquired by the onboard camera).To achieve automation and simultaneity of the ima...This article presents a passive navigation method of terrain contour matching by reconstructing the 3-D terrain from the image sequence(acquired by the onboard camera).To achieve automation and simultaneity of the image sequence processing for navigation,a correspondence registration method based on control points tracking is proposed which tracks the sparse control points through the whole image sequence and uses them as correspondence in the relation geometry solution.Besides,a key frame selection method based on the images overlapping ratio and intersecting angles is explored,thereafter the requirement for the camera system configuration is provided.The proposed method also includes an optimal local homography estimating algorithm according to the control points,which helps correctly predict points to be matched and their speed corresponding.Consequently,the real-time 3-D terrain of the trajectory thus reconstructed is matched with the referenced terrain map,and the result of which provides navigating information.The digital simulation experiment and the real image based experiment have verified the proposed method.展开更多
The recent fast development in computer vision and mobile sensor technology such as mobile LiDAR and RGB-D cameras is pushing the boundary of the technology to suit the need of real-life applications in the fields of ...The recent fast development in computer vision and mobile sensor technology such as mobile LiDAR and RGB-D cameras is pushing the boundary of the technology to suit the need of real-life applications in the fields of Augmented Reality(AR),robotics,indoor GIS and self-driving.Camera localization is often a key and enabling technology among these applications.In this paper,we developed a novel camera localization workflow based on a highly accurate 3D prior map optimized by our RGBD SLAM method in conjunction with a deep learning routine trained using consecutive video frames labeled with high precision camera pose.Furthermore,an AR registration method tightly coupled with a game engine is proposed,which incorporates the proposed localization algorithm and aligns the real Kinetic camera with a virtual camera of the game engine to facilitate AR application development in an integrated manner.The experimental results show that the localization accuracy can achieve an average error of 35 cm based on a fine-tuned prior 3D feature database at 3 cm accuracy compared against the ground-truth 3D LiDAR map.The influence of the localization accuracy on the visual effect of AR overlay is also demonstrated and the alignment of the real and virtual camera streamlines the implementation of AR fire emergency response demo in a Virtual Geographic Environment.展开更多
基金supported by The Social Science Foundation of Fujian Province(Grant no.FJ2021B080)The 2023 Fujian Provincial Foreign Cooperation Science and Technology Plan Project(2023I0047)+3 种基金The 2022 Longyan Industry-University-Research Joint Innovation Project(2022LYF18001)The 2023 Fujian Natural Resources Science and Tech-nology Innovation Project(KY-060000-04-2023-2002)Open Project Fund of Hunan Provincial Key Laboratory for Remote Sensing Monitoring of Ecological Environment in Dongting Lake Area(Project No:DTH Key Lab.2023-04)The Construction Science and Technology Research and Development Project of Fujian Province,China(Grant no.2022-K-85).
文摘Historical architecture is an important carrier of cultural and historical heritage in a country and region,and its protection and restoration work plays a crucial role in the inheritance of cultural heritage.However,the damage and destruction of buildings urgently need to be repaired due to the ancient age of historical buildings and the influence of natural environment and human factors.Therefore,an artificial intelligence repair technology based on three-dimensional(3D)point cloud(PC)reconstruction and generative adversarial networks(GANs)was proposed to improve the precision and efficiency of repair work.First,in-depth research on the principles and algorithms of 3D PC data processing and GANs should be conducted.Second,a digital restoration frameworkwas constructed by combining these two artificial intelligence technologies to achieve precise and efficient restoration of historical buildings through continuous adversarial learning processes.The experimental results showed that the errors in the restoration of palace buildings,defense walls,pagodas,altars,temples,and mausoleums were 0.17,0.12,0.13,0.11,and 0.09,respectively.The technique can significantly reduce the error while maintaining the high-precision repair effect.This technology with artificial intelligence as the core has excellent accuracy and stability in the digital restoration.It provides a new technical means for the digital restoration of historical buildings and has important practical significance for the protection of cultural heritage.
文摘It is a research subject in computer vision to 3D reconstruction of an object represented by a single 2D line drawing. Previous works on 3D reconstruction from 2D line drawings focus on objects with lines, plane, view, and so on. This paper mainly studies the 3D reconstruction from 2D line drawings. Besides, a new approach is proposed: it is that for the research of the point coordinates of 2D line drawings, so as to achieve the object reconstruction by the reconstruction of point coordinates. The reconstruction process includes: (1) the collection of point coordinates (X,Y) of 2D line drawings; (2) the derivation of mathematical formula about the reconstruction of the point of 2D line drawings, and calculating the corresponding point of the 3D coordinates; (3) the regeneration of 3D graphics with 3D points; (4) analyze error by the proportional of parallel of axonometric projection, in order to prove the accuracy of the method.
基金Supported by the National Natural Science Foundation of China(41901405,41725005,41531177)and the National Key Research and Development Program of China(2016YFF0103501).
文摘Background Three-dimensional(3D)building models with unambiguous roof plane geometry parameters,roof structure units,and linked topology provide essential data for many applications related to human activities in urban environments.The task of 3D reconstruction from point clouds is still in the development phase,especially the recognition and interpretation of roof topological structures.Methods This study proposes a novel visual perception-based approach to automatically decompose and reconstruct building point clouds into meaningful and simple parametric structures,while the associated mutual relationships between the roof plane geometry and roof structure units are expressed by a hierarchical topology tree.First,a roof plane extraction is performed by a multi-label graph cut energy optimization framework and a roof structure graph(RSG)model is then constructed to describe the roof topological geometry with common adjacency,symmetry,and convexity rules.Moreover,a progressive roof decomposition and refinement are performed,generating a hierarchical representation of the 3D roof structure models.Finally,a visual plane fitted residual or area constraint process is adopted to generate the RSG model with different levels of details.Results Two airborne laser scanning datasets with different point densities and roof styles were tested,and the performance evaluation metrics were obtained by International Society for Photogrammetry and Remote Sensing,achieving a correctness and accuracy of 97.7%and 0.29m,respectively.Conclusions The standardized assessment results demonstrate the effectiveness and robustness of the proposed approach,showing its ability to generate a variety of structural models,even with missing data.
文摘In order to find better simplicity measurements for 3D object recognition, a new set of local regularities is developed and tested in a stepwise 3D reconstruction method, including localized minimizing standard deviation of angles(L-MSDA), localized minimizing standard deviation of segment magnitudes(L-MSDSM), localized minimum standard deviation of areas of child faces (L-MSDAF), localized minimum sum of segment magnitudes of common edges (L-MSSM), and localized minimum sum of areas of child face (L-MSAF). Based on their effectiveness measurements in terms of form and size distortions, it is found that when two local regularities: L-MSDA and L-MSDSM are combined together, they can produce better performance. In addition, the best weightings for them to work together are identified as 10% for L-MSDSM and 90% for L-MSDA. The test results show that the combined usage of L-MSDA and L-MSDSM with identified weightings has a potential to be applied in other optimization based 3D recognition methods to improve their efficacy and robustness.
基金This work was supported by the Klaus Tschira Foundation Heidelberg and the project[FA1189/3-1]funded by the Deutsche Forschungsgemeinschaft(DFG).
文摘Repetitive structures of a building share features in terms of geometries and appearance and,therefore,the 3D information for these structures can be transferred from one specification to another for the purpose of 3D modeling and reconstruction once they are identified as repetitive structures.In this paper,a novel approach is proposed for the detection of the repetitive structures specified by the polygons of a building’s footprints.Instead of directly operating on the polygon in 2D space,the polygon is converted into a bend angle function representation in 1D space,whereby an extrusion is represented as a closed polygon intersected by the x-axis and located above it,while an intrusion is represented as a closed polygon below the x-axis.In this way,a polygon of a footprint is decomposed into a number of extrusions and intrusions which can in turn be processed.The task of detecting any repetitive structures specified in a building’s footprints then becomes the task of clustering the intersected polygons in the bend angle function space.The extrusions/intrusions which can be placed in the same clusters can be regarded as repetitive structures.Experiments show that this proposed approach can detect repetitive structures with different sizes,orientations and complexities.
基金supported by ZTE Industry⁃University⁃Institute Coopera⁃tion Funds under Grant No.HC⁃CN⁃20210707004.
文摘With the rapid popularization of mobile devices and the wide application of various sensors,scene perception methods applied to mobile devices occupy an important position in location-based services such as navigation and augmented reality(AR).The development of deep learning technologies has greatly improved the visual perception ability of machines to scenes.The basic framework of scene visual perception,related technologies and the specific process applied to AR navigation are introduced,and future technology development is proposed.An application(APP)is designed to improve the application effect of AR navigation.The APP includes three modules:navigation map generation,cloud navigation algorithm,and client design.The navigation map generation tool works offline.The cloud saves the navigation map and provides navigation algorithms for the terminal.The terminal realizes local real-time positioning and AR path rendering.
基金supported by the "Eleventh Five" Obligatory Budget of PLA (Grant No.513150801)
文摘This article presents a passive navigation method of terrain contour matching by reconstructing the 3-D terrain from the image sequence(acquired by the onboard camera).To achieve automation and simultaneity of the image sequence processing for navigation,a correspondence registration method based on control points tracking is proposed which tracks the sparse control points through the whole image sequence and uses them as correspondence in the relation geometry solution.Besides,a key frame selection method based on the images overlapping ratio and intersecting angles is explored,thereafter the requirement for the camera system configuration is provided.The proposed method also includes an optimal local homography estimating algorithm according to the control points,which helps correctly predict points to be matched and their speed corresponding.Consequently,the real-time 3-D terrain of the trajectory thus reconstructed is matched with the referenced terrain map,and the result of which provides navigating information.The digital simulation experiment and the real image based experiment have verified the proposed method.
基金This work was funded by the National Key Research and Development Program of China[grant number 2016YFB0502102]It was also partially funded by National Natural Science Foundation of China[grant number 41101436]the Scientific Research Foundation for the Returned Overseas Chinese Scholars,State Education Ministry。
文摘The recent fast development in computer vision and mobile sensor technology such as mobile LiDAR and RGB-D cameras is pushing the boundary of the technology to suit the need of real-life applications in the fields of Augmented Reality(AR),robotics,indoor GIS and self-driving.Camera localization is often a key and enabling technology among these applications.In this paper,we developed a novel camera localization workflow based on a highly accurate 3D prior map optimized by our RGBD SLAM method in conjunction with a deep learning routine trained using consecutive video frames labeled with high precision camera pose.Furthermore,an AR registration method tightly coupled with a game engine is proposed,which incorporates the proposed localization algorithm and aligns the real Kinetic camera with a virtual camera of the game engine to facilitate AR application development in an integrated manner.The experimental results show that the localization accuracy can achieve an average error of 35 cm based on a fine-tuned prior 3D feature database at 3 cm accuracy compared against the ground-truth 3D LiDAR map.The influence of the localization accuracy on the visual effect of AR overlay is also demonstrated and the alignment of the real and virtual camera streamlines the implementation of AR fire emergency response demo in a Virtual Geographic Environment.