The aim is to reconstruct a complete and detailed clothed human from a single-view input.Implicit function is suitable for this task because it represents fine shape details and varied topology.Current methods,however...The aim is to reconstruct a complete and detailed clothed human from a single-view input.Implicit function is suitable for this task because it represents fine shape details and varied topology.Current methods,however,often suffer from artefacts such as broken or disembodied body parts,missing details,or depth ambiguity due to the ambiguity and complexity of human articulation.The main issue observed by the authors is structureagnostic.To address these problems,the authors fully utilise the skinned multi-person linear(SMPL)model and propose a method using the Skeleton-aware Implicit Function(SIF).To alleviate the broken or disembodied body parts,the proposed skeleton-aware structure prior makes the skeleton awareness into an implicit function,which consists of a bone-guided sampling strategy and a skeleton-relative encoding strategy.To deal with the missing details and depth ambiguity problems,the authors’body-guided pixel-aligned feature exploits the SMPL to enhance 2D normal and depth semantic features,and the proposed feature aggregation uses the extra geometry-aware prior to enabling a more plausible merging with less noisy geometry.Additionally,SIF is also adapted to the RGB-D input,and experimental results show that SIF outperforms the state-of-the-arts methods on challenging datasets from Twindom and Thuman3.0.展开更多
<strong>Aim:</strong> To carry out a 3D vector reconstruction of the typical cervical vertebra from anatomical sections of the “Korean Visible Human” for educational purposes. <strong>Material and ...<strong>Aim:</strong> To carry out a 3D vector reconstruction of the typical cervical vertebra from anatomical sections of the “Korean Visible Human” for educational purposes. <strong>Material and Methods:</strong> The anatomical subject was a 33-year-old Korean man who died of leukemia. He was 164 cm tall and weighed 55 kg. This man donated his body to science. Her body was frozen and cut into several anatomical sections after an MRI and CT scan. These anatomical sections were made using a special saw called a 0.2 mm thick cryomacrotome. Thus 8100 cuts were obtained. Only the sections numbered 940 to 1200 were used for our study. A segmentation by manual contouring of the different parts of the typical cervical vertebra was made using the software Winsurf version 3.5 on a laptop PC running Windows 7 equipped with a Ram of 8 gigas. <strong>Results:</strong> Our 3D vector model of the typical cervical vertebra is easily manipulated using the Acrobat 3DPDF interface. Each part of the vertebra accessible in a menu can be displayed, hidden or made transparent, and 3D labels are available as well as educational menus for learning anatomy. <strong>Conclusion: </strong>This original work constitutes a remarkable educational tool for the anatomical study of the typical cervical vertebra and can also be used as a 3D atlas for simulation purposes for training in therapeutic gestures.展开更多
Human pose estimation is a basic and critical task in the field of computer vision that involves determining the position(or spatial coordinates)of the joints of the human body in a given image or video.It is widely u...Human pose estimation is a basic and critical task in the field of computer vision that involves determining the position(or spatial coordinates)of the joints of the human body in a given image or video.It is widely used in motion analysis,medical evaluation,and behavior monitoring.In this paper,the authors propose a method for multi-view human pose estimation.Two image sensors were placed orthogonally with respect to each other to capture the pose of the subject as they moved,and this yielded accurate and comprehensive results of three-dimensional(3D)motion reconstruction that helped capture their multi-directional poses.Following this,we propose a method based on 3D pose estimation to assess the similarity of the features of motion of patients with motor dysfunction by comparing differences between their range of motion and that of normal subjects.We converted these differences into Fugl–Meyer assessment(FMA)scores in order to quantify them.Finally,we implemented the proposed method in the Unity framework,and built a Virtual Reality platform that provides users with human–computer interaction to make the task more enjoyable for them and ensure their active participation in the assessment process.The goal is to provide a suitable means of assessing movement disorders without requiring the immediate supervision of a physician.展开更多
Large-scale indoor 3D reconstruction with multiple robots faces challenges in core enabling technologies.This work contributes to a framework addressing localization,coordination,and vision processing for multi-agent ...Large-scale indoor 3D reconstruction with multiple robots faces challenges in core enabling technologies.This work contributes to a framework addressing localization,coordination,and vision processing for multi-agent reconstruction.A system architecture fusing visible light positioning,multi-agent path finding via reinforcement learning,and 360°camera techniques for 3D reconstruction is proposed.Our visible light positioning algorithm leverages existing lighting for centimeter-level localization without additional infrastructure.Meanwhile,a decentralized reinforcement learning approach is developed to solve the multi-agent path finding problem,with communications among agents optimized.Our 3D reconstruction pipeline utilizes equirectangular projection from 360°cameras to facilitate depth-independent reconstruction from posed monocular images using neural networks.Experimental validation demonstrates centimeter-level indoor navigation and 3D scene reconstruction capabilities of our framework.The challenges and limitations stemming from the above enabling technologies are discussed at the end of each corresponding section.In summary,this research advances fundamental techniques for multi-robot indoor 3D modeling,contributing to automated,data-driven applications through coordinated robot navigation,perception,and modeling.展开更多
Reconstructing 3D digital models of humans from sensory data is a long-standing problem in computer vision and graphics with a variety of applications in VR/AR,film production,and human–computer interaction,etc.While...Reconstructing 3D digital models of humans from sensory data is a long-standing problem in computer vision and graphics with a variety of applications in VR/AR,film production,and human–computer interaction,etc.While a huge amount of effort has been devoted to developing various capture hardware and reconstruction algorithms,traditional reconstruction pipelines may still suffer from high-cost capture systems and tedious capture processes,which prevent them from being easily accessible.Moreover,the dedicatedly hand-crafted pipelines are prone to reconstruction artifacts,resulting in limited visual quality.To solve these challenges,the recent trend in this area is to use deep neural networks to improve reconstruction efficiency and robustness by learning human priors from existing data.Neural network-based implicit functions have been also shown to be a favorable 3D representation compared to traditional forms like meshes and voxels.Furthermore,neural rendering has emerged as a powerful tool to achieve highly photorealistic modeling and re-rendering of humans by end-to-end optimizing the visual quality of output images.In this article,we will briefly review these advances in this fast-developing field,discuss the advantages and limitations of different approaches,and finally,share some thoughts on future research directions.展开更多
To provide practical and surgical anatomy for the imaging diagnosis and surgical treatment of the disease of the caudate lobe of the liver. Methods: Based on Chinese Visible Human 1-5 data sets and assisted by 3D vis...To provide practical and surgical anatomy for the imaging diagnosis and surgical treatment of the disease of the caudate lobe of the liver. Methods: Based on Chinese Visible Human 1-5 data sets and assisted by 3D visualization and reconstruction, the 3D models of the upper abdomen or the liver were reconstructed and the cross-sectional images were converted to the coronal and sagittal images. The anatomy of the caudate lobe of the liver on the coronal and sagittal planes was investigated on serial planes of the upper abdomen. Results: The caudate lobe was bordered on the left by the fissura ligamenti venosi, posteriorly by the IVC, superiorly by the hepatic veins and inferiorly by the hepatic hilum. Its right and ventral borders might be obscure, with only relative borders existent. The right wall of the IVC was a good landmark to judge the relative realm of paracaval portion, and the relative ventral plane might exist between the hepatic hilum and entrance of hepatic veins. The caudate lobe could be divided into two principal regions: the left Spiegel lobe and the right paracaval portion. The caudate process, and the right rear process occurring in some individuals belonged to the right paracaval portion. The caudate lobe was blood supplied by the portal vein, which directly drained into the IVC. Conclusion: There are not definite borders for the right part of the caudate lobe, and most of the knowledge on it is based on the cast study, which may not suit for the clinical diagnosis and practice. The coronal and sagittal sections can better show the anatomic relationships between the caudate lobe, the other parts of the liver and the adjacent structures. The 3D digital visualization is an accurate and convenient study method for clinical anatomy.展开更多
基金National Key R&D Program of China,Grant/Award Number:2022YFF0901902。
文摘The aim is to reconstruct a complete and detailed clothed human from a single-view input.Implicit function is suitable for this task because it represents fine shape details and varied topology.Current methods,however,often suffer from artefacts such as broken or disembodied body parts,missing details,or depth ambiguity due to the ambiguity and complexity of human articulation.The main issue observed by the authors is structureagnostic.To address these problems,the authors fully utilise the skinned multi-person linear(SMPL)model and propose a method using the Skeleton-aware Implicit Function(SIF).To alleviate the broken or disembodied body parts,the proposed skeleton-aware structure prior makes the skeleton awareness into an implicit function,which consists of a bone-guided sampling strategy and a skeleton-relative encoding strategy.To deal with the missing details and depth ambiguity problems,the authors’body-guided pixel-aligned feature exploits the SMPL to enhance 2D normal and depth semantic features,and the proposed feature aggregation uses the extra geometry-aware prior to enabling a more plausible merging with less noisy geometry.Additionally,SIF is also adapted to the RGB-D input,and experimental results show that SIF outperforms the state-of-the-arts methods on challenging datasets from Twindom and Thuman3.0.
文摘<strong>Aim:</strong> To carry out a 3D vector reconstruction of the typical cervical vertebra from anatomical sections of the “Korean Visible Human” for educational purposes. <strong>Material and Methods:</strong> The anatomical subject was a 33-year-old Korean man who died of leukemia. He was 164 cm tall and weighed 55 kg. This man donated his body to science. Her body was frozen and cut into several anatomical sections after an MRI and CT scan. These anatomical sections were made using a special saw called a 0.2 mm thick cryomacrotome. Thus 8100 cuts were obtained. Only the sections numbered 940 to 1200 were used for our study. A segmentation by manual contouring of the different parts of the typical cervical vertebra was made using the software Winsurf version 3.5 on a laptop PC running Windows 7 equipped with a Ram of 8 gigas. <strong>Results:</strong> Our 3D vector model of the typical cervical vertebra is easily manipulated using the Acrobat 3DPDF interface. Each part of the vertebra accessible in a menu can be displayed, hidden or made transparent, and 3D labels are available as well as educational menus for learning anatomy. <strong>Conclusion: </strong>This original work constitutes a remarkable educational tool for the anatomical study of the typical cervical vertebra and can also be used as a 3D atlas for simulation purposes for training in therapeutic gestures.
基金This work was supported by grants fromthe Natural Science Foundation of Hebei Province,under Grant No.F2021202021the S&T Program of Hebei,under Grant No.22375001Dthe National Key R&D Program of China,under Grant No.2019YFB1312500.
文摘Human pose estimation is a basic and critical task in the field of computer vision that involves determining the position(or spatial coordinates)of the joints of the human body in a given image or video.It is widely used in motion analysis,medical evaluation,and behavior monitoring.In this paper,the authors propose a method for multi-view human pose estimation.Two image sensors were placed orthogonally with respect to each other to capture the pose of the subject as they moved,and this yielded accurate and comprehensive results of three-dimensional(3D)motion reconstruction that helped capture their multi-directional poses.Following this,we propose a method based on 3D pose estimation to assess the similarity of the features of motion of patients with motor dysfunction by comparing differences between their range of motion and that of normal subjects.We converted these differences into Fugl–Meyer assessment(FMA)scores in order to quantify them.Finally,we implemented the proposed method in the Unity framework,and built a Virtual Reality platform that provides users with human–computer interaction to make the task more enjoyable for them and ensure their active participation in the assessment process.The goal is to provide a suitable means of assessing movement disorders without requiring the immediate supervision of a physician.
基金supported by Bright Dream Robotics and the HKUSTBDR Joint Research Institute Funding Scheme under Project HBJRI-FTP-005(Automated 3D Reconstruction using Robot-mounted 360-Degree Camera with Visible Light Positioning Technology for Building Information Modelling Applications,OKT22EG06).
文摘Large-scale indoor 3D reconstruction with multiple robots faces challenges in core enabling technologies.This work contributes to a framework addressing localization,coordination,and vision processing for multi-agent reconstruction.A system architecture fusing visible light positioning,multi-agent path finding via reinforcement learning,and 360°camera techniques for 3D reconstruction is proposed.Our visible light positioning algorithm leverages existing lighting for centimeter-level localization without additional infrastructure.Meanwhile,a decentralized reinforcement learning approach is developed to solve the multi-agent path finding problem,with communications among agents optimized.Our 3D reconstruction pipeline utilizes equirectangular projection from 360°cameras to facilitate depth-independent reconstruction from posed monocular images using neural networks.Experimental validation demonstrates centimeter-level indoor navigation and 3D scene reconstruction capabilities of our framework.The challenges and limitations stemming from the above enabling technologies are discussed at the end of each corresponding section.In summary,this research advances fundamental techniques for multi-robot indoor 3D modeling,contributing to automated,data-driven applications through coordinated robot navigation,perception,and modeling.
基金The authors would like to acknowledge the support from NSFC(No.62172364).
文摘Reconstructing 3D digital models of humans from sensory data is a long-standing problem in computer vision and graphics with a variety of applications in VR/AR,film production,and human–computer interaction,etc.While a huge amount of effort has been devoted to developing various capture hardware and reconstruction algorithms,traditional reconstruction pipelines may still suffer from high-cost capture systems and tedious capture processes,which prevent them from being easily accessible.Moreover,the dedicatedly hand-crafted pipelines are prone to reconstruction artifacts,resulting in limited visual quality.To solve these challenges,the recent trend in this area is to use deep neural networks to improve reconstruction efficiency and robustness by learning human priors from existing data.Neural network-based implicit functions have been also shown to be a favorable 3D representation compared to traditional forms like meshes and voxels.Furthermore,neural rendering has emerged as a powerful tool to achieve highly photorealistic modeling and re-rendering of humans by end-to-end optimizing the visual quality of output images.In this article,we will briefly review these advances in this fast-developing field,discuss the advantages and limitations of different approaches,and finally,share some thoughts on future research directions.
基金Supported by the National Natural Science Fund Aided Project (60473128)
文摘To provide practical and surgical anatomy for the imaging diagnosis and surgical treatment of the disease of the caudate lobe of the liver. Methods: Based on Chinese Visible Human 1-5 data sets and assisted by 3D visualization and reconstruction, the 3D models of the upper abdomen or the liver were reconstructed and the cross-sectional images were converted to the coronal and sagittal images. The anatomy of the caudate lobe of the liver on the coronal and sagittal planes was investigated on serial planes of the upper abdomen. Results: The caudate lobe was bordered on the left by the fissura ligamenti venosi, posteriorly by the IVC, superiorly by the hepatic veins and inferiorly by the hepatic hilum. Its right and ventral borders might be obscure, with only relative borders existent. The right wall of the IVC was a good landmark to judge the relative realm of paracaval portion, and the relative ventral plane might exist between the hepatic hilum and entrance of hepatic veins. The caudate lobe could be divided into two principal regions: the left Spiegel lobe and the right paracaval portion. The caudate process, and the right rear process occurring in some individuals belonged to the right paracaval portion. The caudate lobe was blood supplied by the portal vein, which directly drained into the IVC. Conclusion: There are not definite borders for the right part of the caudate lobe, and most of the knowledge on it is based on the cast study, which may not suit for the clinical diagnosis and practice. The coronal and sagittal sections can better show the anatomic relationships between the caudate lobe, the other parts of the liver and the adjacent structures. The 3D digital visualization is an accurate and convenient study method for clinical anatomy.