摘要
通过整合深度和颜色信息,深度摄像机Kinect能够稳健的侦测出人体及人体骨架关节点,为计算机视觉、人体行为识别、机器人学的发展带来了革命性的进步.然而单台深度摄像机的侦测范围有限.虽然采用多台深度摄像机所构建的摄像机网可有效的扩大侦测范围,但是必须依赖深度摄影机之间的相对位置与朝向的精确标定.论文采用作者之前提出的以人体骨架为基础的视角无关转换技术,能快速稳健的标定出深度摄像机之间的位置关系.通过利用相邻两台深度摄影机同时侦测到的人体骨架,论文能直接利用深度摄影机所量测的人体上半身中稳定的关节点为新坐标系的参考点,实时的计算出两摄影机之间的平移向量和旋转矩阵,而不依赖其他额外的校正设备或人为介入处理.通过在室内环境中安装两台摆放于不同位置与朝向的深度摄影机,从而,验证了该方法的实时性与易用性.该实时标定方法解决了深度摄影机侦测范围有限的限制,同时,可由两两相邻的标定扩展到多台深度相机的全局标定,从而,可以被广泛的应用于人体行为识别、情境感知服务等领域.
Combining depth information and color image, D-RGB cameras provide a ready detection of human and associated 3D skeleton joints data, facilitating, if not revolutionizing, conventional image centric researches in, among others, computer vision, surveillance, and human activity analysis. Applicability of a D-RBG camera, however, is restricted by its limited range of frustum of depth in the range of 0.8 to 4 meters. Although a D-RGB camera network, constructed by deployment of several D-RGB cameras at various locations, could extend the range of coverage, it requires precise localization of the camera network: relative location and orientation of neighboring cameras. By introducing a skeleton- based viewpoint invariant transformation (SVIT), which derives the relative location and orientation of a detected human's upper torso to a D-RGB camera, this paper presents a reliable automatic localization technique without the need for additional instrument or human intervention. By respectively applying SVIT to two neighboring D-RGB cameras on a commonly observed skeleton, the respective relative position and orientation of the detected human's skeleton for these two cameras can be obtained before being combined to yield the relative position and orientation of these two cameras, thus solving the localization problem. Experiments have been conducted in which two Kinects are situated with bearing differences of about 45 degrees and 90 degrees; the coverage can be extended by up to 70% with the installment of an additional Kinect. The same localization technique can be applied repeatedly to a larger number of D-RGB cameras, thus extending the applicability of D-RGB cameras to camera networks in making human behavior analysis and context-aware service in a larger surveillance area.
出处
《物理学报》
SCIE
EI
CAS
CSCD
北大核心
2014年第7期175-183,共9页
Acta Physica Sinica
基金
科技部国际合作项目(批准号:2010DFA12210)
上海科技人才项目(批准号:11XD1404800)
上海科委科学基础研究重点项目(批准号:12JC1408800)资助的课题~~
关键词
深度摄像机网
全局标定
视角无关
KINECT
D-RGB camera network, network localization, viewpoint invariance, Kinect