期刊文献+

结合图像识别的移动增强现实系统设计与应用 被引量:20

Mobile augmented reality system design and application based on image recognition
原文传递
导出
摘要 目的在移动互联网时代下,移动增强现实应用得到越来越快的发展。然而户外场景中存在许多相似结构的建筑,且手机的存储和计算能力有限,因此应用多集中于室内小范围环境,对于室外大规模复杂场景的适应性较弱。对此,建立一套基于云端图像识别的移动增强现实系统。方法为解决相似特征的误匹配问题,算法中将重力信息加入到SURF和BRISK特征描述中去,构建Gravity-SURF和Gravity-BRISK特征描述。云端系统对增强信息进行有效管理,采用基于Gravity-SURF特征的VLAD方法对大规模图像进行识别;在智能终端上的应用中呈现识别图像的增强信息,并利用识别图像的Gravity-BRISK特征和光流结合的方法对相机进行跟踪,采用Unity3D渲染引擎实时绘制3维模型。结果在包含重力信息的4 000幅户外图像的数据库中进行实验。采用结合重力信息的特征描述算法,能够增强具有相似特征的描述符的区分性,并提高匹配正确率。图像识别算法的识别率能达到88%以上,识别时间在420 ms左右;光流跟踪的RMS误差小于1.2像素,帧率能达到23帧/s。结论本文针对室外大规模复杂场景建立的基于图像识别的移动增强现实系统,能方便对不同应用的增强现实数据进行管理。系统被应用到谷歌眼镜和新闻领域上,不局限于单一的应用领域。结果表明,识别算法和跟踪注册算法能够满足系统的精度和实时性要求。 Objective With the development of mobile Interuet, the speed of advancement of mobile augmented reality ap- plications is accelerating. However, given that storage and computation capacity are limited on mobile devices and that large-scale outdoor images mostly include similar building structure images, these applications are restricted to indoor envi- ronments. To solve this problem, a mobile augmented reality system based on cloud image recognition is established. Method To reduce mistake-matching probability, we add gTavity information into the SURF and BRISK feature description algorithm. Gravity information is used as the main orientation. In our system, these two description algorithms are named Gravity-SURF and Gravity-BRISK. In the cloud, large-scale images and augmented information are managed by the aug- mented reality management system. The VLAD algorithm based on Gravity-SURF is applied to large-scale image recogni- tion. On intelligent terminals, different user interfaces are designed for augmented reality applications to display augmented information. The model rendered is a presentation format of the augmented information. We propose a method that combines Gravity-BRISK and optical flow to perform a real-time camera tracking algorithm. When using rendering engine Unity3D for 3D rendering, we propose a solution for the transformation of camera pose in different coordinate systems. Thereafter, the camera pose data will be transmitted from an Android platform to Unity3D by the Android Native Development Kit. Result In the experiments, we first build an image database with 4 000 images. The database contains 800 outdoor scenes and each scene contains 5 images in different angles and distances. Each image must be associated with the local gravity information. Thereafter, we compare the traditional description algorithm with the gravity-based description algorithm. From the experi- ments, we can conclude that the gravity-based description algorithm is able to enhance the discrimination of similar features and improve the matching accuracy. Finally, we test the performance and accuracy of our image recognition algorithm and tracking algorithm. The recognition algorithm experiments show that the VLAD algorithm based on Gravity-SURF can im- prove the recognition rate. In 4G and WIFI network environments, the upload time of the recognition image is less than 40 ms, the online recognition time is approximately 420 ms, and the augmented data download time is less than 4 s. Fur- thermore, the tracking algorithm experimerits show that the RMS errors in variation of the rotation and scale are all less than 1. 2 pixels. For the optical flow stage, the frame rate is able to reach 23 frame/s. In terms of the above experiments, the algorithm can satisfy the accuracy and real-time demands of the system. Conclusion The mobile augmented reality system is suitable for complex outdoor environments. This system has beela applied in Google Glass and the journalism field but is not limited to these two areas. Nonetheless, the augmented information in different applications can be managed efficiently. The system we established aims to promote the development of MAR applications.
出处 《中国图象图形学报》 CSCD 北大核心 2016年第2期184-191,共8页 Journal of Image and Graphics
基金 国家高科技研究发展计划基金项目(2013AA013802) 新华社通信技术局科研专项~~
关键词 移动增强现实 重力信息 图像识别 特征跟踪 模型渲染 mobile augmented reality gravity information image recognition feature tracking model rendering
  • 相关文献

参考文献17

  • 1Miller R, Constine J. Apple acquires augmented reality company metaio [EB/OL]. (2015-05-28)[2015-07-26]. http://techcrunch.com/2015/05/28/apple-metaio/.
  • 2Farooqui A. Smartglasses patented by Samsung could give Google Glass a run for its money [EB/OL]. (2015-05-17)[2015-07-26]. http://www.sammobile.com/2015/05/17/smartglasses-patented-by-samsung-could-give-google-glass-a-run-for-its-money/.
  • 3Wagner D, Reitmayr G, Mulloni A, et al. Pose tracking from natural features on mobile phones [C]/The 7th IEEE/ACM International Symposium on Mixed and Augmented Reality. Cambridge: IEEE Computer Society, 2008: 125-134. [DOI: 10.1109/ISMAR.2008.4637338].
  • 4Klein G, Murray D. Parallel tracking and mapping on a camera phone [C]//The 8th IEEE International Symposium on Mixed and Augmented Reality. Orlando, FL: IEEE, 2009: 83-86. [DOI: 10.1109/ISMAR.2009.5336495].
  • 5李扬,孙超,张明敏,张文杰,潘志庚.跟踪与匹配并行的增强现实注册方法[J].中国图象图形学报,2011,16(4):680-685. 被引量:24
  • 6Bay H, Ess A, Tuytelaars T, et al. Speeded-up robust features[J]. Computer Vision and Image Understanding, 2008, 110(3): 346-359. [DOI: 10.1016/j.cviu.2007.09.014].
  • 7Pirchheim C, Schmalstieg D, Reitmayr G. Handling pure camera rotation in keyframe-based slam [C]//2013 IEEE International Symposium on Mixed and Augmented Reality. Adelaide, SA: IEEE, 2013: 229-238. [DOI: 10.1109/ISMAR.2013.6671783].
  • 8Leutenegger S, Chli M, Siegwart R Y. BRISK: binary robust invariant scalable keypoints [C]//Proceedings IEEE International Conference on Computer Vision. Barcelona: IEEE, 2011: 2548-2555. [DOI: 10.1109/ICCV.2011.6126542].
  • 9Kurz D, Ben Himane S. Inertial sensor-aligned visual feature descriptors [C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Providence, RI: IEEE, 2011: 161-166. [DOI: 10.1109/CVPR.2011.5995339].
  • 10Sivic J, Zisserman A. Video google: a text retrieval approach to object matching in videos [C]//Proceedings of the 9th IEEE International Conference on Computer Vision. Nice, France: IEEE, 2003: 1470-1477. [DOI: 10.1109/ICCV.2003.1238663].

二级参考文献27

  • 1管涛,李利军,段利亚,王乘.基于标识与平面间单应性矩阵的虚实注册方法[J].华中科技大学学报(自然科学版),2006,34(10):55-57. 被引量:11
  • 2章国锋,秦学英,董子龙,华炜,鲍虎军.面向增强视频的基于结构和运动恢复的摄像机定标[J].计算机学报,2006,29(12):2104-2111. 被引量:10
  • 3Azuma R T.A survey of augmented reality[.1].PresenceTeleoperators and Virtual Environments,1997,6 (4):355-385.
  • 4Zhou F,Duh HBL,Billinghurst M,Trends in augmented reality tracking,interaction and display:a review of ten years of ISMAR[C]//Proceedings of 7th IEEE/ACM International Symposium on Mixed and Augmented Reality.Cambridge,UK:IEEE Computer Society,2008:193-202.
  • 5Kato H,Billinghurst M.Marker tracking and HMD calibration for a video-based augmented reality conferencing system[C]//Proceedings of 2nd IEEE and ACM International Workshop on Augmented Reality,Washington DC,USA:IEEE Computer Society,1999:85-94.
  • 6Fiala M.ARTag:an improved marker system based on ARtoolkit[J].National Research Council Publication NRC,2004,4(7):166-174.
  • 7Kim,K,Lepetit,V,Woo,W.Scalable real-time planar targets tracking for digilog books[J].The Visual Computer,2010,26(11):1145-1154.
  • 8Bay H,Tuytelaars T,Van Gool L.Speeded-up robust features (SURF)[J].Computer Vision and Image Understanding,2008,110(3):346-389.
  • 9Wagner D,Reitmayr G,Mulloni A,Drummond T,Schmalstieg D.Pose tracking from natural features on mobile phones[C]//Proceedings of 7th IEEE/ACM International Symposium on Mixed and Augmented Reality.Washington DC,USA:IEEE Computer Society,2008:125-134.
  • 10Simon G,Fitzgibbon A W,Zisserman A.Markerless tracking using planar structures in the scene[C]//Proceedings of IEEE and ACM International Symposium on Augmented Reality 2000.Washington DC,USA:IEEE Computer Society,2000:120-128.

共引文献38

同被引文献160

引证文献20

二级引证文献39

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部