期刊文献+

基于自动定位分割的图书识别框架 被引量:2

A Framework for Book Cover Recognition Based on Automatic Location and Segmentation
下载PDF
导出
摘要 提出一种基于自动定位分割的图书识别算法,主要包括对拍摄图像进行图书封面区域的自动定位、感兴趣区域(ROI)分割、标准形状矫正以及有效区域的特征提取与相似性匹配部分.自动定位分割部分根据图书封面的几何形状特点,通过基于霍夫变换的形状检测算法对自然场景下拍摄的图书封面图像进行有效区域定位,对ROI进行分割并根据逆仿射变换将其矫正到标准形状;然后对获取的有效图书封面区域进行基于改进的尺度不变特征变换(SIFT)的特征点检测和特征描述,并采用词包(BOW)方法对其进行特征量化和码本学习,从而将定位分割出的图书图像与数据库中图书源图像进行相似性匹配.实验结果表明,对包含一定复杂程度背景的图书图像进行准确的定位分割和矫正,在很大程度上影响着基于特征匹配的图书识别技术的精确度. This paper put forward a framework for book recognition based on automatic location and segmentation, which includes the stage of book automatic location, ROI segmentation, standard shape correction, effective feature extraction and similarity matching. Automatic location and segmentation stage based on the geometric shape characteristic of book, takes effective regional orientation and ROI segmentation through the hough transform algorithm , and then correct the real book cover areas to a standard shape according to the inverse affine transformation. The stage of feature extraction and similarity matching mainly used the improved SIFT algorithm to conduct feature points detection and generate feature description, and then conduct the similarity matching by the popular BOW method which generates codebook and quantizes the features. The results of our experiments show that the accurate location and segmentation of the book cover from a image which contain some appropriate background and taking standard shape correction have a significant impact on the precision of recognition.
作者 刘玉杰 李伟
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2012年第11期1464-1470,共7页 Journal of Computer-Aided Design & Computer Graphics
基金 中央高校基本科研基金(09CX04044A 10CX04043A 10CX04014B 11CX04053A 11CX06086A 12CX06083A 12CX06086A) 文化部科技创新基金(46-2010) 山东省自然科学基金(ZR2009GL014) 山东省中青年科学家奖励基金(BS2010DX037)
关键词 图书识别 霍夫变换 自动定位分割 特征提取 基于内容的图像检索 book cover recognition hough transform automatic location and segmentation featureextraction content-based image retrieval
  • 相关文献

参考文献19

  • 1袁昕,朱淼良.基于主色匹配的图像检索系统[J].计算机辅助设计与图形学学报,2000,12(12):917-921. 被引量:20
  • 2Girod B, Chandrasekhar V, Chen D M, et al. Mobile visual search [J]. IEEE Signal Processing Magazine, 2011, 28(4): 61-76.
  • 3Lowe D G. Distinctive image features from scale-invariant keypoints [J]. International Journal of Computer Vision, 2004, 60(2): 91-110.
  • 4Morel J M, Yu G S. ASIFT: a new framework for fully affine invariant image comparison [J]. SIAM Journal on Imaging Sciences, 2009, 2(2): 438-469.
  • 5Iwata K, Yamamoto K. Book cover identification by using four directional features filed for a small-scale library system [C] //Proceedings of International Conference on Document Analysis and Recognition. Los Alamitos: IEEE Computer Society Press, 2001:582-586.
  • 6Tsai S S, Chen D, Singh J P, etal. Rate-efficient, real-time cd cover recognition on a camera-phone[C] //Proceedings of the 16th International Conference on Multimedia. New York: ACM Press, 2008: 1023-1024.
  • 7Tsai S S, Chen D M, Chandrasekhar V, etal. Mobile product recognition[C] //Proceedings of the International Conference on Multimedia. New York: ACMPress, 2010:1587-1590.
  • 8Burges C J C. A tutorial on support vector machines for pattern recognition [J]. Data Mining and Knowledge Discovery, 1998, 2(2): 121-167.
  • 9Shi J B, Malik J. Normalized cuts and image segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(8): 888-905.
  • 10Osher S, Sethian J A. Fronts propagating with curvature dependent speedz algorithms based on the Hamilton Jacobi formulation [J]. Journal of Computational Physics, 1988, 79 (1): 12-49.

二级参考文献3

  • 1Shared Mehrotra,Proceedings ofIEEE International Conference on Multimedia Computing andSystems’,1997年,632页
  • 2Smith J R,The 4th ACM International Multimedia Conference 96 Proceedings (ACM Multimedia9,1996年,87页
  • 3徐旭,朱淼良,梁倩卉.一种用于CBIR系统的主色提取及表示方法[J].计算机辅助设计与图形学学报,1999,11(5):385-388. 被引量:27

共引文献19

同被引文献7

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部