期刊文献+

一种新的DCT压缩域字符快速定位算法 被引量:2

A New Fast Text Location Algorithm in DCT-Compressed Domain
下载PDF
导出
摘要 提出了一种基于DCT压缩域的字符定位算法,能够快速定位出具有复杂背景图像中的字符区域。该算法面向部分解码后的JPEG图像,从Y分量DCT压缩码流中提取出一种新的字符/非字符分类特征,并采用自适应阈值法实现分类,利用投影法确定出字符区的位置。实验表明,该算法对不同复杂背景下的JPEG图像,可以有效实现中、英文字符区的提取,查全率和查准率可以达到90%以上,处理速度快,能够实现实时处理。 A fast and efficient automatic text location method is presented. Text regions are segmented from JPEG compressed images with complex background us- ing a new feature, which is extracted from the DCT- compressed domain. Hence only a very small amount of decompressing operations is required. Then a projecting and merging algorithm is used to locate the final text ar- eas. Experimental results show that this method works well on various language text locations with precision and recall of more than 90%.
出处 《测控技术》 CSCD 2005年第5期48-51,共4页 Measurement & Control Technology
基金 国家自然科学基金资助项目(60402036) 北京市基金资助项目(4042008) 教育部博士点基金资助项目(20040005015)
关键词 DCT系数 字符定位 压缩域处理 加权频率 自适应阈值 DCT coefficient text location com- pressed-domain processing weighted frequency adaptive threshold
  • 相关文献

参考文献11

  • 1黄祥林,沈兰荪.基于DCT压缩域的纹理图像分类[J].电子与信息学报,2002,24(2):216-221. 被引量:27
  • 2李晓华,沈兰荪.基于压缩域的图像检索技术[J].计算机学报,2003,26(9):1051-1059. 被引量:22
  • 3黄祥林,沈兰荪.基于DCT压缩域的图象字符定位[J].中国图象图形学报(A辑),2002,7(1):22-26. 被引量:18
  • 4Chen X, Yang J, et al. Automatic detection of signs with affine transformation[A]. Applications of Computer Vision(WACV)[C],Pittsburgh ,2002.32-26.
  • 5Yang J, Chen X, et al. Automatic detection and translation of text from natural scenes[A]. Acoustics, Speech, and Signal Processing(ICASSP)[C]. Orlando, FL USA, 2002.
  • 6Wang K. Character Location in Scene Images from Digital Camera[J]. Pattern Recognition, 2003,36:2 287-2 299
  • 7Li C, Ding X, et al. Automatic text location in natural scene images[A]. Document Analysis and Recognition [C]. Seattle, WA USA, 2001.
  • 8Li H, Doermann D. A video text detection system based on automated training [J].Pattern Recognition. 2000,(2):223-226.
  • 9Xi Jie, Hua Xiansheng, et al. A video text detection and recognition system[J]. Multimedia and Expo, 2001,(8):873-876.
  • 10Zhong Yu, Zhang Hongjiang, et al. Automatic caption localization in compressed video[J]. IEEE Trans Pattern Analysis and Machine Intelligence, 2000,22(4):385-392.

二级参考文献61

  • 1胡守仁 余少波.神经网络导论[M].长沙:国防科技大学出版社,1992.113-129.
  • 2Mandal M K. Wavelet based coding and indexing of images and video-Ph D dissertationS. University of Ottawa, Ottawa, Canada, 1998.
  • 3Chang Shih-Fu. Compressed-domain techniques for image/video indexing and manipulation. In: Proceedings of IEEE International Conference on Image Processing, Washington, DC,USA, 1995. 314-317.
  • 4Ma W Y, Manjunath B S, A comparison of wavelet transform features for texture image annotation, In: Proceedings of IEEE International Conference on Image Processing, Washington,DC,USA, 1995. 256-259.
  • 5Lee Moon-Chuen, Pun Chi-Man. Texture classification using dominant wavelet packet energy features. In: Proceedings of IEEE Southwest Symposium on Image Analysis and Interpretation, Austin, TX, USA, 2000. 301-304.
  • 6Chang T, Kuo C C J. Texture analysis and classification withtree-structured wavelet transform. IEEE Transactions on Image Processing,1993, 2(4) : 429-441.
  • 7Mandal M K, Aboulnasr T, Panchanathan S. Fast wavelet histogram techniques for image indexing. Journal of Computer Vision and Image Understanding. 1999, 75(1) : 99-110.
  • 8Smith J R, Chang S F. Automated binary texture feature setsfor image retrieval. Ins Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Atlanta,1996. 2239-2242.
  • 9Seales W B, Yuan C J, Brown M. Efficient content extractionin compressed images. In: Proceedings of IEEE Workshop on Content-Based Access of Image and Video Libraries, San Juan,.Puerto Rico,1997. 52-58.
  • 10Yu Hong Heather. Visual image retrieval on compressed domain with Q-distance. In: Proceedings of IEEE International Conference on Computational Intelligence and Multimedia Applications, New Delhi, India, 1999. 1013-1016.

共引文献54

同被引文献16

  • 1Li Huiping, Doermann D. A video text detection system based on automated training [ J ]. Pattern Recognition.2000,2:223 -226.
  • 2Lienhart R, et al. Localizing and segmenting text in images and videos [ J ]. IEEE transactions on Circuits and Systems for Video Technology. 2002, 12 ( 4 ) : 256 -268.
  • 3Zhong Yu, Zhang Hongjiang, et al. Automatic caption localization in compressed video[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000,22(4) :385 -392.
  • 4Mao Wenge, Chung Fu-lai, et al. Hybrid Chinese/English text detection in images and video frames[ A]. Proc of 2002 Inter. Conf. on Pattern Recognition [ C ].Quebe, Canada: ICPR,2002. 1015-1018.
  • 5Shih Y F, Chen Shy-Shyan, et al. A documem segmentation,classification and recognition system [A]. Proc of the Second Inter. Conf. on Systems Integration [ C ].Morristown, N J, 1992,258 - 267.
  • 6Lyu M R, Song Jiqiang,Cai Min.A comprehensive meth- od for multilingual video text detection, localization, and extraction[J].IEEE Transactions on Circuits and Systems for Video Technology,2005,15(2) :243-255.
  • 7Zhong Yu, Zhang Hongjiang, Jain A K.Automatic caption localization in compressed video[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000,22 (4) : 385-392.
  • 8Gu Lifang.Text detection and extraction in MPEG se- quences[C]//Proceedings of CBMI' 01, Brescia, Italy, 2001: 19-21.
  • 9Zhou Qiya, Yang Gaobo, Chen Weiwei, et al.A fast and accurate moving object extraction scheme in the MPEG compressed domain[C]//Proceedings of ICIG, Chengdu, China, 2007 : 592-597.
  • 10Qian Xueming,Liu Guizhong, Wang Huan, et al.Text de- tection, localization, and tracking in compressed video[J]. Signal Processing: Image Communication, 2007,22 : 752-768.

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部