一种新的DCT压缩域字符快速定位算法被引量：2

A New Fast Text Location Algorithm in DCT-Compressed Domain

下载PDF

导出

摘要提出了一种基于DCT压缩域的字符定位算法,能够快速定位出具有复杂背景图像中的字符区域。该算法面向部分解码后的JPEG图像,从Y分量DCT压缩码流中提取出一种新的字符/非字符分类特征,并采用自适应阈值法实现分类,利用投影法确定出字符区的位置。实验表明,该算法对不同复杂背景下的JPEG图像,可以有效实现中、英文字符区的提取,查全率和查准率可以达到90%以上,处理速度快,能够实现实时处理。 A fast and efficient automatic text location method is presented. Text regions are segmented from JPEG compressed images with complex background us- ing a new feature, which is extracted from the DCT- compressed domain. Hence only a very small amount of decompressing operations is required. Then a projecting and merging algorithm is used to locate the final text ar- eas. Experimental results show that this method works well on various language text locations with precision and recall of more than 90%.

作者孙慧平刘党辉沈兰荪

机构地区北京工业大学信号与信息处理研究室

出处《测控技术》 CSCD 2005年第5期48-51,共4页 Measurement & Control Technology

基金国家自然科学基金资助项目(60402036) 北京市基金资助项目(4042008) 教育部博士点基金资助项目(20040005015)

关键词 DCT系数字符定位压缩域处理加权频率自适应阈值 DCT coefficient text location com- pressed-domain processing weighted frequency adaptive threshold

分类号 TP391.4 [自动化与计算机技术—计算机应用技术] TN919.8 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献11

1黄祥林,沈兰荪.基于DCT压缩域的纹理图像分类[J].电子与信息学报,2002,24(2):216-221. 被引量：27
2李晓华,沈兰荪.基于压缩域的图像检索技术[J].计算机学报,2003,26(9):1051-1059. 被引量：22
3黄祥林,沈兰荪.基于DCT压缩域的图象字符定位[J].中国图象图形学报（A辑）,2002,7(1):22-26. 被引量：18
4Chen X, Yang J, et al. Automatic detection of signs with affine transformation[A]. Applications of Computer Vision(WACV)[C],Pittsburgh ,2002.32-26.
5Yang J, Chen X, et al. Automatic detection and translation of text from natural scenes[A]. Acoustics, Speech, and Signal Processing(ICASSP)[C]. Orlando, FL USA, 2002.
6Wang K. Character Location in Scene Images from Digital Camera[J]. Pattern Recognition, 2003,36:2 287-2 299
7Li C, Ding X, et al. Automatic text location in natural scene images[A]. Document Analysis and Recognition [C]. Seattle, WA USA, 2001.
8Li H, Doermann D. A video text detection system based on automated training [J].Pattern Recognition. 2000,(2):223-226.
9Xi Jie, Hua Xiansheng, et al. A video text detection and recognition system[J]. Multimedia and Expo, 2001,(8):873-876.
10Zhong Yu, Zhang Hongjiang, et al. Automatic caption localization in compressed video[J]. IEEE Trans Pattern Analysis and Machine Intelligence, 2000,22(4):385-392.

二级参考文献61

1胡守仁余少波.神经网络导论[M].长沙:国防科技大学出版社,1992.113-129.
2Mandal M K. Wavelet based coding and indexing of images and video-Ph D dissertationS. University of Ottawa, Ottawa, Canada, 1998.
3Chang Shih-Fu. Compressed-domain techniques for image/video indexing and manipulation. In: Proceedings of IEEE International Conference on Image Processing, Washington, DC,USA, 1995. 314-317.
4Ma W Y, Manjunath B S, A comparison of wavelet transform features for texture image annotation, In: Proceedings of IEEE International Conference on Image Processing, Washington,DC,USA, 1995. 256-259.
5Lee Moon-Chuen, Pun Chi-Man. Texture classification using dominant wavelet packet energy features. In: Proceedings of IEEE Southwest Symposium on Image Analysis and Interpretation, Austin, TX, USA, 2000. 301-304.
6Chang T, Kuo C C J. Texture analysis and classification withtree-structured wavelet transform. IEEE Transactions on Image Processing,1993, 2(4) : 429-441.
7Mandal M K, Aboulnasr T, Panchanathan S. Fast wavelet histogram techniques for image indexing. Journal of Computer Vision and Image Understanding. 1999, 75(1) : 99-110.
8Smith J R, Chang S F. Automated binary texture feature setsfor image retrieval. Ins Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Atlanta,1996. 2239-2242.
9Seales W B, Yuan C J, Brown M. Efficient content extractionin compressed images. In: Proceedings of IEEE Workshop on Content-Based Access of Image and Video Libraries, San Juan,.Puerto Rico,1997. 52-58.
10Yu Hong Heather. Visual image retrieval on compressed domain with Q-distance. In: Proceedings of IEEE International Conference on Computational Intelligence and Multimedia Applications, New Delhi, India, 1999. 1013-1016.

共引文献54

1周海萍.新课程应加强体验教学[J].青海师专学报,2005(S1):77-78. 被引量：1
2李晓华,沈兰荪.基于小波压缩域的统计纹理特征提取方法[J].电子学报,2003,31(z1):2123-2126. 被引量：8
3赵士伟,卓力,沈兰荪.压缩域视频处理关键帧提取技术的初步研究[J].计算机应用研究,2009,26(2):744-746.
4董卫军,周明全,黎晓,耿国华.基于小波分析的边缘检测技术研究[J].计算机工程与应用,2004,40(25):38-40. 被引量：15
5廖红文,冯国灿,Jiang Jianmin.压缩域上人脸识别的研究[J].中山大学学报（自然科学版）,2004,43(5):16-19. 被引量：1
6张二虎,张绪进,段敬红.一种改进的基于DCT压缩域的图像字符定位方法[J].计算机工程与应用,2004,40(27):97-98.
7王田,杨士中.增强无线视频图像传输差错恢复能力的方法研究[J].中国图象图形学报（A辑）,2004,9(10):1204-1209. 被引量：2
8熊回香.基于内容的图像检索技术的发展方向[J].现代图书情报技术,2004(12):32-35. 被引量：12
9芮挺,王金岩,沈春林,丁健.采用离散余弦变换的小波图像去噪方法[J].光电工程,2005,32(1):51-54. 被引量：9
10张玉新,吴玲达,谢毓湘,栾希道.一种基于小波变换与分形编码的新闻图片检索方法[J].计算机应用研究,2005,22(2):250-251. 被引量：1

同被引文献16

1Li Huiping, Doermann D. A video text detection system based on automated training [ J ]. Pattern Recognition.2000,2:223 -226.
2Lienhart R, et al. Localizing and segmenting text in images and videos [ J ]. IEEE transactions on Circuits and Systems for Video Technology. 2002, 12 ( 4 ) : 256 -268.
3Zhong Yu, Zhang Hongjiang, et al. Automatic caption localization in compressed video[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000,22(4) :385 -392.
4Mao Wenge, Chung Fu-lai, et al. Hybrid Chinese/English text detection in images and video frames[ A]. Proc of 2002 Inter. Conf. on Pattern Recognition [ C ].Quebe, Canada: ICPR,2002. 1015-1018.
5Shih Y F, Chen Shy-Shyan, et al. A documem segmentation,classification and recognition system [A]. Proc of the Second Inter. Conf. on Systems Integration [ C ].Morristown, N J, 1992,258 - 267.
6Lyu M R, Song Jiqiang,Cai Min.A comprehensive meth- od for multilingual video text detection, localization, and extraction[J].IEEE Transactions on Circuits and Systems for Video Technology,2005,15(2) :243-255.
7Zhong Yu, Zhang Hongjiang, Jain A K.Automatic caption localization in compressed video[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000,22 (4) : 385-392.
8Gu Lifang.Text detection and extraction in MPEG se- quences[C]//Proceedings of CBMI' 01, Brescia, Italy, 2001: 19-21.
9Zhou Qiya, Yang Gaobo, Chen Weiwei, et al.A fast and accurate moving object extraction scheme in the MPEG compressed domain[C]//Proceedings of ICIG, Chengdu, China, 2007 : 592-597.
10Qian Xueming,Liu Guizhong, Wang Huan, et al.Text de- tection, localization, and tracking in compressed video[J]. Signal Processing: Image Communication, 2007,22 : 752-768.

引证文献2

1孙慧平,刘党辉,沈兰荪.基于DCT压缩域的快速字符定位算法研究[J].电子学报,2006,34(4):751-754. 被引量：4
2周启亚,杨高波.MPEG-2压缩域车辆牌照字符提取算法[J].计算机工程与应用,2012,48(33):197-202.

二级引证文献4

1胡正平,王瑾.多尺度-方向笔画结合SVM验证的文字区域定位[J].仪器仪表学报,2010,31(4):916-922. 被引量：2
2张阳,王嘉梅.一种改进的小波变换域的字符定位方法[J].微型机与应用,2011,30(18):35-37. 被引量：4
3叶龙欢,王俊峰,高琳,袁军.复杂背景下的票据字符分割方法[J].计算机应用,2012,32(11):3198-3200. 被引量：7
4史小松,黄勇杰,刘永革.基于阈值分割和形态学的甲骨拓片文字定位方法[J].北京信息科技大学学报（自然科学版）,2014,29(6):7-10. 被引量：4

1孙慧平,刘党辉,沈兰荪.基于DCT压缩域的快速字符定位算法研究[J].电子学报,2006,34(4):751-754. 被引量：4
2牛晓霞,王成儒.基于DCT域的公路车牌定位算法[J].微处理机,2010,31(4):75-77. 被引量：1
3张曦煌,卞国春,李红.基于统计特征的DCT压缩域纹理图像检索方法[J].计算机工程与设计,2006,27(7):1282-1285. 被引量：3
4张二虎,张绪进,段敬红.一种改进的基于DCT压缩域的图像字符定位方法[J].计算机工程与应用,2004,40(27):97-98.
5郑猛,郑世宝,王慈.一种DCT域实现图像分数倍尺度变换的方法[J].上海交通大学学报,2004,38(9):1515-1518. 被引量：1
6黄祥林,沈兰荪.基于DCT压缩域的图象字符定位[J].中国图象图形学报（A辑）,2002,7(1):22-26. 被引量：18
7张二虎,张绪进,张志刚.小波变换域中图像字符的定位提取方法[J].应用科学学报,2006,24(2):135-139. 被引量：2
8夏春艳,李树平,宋志超.基于粗糙集理论属性约简的改进算法[J].微计算机信息,2010,26(36):282-283. 被引量：4
9人工智能[J].中国学术期刊文摘,2006,12(10):145-149.
10刘艳,李宏东.DCT域图象处理和特征提取技术[J].中国图象图形学报（A辑）,2003,8(2):121-128. 被引量：21

测控技术

2005年第5期

浏览历史

内容加载中请稍等...

一种新的DCT压缩域字符快速定位算法被引量：2

参考文献11

二级参考文献61

共引文献54

同被引文献16

引证文献2

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

一种新的DCT压缩域字符快速定位算法 被引量：2

参考文献11

二级参考文献61

共引文献54

同被引文献16

引证文献2

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

一种新的DCT压缩域字符快速定位算法被引量：2