A learning-based method to detect and segment text from scene images (Cited by: 3)

Abstract: This paper proposes a learning-based method for text detection and text segmentation in natural scene images. First, the input image is decomposed into multiple connected components (CCs) by the Niblack clustering algorithm. All CCs, text and non-text alike, are then verified on their text features by a two-stage classification module: an attentional cascade classifier discards most non-text CCs, and an SVM further verifies the remaining ones. The accepted CCs are output to form a text-only binary image. Experiments on many images from different scenes showed satisfactory performance of the proposed method.
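The pipeline outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the textbook Niblack rule (local threshold = mean + k * standard deviation), 8-connected component labeling, and a generic two-stage filter. The window radius, the value of k, the cascade tests, and the `verifier` callable (a stand-in for the trained SVM) are all hypothetical placeholders chosen for illustration, since the record does not give the features or parameters used in the paper.

```python
from collections import deque
from statistics import mean, pstdev


def niblack_binarize(img, radius=1, k=-0.2):
    """Mark a pixel as foreground when it is darker than the local
    threshold mean + k * std of its (2*radius+1)^2 window (assumed values)."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [img[j][i]
                      for j in range(max(0, y - radius), min(h, y + radius + 1))
                      for i in range(max(0, x - radius), min(w, x + radius + 1))]
            threshold = mean(window) + k * pstdev(window)
            out[y][x] = 1 if img[y][x] < threshold else 0
    return out


def connected_components(mask):
    """Group foreground pixels into 8-connected components (lists of (y, x))."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if not mask[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue, comp = deque([(y, x)]), []
            while queue:
                cy, cx = queue.popleft()
                comp.append((cy, cx))
                for ny in range(max(0, cy - 1), min(h, cy + 2)):
                    for nx in range(max(0, cx - 1), min(w, cx + 2)):
                        if mask[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            comps.append(comp)
    return comps


def two_stage_filter(comps, cascade, verifier):
    """Stage 1: drop any CC that fails one of the cheap cascade tests.
    Stage 2: keep only survivors accepted by the verifier (SVM stand-in)."""
    survivors = [c for c in comps if all(test(c) for test in cascade)]
    return [c for c in survivors if verifier(c)]


def height(comp):
    return max(y for y, _ in comp) - min(y for y, _ in comp) + 1


def width(comp):
    return max(x for _, x in comp) - min(x for _, x in comp) + 1


# Toy image: light background (200), a dark 3-pixel stroke, one noise pixel.
B, D = 200, 20
img = [
    [B, B, B, B, B, B],
    [B, D, B, B, B, B],
    [B, D, B, B, D, B],
    [B, D, B, B, B, B],
    [B, B, B, B, B, B],
]

mask = niblack_binarize(img)
comps = connected_components(mask)
cascade = [lambda c: len(c) >= 2]            # hypothetical cheap test: minimum area
verifier = lambda c: height(c) >= width(c)   # hypothetical stand-in for the SVM
kept = two_stage_filter(comps, cascade, verifier)

# Text-only binary image built from the accepted CCs.
text_only = [[0] * len(img[0]) for _ in img]
for comp in kept:
    for y, x in comp:
        text_only[y][x] = 1
```

The ordering mirrors the design described in the abstract: the cascade stage runs only inexpensive tests so that most non-text CCs are rejected early, and the costlier verifier sees only the survivors.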

Authors: JIANG Ren-jie, QI Fei-hu, XU Li, WU Guo-rong, ZHU Kai-hua

Affiliation: Department of Computer Science and Technology

Source: Journal of Zhejiang University-Science A (Applied Physics & Engineering), 2007, Issue 4, pp. 568-574 (7 pages). Indexed in SCIE, EI, CAS, CSCD.

Funding: Project supported by the OMRON and SJTU Collaborative Foundation under the PVS project (2005.03~2005.10)

Keywords: Text detection; Text segmentation; Text feature; Attentional cascade; Scene image; Learning algorithm

Classification: TP391.41 [Automation and Computer Technology: Computer Application Technology]


