期刊文献+

一种具有旋转鲁棒性的文本图像文种识别方法 被引量:4

A Robust Rotation-invariant Script Identification Method of Document Images
下载PDF
导出
摘要 针对目前用于文本图像文种识别的纹理特征描述子对文字行倾斜缺乏不变性,采用可控金字塔变换提取文本图像的纹理特征,通过对特征空间元素重新排列,提出一种对文字行倾斜具有鲁棒性的文本图像文种识别方法。不同倾斜角度文本图像的文种识别结果表明,该算法具有较高的识别准确率并对文字行倾斜具有较强的鲁棒性。 Script identification is significant for attaining information from document images. Most algorithms on texturc feature extraction from document images for script identification are inadaptable to the skew of text line presently. For the skew of text line is inevitably, a new algorithm robust to the skew of text line is proposed. Steerable Pyramid transform is used on the document images and the energy statistical features of sub-bands is extracted. Through the realignment of features, the algorithm implements robustness to rotation. Libsvm is used as a classifier. The experiments are eonducted on image database containing ten scripts that are scanned from books or magazines. The test samples are rotated with different angles and the results confirm that the algorithm can identify scripts accurately and is robust to the skew of text line simuhaneously.
出处 《中国图象图形学报》 CSCD 北大核心 2010年第6期879-886,共8页 Journal of Image and Graphics
基金 国家自然科学基金项目(60473022)
关键词 文种识别 可控金字塔变换 纹理特征 文本图像 seript identification, Steerable Pyramid transform, texture feature, document images
  • 相关文献

参考文献15

  • 1Nakayama T,Spitz A L.European language determination from image[C] //Proceedings of the International Conference on Document Analysis and Recognition.Tsukuba,Japan; University of Tsukuba,1993:159-162.
  • 2Spitz A L.Script and language determination from document images[C] //Proceedings of Third Annual Symplic Document Analysis Information Retrieval.Las Vegas,America:University of Las Vegas,1994:229-235.
  • 3Elgammal A M,Ismail M A.Techniques for language identification for hybrid Arabic-English document images[C] //Proceedings of Sixth International Conference on Document Analysis and Recognition.Seattle,Washington DC,America:University of Seattle,2001:1100-1104.
  • 4Ding J,Lam L,Suen C Y.Classification of oriental and European scripts by using characteristic features[C]//Proceedings of ICDAR[C].Ulm,Germany:IEEE Computer Society,1997:1023-1027.
  • 5Pal U,Chaudhuri B B.Identification of different script lines from multi-script documents[J].Image and Vision computing,2002,20(13-14):945-954.
  • 6Spitz A L.Determination of the script and language content of document images[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1997,19(3):235-245.
  • 7Hochberg J,Kelly P,Thomas T.Automatic script identification from images using cluster-based templates[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1997,19(2):176-181.
  • 8Busch A,Boise W W,Sridharan S.Texture for script identification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(11):1720-1732.
  • 9Tan T.Rotation invariant texture features and their use in automatic script identification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1998,20(7):751-756.
  • 10曾理,唐远炎,陈廷槐.基于多尺度小波纹理分析的文字种类自动识别[J].计算机学报,2000,23(7):699-704. 被引量:26

二级参考文献2

  • 1Tan T N,IEEE Trans Pattern Anal Machine Intell,1998年,20卷,7期,751页
  • 2Ding Jie,Proceedings of the International Conference on Document Analysis,1996年,1023页

共引文献25

同被引文献33

  • 1薛明东,郭立,张国宣,刘士建.一种新的图像识别算法[J].计算机工程,2005,31(9):173-175. 被引量:4
  • 2陈燕东,刘景琳,孟志强.新型实时光电混合图像识别系统设计[J].电子测量与仪器学报,2007,21(3):103-107. 被引量:3
  • 3陆小川,伊兵哲,平西建,程娟.含噪文本图像的中英文文种识别研究[J].计算机工程与设计,2007,28(21):5150-5152. 被引量:3
  • 4Spitz A L.Determination of the script and language content of document images[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1997,19(3):235-245.
  • 5Pal U,Chaudhuri B B.Identification of different script lines from multi-script documents[J].Image and Vision Computing,2002,20..945-954.
  • 6Elgammal A M,Ismail M A.Techniques for language identification for hybrid arabic-english document images[C]// Proc of 6th International Conference on Document Analysis and Recognition.Seattle,USA:IEEE Computer Society,2001:1100-1104.
  • 7Hochberg J,Kelly P,Thomas T.Automatic script identification from images using cluster-based templates[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1997,19(2):176-181.
  • 8Nakayama T,Spitz A L.European language determination from image[C]// Proc of the International Conference on Document Analysis and Recognition.Tsukuba,Japan:IEEE Computer Society,1993:159-162.
  • 9Busch A,Boles W W,Sridharan S.Texture for script identification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(11):1720-1732.
  • 10Tan T N.Rotation invariant texture features and their use in automatic script identification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1998,20(7):751-756.

引证文献4

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部