
Research on a Page Segmentation Algorithm for Document Images (文本图像页面分割算法研究)    Cited by: 6

An algorithm for document images segmentation
Abstract: A page segmentation algorithm for document images is proposed, based on an improved texture spectrum. The algorithm first applies an improved recursive projection profile cutting algorithm to coarsely segment the document page, and computes texture spectrum features of small image windows via the texture unit. It then classifies adjacent windows with a minimum-distance method, and finally obtains a precise segmentation of the page into text and non-text regions. Experiments show that the proposed method works well on document pages containing text, figures, and tables, and that it also adapts to other complex document pages.
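The texture-spectrum and minimum-distance steps named in the abstract can be illustrated with a small sketch. The abstract does not say what the "improved" texture spectrum changes, so the code below falls back on the classic texture unit definition of He and Wang: each of the 8 neighbours of a pixel is coded 0, 1, or 2 (less than, equal to, or greater than the centre), giving a texture unit number in 0..6560, and the spectrum is the histogram of those numbers over a window. The neighbour ordering, the L1 distance, the function names, and the toy reference windows are all assumptions made for illustration, not details taken from the paper.

```python
from typing import Dict, Sequence

Image = Sequence[Sequence[int]]  # grayscale image as rows of pixel values

# Offsets of the 8 neighbours in a fixed clockwise order starting top-left.
# The ordering is an assumption; any fixed ordering yields a valid spectrum.
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
              (1, 1), (1, 0), (1, -1), (0, -1)]


def texture_unit_number(img: Image, r: int, c: int) -> int:
    """Texture unit number (0..6560) of the 3x3 neighbourhood centred at (r, c)."""
    centre = img[r][c]
    number = 0
    for i, (dr, dc) in enumerate(NEIGHBOURS):
        value = img[r + dr][c + dc]
        # Code each neighbour as 0 (darker), 1 (equal) or 2 (brighter).
        code = 0 if value < centre else (1 if value == centre else 2)
        number += code * 3 ** i
    return number


def texture_spectrum(img: Image) -> Dict[int, float]:
    """Normalised histogram of texture unit numbers over all interior pixels."""
    counts: Dict[int, int] = {}
    rows, cols = len(img), len(img[0])
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            n = texture_unit_number(img, r, c)
            counts[n] = counts.get(n, 0) + 1
    total = sum(counts.values())
    return {n: k / total for n, k in counts.items()}


def spectrum_distance(a: Dict[int, float], b: Dict[int, float]) -> float:
    """Sum of absolute differences between two sparse spectra (assumed metric)."""
    return sum(abs(a.get(k, 0.0) - b.get(k, 0.0)) for k in set(a) | set(b))


def classify(window: Image, references: Dict[str, Dict[int, float]]) -> str:
    """Return the label whose reference spectrum is closest to the window's."""
    spectrum = texture_spectrum(window)
    return min(references,
               key=lambda label: spectrum_distance(spectrum, references[label]))


if __name__ == "__main__":
    # Toy references: horizontal stripes stand in for text lines,
    # a constant window stands in for blank background.
    striped = [[0 if r % 2 == 0 else 255] * 5 for r in range(5)]
    blank = [[255] * 5 for _ in range(5)]
    refs = {"text": texture_spectrum(striped),
            "background": texture_spectrum(blank)}
    print(classify([[7] * 4 for _ in range(4)], refs))
```

In a fuller pipeline the windows passed to `classify` would come from the blocks produced by the coarse projection-profile segmentation, and the reference spectra would be estimated from labelled samples rather than synthetic patterns.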
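Source: Journal of University of Science and Technology of China (《中国科学技术大学学报》, JUSTC), indexed in CAS, CSCD, and the Peking University Core Journals list; 2010, No. 5, pp. 500-504 (5 pages)
Funding: Supported by the Key Project of the Natural Science Foundation of the Anhui Provincial Department of Education (KJ2009A054, KJ2007A076)
Keywords: document image; image segmentation; texture spectrum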

