期刊文献+

基于OCR信息的JBIG2编码算法 被引量:3

Lossy JBIG2 based on optical character recognition
原文传递
导出
摘要 二值图像编码在文本存储、图象检索中有广泛的应用。为了提高二值图像的压缩比,提出了一种利用OCR结果的JB IG 2(jo in t b i-leve l im age group)编码算法。它在对二值文本图像进行基于模式匹配的压缩时,利用了OCR识别结果和识别置信度的信息,从而更好地完成了字模重建和模式匹配的处理,提高了JB IG 2算法的性能。图像中所有识别结果可信的字符被重建字模代替,编码器只需编码字符的位置。实验结果表明:该算法优于以往JB IG 2算法的效果,它可以获得高于以往有损压缩算法的图像质量,并在实验图像上得到高于以往无损压缩算法14.3%的压缩比。 Bi-level image coding is useful for document storage and archiving, image searches on the Internet and digital libraries. The JBIG2 (joint hi-level image group) standard for lossless and lossy coding of hi-level images is a very flexible encoding strategy which allows researchers to design their own encoders. OCR processing of text images is one encoding technique that gives measurable recognition and the confidence results. We propose a lossy JBIG2 encoding method which uses OCR processing results to improve text image compression based on pattern matching. All the credible recognized characters in the image are replaced by representative character images so that the encoder only needs to mark the positions of these characters. Experiment results show that this method gives better results than previous JBIG2 encoding methods with 14.3% less storage compared to previous lossless methods while preserving relatively good text image quality.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2006年第7期1247-1249,1253,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金资助项目(60472002)
关键词 模式识别 二值图像编码 文本图像压缩 OCR 模式匹配 pattern recognition bi-level image coding text image compression OCR pattern matching
  • 相关文献

参考文献7

  • 1ISO/IEC JTC1/SC29/WG1 N1545.JBIG2 Final Draft International Standard[S].1999.
  • 2Ascher R N,Nagy G.Means for achieving a high degree of compaction on scan-digitized printed text[J].IEEE Trans on Computers,1974,23(11):1174-1179.
  • 3Howard P.Lossless and lossy compression of text images by soft pattern matching[C]∥ Proc of the 1996 IEEE Data Compression Conference.Snowbird,Utah:IEEE,1996:210-219.
  • 4YE Yan.Text Image Compression Based on Pattern Matching[D].San Diego,CA:University of California,San Diego,2002.
  • 5林晓帆,丁晓青,吴佑寿,陈友斌,刘今晖.字符识别的置信度分析[J].清华大学学报(自然科学版),1998,38(9):47-50. 被引量:13
  • 6Witten I H,Moffat A,Bell T C.Managing Gigabytes[M].San Francisco,CA:Morgan Kaufmann,1999.
  • 7Pratt W K,Capitant P J,Chen W,et al.Combined symbol matching facsimile data compression system[J].Proc of the IEEE,1980,68(7):786-796.

二级参考文献5

共引文献12

同被引文献27

  • 1Howard P G. Text Image Compression Using Soft Pattern Matching[J]. Computer Journal, 1997,40(3):146-166.
  • 2ISO/IEC JTC 1/SC 29/WG 1 (ITU-T SGS), Coding of Still Pictures[S]. 1999.
  • 3YE Yan. Text Image Compression Based on Pattern Matching[M]. CA: University of California, 2002.
  • 4Sayood K. Introduction to Data Compression[M]. Third Edition.北京:人民邮电出版社,2009.
  • 5Ascher R N, Nagy G. Means for Achieving a High Degree of Compaction on Scan-Digitized Printed Text[J]. IEEE Trans on Computers, 1974, 23(11) :1174 1179.
  • 6Haffner P,Bottou L,Howard P Get al.Browsing through high quality document images with DjVu. Proceedings of IEEE Advances in Digital Libraries . 1998
  • 7Romero R,Berger R,Thibadeau Ret al.Printed Chinese character recognition. http://www.cs.cmu.edu/afs/cs/project/pcvision/www/chinese.html . 2009
  • 8Haskell B G,Howard P G,Lecun Y A,et al.Image and video coding-emerging standards and beyond. IEEE Transaction on Circuits and Systems for Video Technology . 1998
  • 9Han,J,Kamber,M.Data mining concepts and techniques. . 2006
  • 10Ye Yan.Text I mage Compression Based on Pattern Matching. . 2002

引证文献3

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部