
Script identification of Central Asian document images based on LTP and HOG texture feature fusion (cited by: 3)
Abstract Because several scripts used in Central Asia are highly similar, a script identification method for document images is proposed, based on the cross-fusion of rotation invariant uniform local ternary pattern (riu2-LTP) features and histogram of oriented gradients (HOG) features. An SVM classifier was evaluated on a database of 10 000 images covering 10 scripts, and the SVM hyperparameters were tuned by Bayesian optimization to improve multi-script identification. The method first extracts riu2-LTP features with a radius of 1 and 8 sampling points from the document images, then extracts HOG features from the same database, and finally uses cross-fusion to merge the 20-dimensional riu2-LTP features and the 36-dimensional HOG features, in turn, into a new feature set. Experiments show that the method reaches an average precision of 99%, outperforming LTP, riu2-LTP, and HOG used individually.
Authors WU Zhengjian; MUTALLIP Mamut; HORNISA Mamat; ALIM Aysa; KURBAN Ubu (School of Information Science Engineering, Xinjiang University, Urumqi 830046, Xinjiang, China; The Library, Xinjiang University, Urumqi 830046, Xinjiang, China; The Key Lab. of Xinjiang Multilingual Information Technology, Urumqi 830046, Xinjiang, China)
Source Journal of Shandong University (Engineering Science), indexed in CAS, CSCD, and the Peking University Core Journals list, 2021, No. 2, pp. 115-121 (7 pages)
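Funding National Natural Science Foundation of China (61862061, 6161563052, 61363064); Xinjiang University Doctoral Research Start-up Fund (BS180268); Innovation Team Fund of the University Scientific Research Program of Xinjiang Uygur Autonomous Region (XJEDU2017T002).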
Keywords LTP; HOG; feature fusion; Bayesian optimization; script identification
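The feature pipeline summarized in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the ternary threshold of 5, the use of pixel-centre neighbours without interpolation, and the function names `riu2_codes`, `riu2_ltp_histogram`, and `cross_fuse` are all hypothetical. The abstract does not spell out the cross-fusion rule, so element-by-element interleaving is assumed here, and a random vector stands in for the 36-dimensional HOG descriptor. With 8 sampling points, the upper and lower ternary patterns each map to 8 + 2 = 10 riu2 labels, which gives the 20 dimensions quoted in the abstract.

```python
import numpy as np

# Offsets of the 8 neighbours at radius 1, in circular order.
# Assumption: diagonal neighbours are read at pixel centres (no interpolation).
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def riu2_codes(bits):
    """Map P binary planes of shape (P, H, W) to riu2 labels in 0..P+1."""
    p = bits.shape[0]
    # Count 0/1 transitions around the circular pattern.
    transitions = np.sum(bits != np.roll(bits, 1, axis=0), axis=0)
    ones = bits.sum(axis=0)
    # Uniform patterns (at most 2 transitions) are labelled by their number
    # of ones; all non-uniform patterns share the label P + 1.
    return np.where(transitions <= 2, ones, p + 1)


def riu2_ltp_histogram(image, threshold=5):
    """Return a normalized 20-bin riu2-LTP histogram (P=8, R=1).

    The threshold value is an assumption, not taken from the paper.
    """
    img = image.astype(np.int32)
    h, w = img.shape
    centre = img[1:h - 1, 1:w - 1]
    neigh = np.stack(
        [img[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx] for dy, dx in OFFSETS]
    )
    # LTP splits the ternary code into an upper and a lower binary pattern.
    upper = (neigh >= centre + threshold).astype(np.int32)
    lower = (neigh <= centre - threshold).astype(np.int32)
    p = len(OFFSETS)
    hists = [
        np.bincount(riu2_codes(b).ravel(), minlength=p + 2)
        for b in (upper, lower)
    ]
    feat = np.concatenate(hists).astype(np.float64)
    return feat / max(feat.sum(), 1.0)


def cross_fuse(a, b):
    """Interleave two vectors element by element, then append the leftover.

    Assumed reading of "cross-fusion"; the paper may define it differently.
    """
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    n = min(len(a), len(b))
    fused = np.empty(2 * n, dtype=np.float64)
    fused[0::2] = a[:n]
    fused[1::2] = b[:n]
    return np.concatenate([fused, a[n:], b[n:]])


rng = np.random.default_rng(0)
page = rng.integers(0, 256, size=(64, 64))   # synthetic stand-in for a page image
ltp = riu2_ltp_histogram(page)
hog = rng.random(36)                         # placeholder for a 36-dim HOG vector
fused = cross_fuse(ltp, hog)
print(ltp.shape, fused.shape)
```

The fused 56-dimensional vector would then be passed to a classifier such as an SVM, with hyperparameters tuned separately.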











