期刊文献+

NSCT子带纹理特征融合的中亚文种识别 被引量:1

Script identification of central Asian based on fusion texture feature of NSCT sub-bands
下载PDF
导出
摘要 由于中亚地区某些文种相似度较高,单一纹理特征不能充分描述它们的纹理特点。为此,提出基于NSCT子带纹理特征融合的文种识别方法,即先对预处理后的文档图像进行非下采样Contourlet变换。对变换产生的子带分别提取局部二值模式和灰度共生矩阵特征,生成高维融合特征向量,通过主成分分析法对其进行降维生成低维特征向量。通过对阿拉伯文、俄文、藏文、中文、维吾尔文、英文、蒙古文、吉尔吉斯斯坦文、哈萨克斯坦文、土耳其文进行实验,验证了该方法能更准确地提取文档图像多尺度、多方向的纹理特征,有效提高识别率。 Due to the higher similarity of some scripts in Central Asia,a single texture feature can not adequately describe their texture feature.To solve this problem,a script-identification method based on fusion texture feature of nonsubsampled Contourlet transform sub-bands was proposed.The preprocessed document images were subjected to nonsubsampled Contourlet transform firstly.The local binary patterns and the gray level co-occurrence matrix features were extracted from the sub-bands gene rated by the transformation,and the high-dimensional fusion feature vector was generated.The principal component analysis was used to reduce dimension to generate low-dimensional feature vectors.Experiments on Arabic,Russian,Tibetan,Chinese,Uyghur,English,Mongolian,Kyrgyzstan,Kazakhstan,and Turkish verify that the proposed method can more accurately extract the multi-scale and multi-directional texture features of document images,and can improve the recognition rate effectively.
作者 韩兴坤 阿力木江.艾沙 努尔毕亚.亚地卡尔 朱亚俐 库尔班.吾布力 HAN Xing-kun;Alimjan Aysa;Nurbiya Yadikar;ZHU Ya-li;Kurban Ubul(School of Information Science and Engineering,Xinjiang University,Urumqi 830046,China;Network and Information Center,Xinjiang University,Urumqi 830046,China)
出处 《计算机工程与设计》 北大核心 2018年第9期2848-2855,共8页 Computer Engineering and Design
基金 国家自然科学基金项目(61363064 61563052 61163028) 新疆大学博士科研启动基金项目(BS150262)
关键词 文种识别 融合纹理特征 非下采样CONTOURLET变换 局部二值模式 灰度共生矩阵 支持向量机 script identification fusion texture features nonsubsampled Contourlet transform local binary patterns gray level co-occurrence matrix support vector machine
  • 相关文献

参考文献2

二级参考文献34

  • 1刘宁,裴雷.彩色激光打印机、复印机同一认定新方法[J].江苏警官学院学报,2005,20(2):165-170. 被引量:20
  • 2Spitz A L.Determination of the script and language content of document images[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1997,19(3):235-245.
  • 3Pal U,Chaudhuri B B.Identification of different script lines from multi-script documents[J].Image and Vision Computing,2002,20..945-954.
  • 4Elgammal A M,Ismail M A.Techniques for language identification for hybrid arabic-english document images[C]// Proc of 6th International Conference on Document Analysis and Recognition.Seattle,USA:IEEE Computer Society,2001:1100-1104.
  • 5Hochberg J,Kelly P,Thomas T.Automatic script identification from images using cluster-based templates[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1997,19(2):176-181.
  • 6Nakayama T,Spitz A L.European language determination from image[C]// Proc of the International Conference on Document Analysis and Recognition.Tsukuba,Japan:IEEE Computer Society,1993:159-162.
  • 7Busch A,Boles W W,Sridharan S.Texture for script identification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(11):1720-1732.
  • 8Tan T N.Rotation invariant texture features and their use in automatic script identification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1998,20(7):751-756.
  • 9Padma M C,Vijaya P A.Entropy based texture features useful for automatic script identification[J].International Journal on Computer Science and Engineering,2010,2(2):115-120.
  • 10Hiremath P S,Shivashankar S.Wavelet based co-occurrence histogram features for texture classification with an application to script identification in a document images[J].Pattern Recognition Letters,2008,29(9):1182-1189.

共引文献5

同被引文献11

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部