期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
Arabic Optical Character Recognition:A Review
1
作者 Salah Alghyaline 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第6期1825-1861,共37页
This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingl... This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingly.The study describes the characteristics of the Arabic language,different types of OCR systems,different stages of the Arabic OCR system,the researcher’s contributions in each step,and the evaluationmetrics for OCR.The study reviews the existing datasets for the Arabic OCR and their characteristics.Additionally,this study implemented some preprocessing and segmentation stages of Arabic OCR.The study compares the performance of the existing methods in terms of recognition accuracy.In addition to researchers’OCRmethods,commercial and open-source systems are used in the comparison.The Arabic language is morphologically rich and written cursive with dots and diacritics above and under the characters.Most of the existing approaches in the literature were evaluated on isolated characters or isolated words under a controlled environment,and few approaches were tested on pagelevel scripts.Some comparative studies show that the accuracy of the existing Arabic OCR commercial systems is low,under 75%for printed text,and further improvement is needed.Moreover,most of the current approaches are offline OCR systems,and there is no remarkable contribution to online OCR systems. 展开更多
关键词 Arabic Optical character recognition(ocr) Arabic ocr software Arabic ocr datasets Arabic ocr evaluation
下载PDF
Support Vector Machine Based Handwritten Hindi Character Recognition and Summarization
2
作者 Sunil Dhankhar Mukesh Kumar Gupta +3 位作者 Fida Hussain Memon Surbhi Bhatia Pankaj Dadheech Arwa Mashat 《Computer Systems Science & Engineering》 SCIE EI 2022年第10期397-412,共16页
In today’s digital era,the text may be in form of images.This research aims to deal with the problem by recognizing such text and utilizing the support vector machine(SVM).A lot of work has been done on the English l... In today’s digital era,the text may be in form of images.This research aims to deal with the problem by recognizing such text and utilizing the support vector machine(SVM).A lot of work has been done on the English language for handwritten character recognition but very less work on the under-resourced Hindi language.A method is developed for identifying Hindi language characters that use morphology,edge detection,histograms of oriented gradients(HOG),and SVM classes for summary creation.SVM rank employs the summary to extract essential phrases based on paragraph position,phrase position,numerical data,inverted comma,sentence length,and keywords features.The primary goal of the SVM optimization function is to reduce the number of features by eliminating unnecessary and redundant features.The second goal is to maintain or improve the classification system’s performance.The experiment included news articles from various genres,such as Bollywood,politics,and sports.The proposed method’s accuracy for Hindi character recognition is 96.97%,which is good compared with baseline approaches,and system-generated summaries are compared to human summaries.The evaluated results show a precision of 72%at a compression ratio of 50%and a precision of 60%at a compression ratio of 25%,in comparison to state-of-the-art methods,this is a decent result. 展开更多
关键词 Support vector machine(SVM) optimization PRECISION Hindi character recognition optical character recognition(ocr) automatic summarization and compression ratio
下载PDF
Baseline Isolated Printed Text Image Database for Pashto Script Recognition
3
作者 Arfa Siddiqu Abdul Basit +3 位作者 Waheed Noor Muhammad Asfandyar Khan M.Saeed H.Kakar Azam Khan 《Intelligent Automation & Soft Computing》 SCIE 2023年第7期875-885,共11页
The optical character recognition for the right to left and cursive languages such as Arabic is challenging and received little attention from researchers in the past compared to the other Latin languages.Moreover,the... The optical character recognition for the right to left and cursive languages such as Arabic is challenging and received little attention from researchers in the past compared to the other Latin languages.Moreover,the absence of a standard publicly available dataset for several low-resource lan-guages,including the Pashto language remained a hurdle in the advancement of language processing.Realizing that,a clean dataset is the fundamental and core requirement of character recognition,this research begins with dataset generation and aims at a system capable of complete language understanding.Keeping in view the complete and full autonomous recognition of the cursive Pashto script.The first achievement of this research is a clean and standard dataset for the isolated characters of the Pashto script.In this paper,a database of isolated Pashto characters for forty four alphabets using various font styles has been introduced.In order to overcome the font style shortage,the graphical software Inkscape has been used to generate sufficient image data samples for each character.The dataset has been pre-processed and reduced in dimensions to 32×32 pixels,and further converted into the binary format with a black background and white text so that it resembles the Modified National Institute of Standards and Technology(MNIST)database.The benchmark database is publicly available for further research on the standard GitHub and Kaggle database servers both in pixel and Comma Separated Values(CSV)formats. 展开更多
关键词 Text-image database optical character recognition(ocr) pashto isolated characters visual recognition autonomous language understanding deep learning convolutional neural network(CNN)
下载PDF
智能移动终端涉密信息监测系统 被引量:3
4
作者 王本钰 顾益军 彭舒凡 《科学技术与工程》 北大核心 2022年第6期2317-2325,共9页
网络高度发达的信息时代,防止涉密信息被泄露是一件非常重要的任务,尤其是对于政府、军队、公安等重点单位。传统的涉密信息监测系统往往是安装在主机等终端中,无法对于通过手机等智能移动终端偷拍涉密图片或者通过聊天软件上传涉密图... 网络高度发达的信息时代,防止涉密信息被泄露是一件非常重要的任务,尤其是对于政府、军队、公安等重点单位。传统的涉密信息监测系统往往是安装在主机等终端中,无法对于通过手机等智能移动终端偷拍涉密图片或者通过聊天软件上传涉密图片的行为无法进行有效的制止。针对这个问题,设计了一种将CTPN文本检测算法、光学字符识别技术(optical character recognition,OCR)与场景识别、图片传输监控相结合的智能移动终端涉密信息监测系统,可广泛应用于Android移动平台中。该系统通过全局扫描,实时相机监察,社交管控三防一体对失泄密行为进行监控监察,有效防止失泄密事故案件的发生。测试结果显示,该系统不仅可以准确识别涉密图片、监测涉密行为并且处理速度快、占用内存空间小,可以满足涉密单位的基本需求。 展开更多
关键词 CTPN文本检测算法 光学字符识别技术(optical character recognition ocr) 智能移动终端 监控监察
下载PDF
Study on the de-watermark algorithm based on grayscale text
5
作者 黄国权 Chen Zhipeng Sun Xiaocui 《High Technology Letters》 EI CAS 2021年第1期95-102,共8页
When using the current popular text recognition algorithms such as optical character recognition(OCR)algorithm for text images,the presence of watermarks in text images interferes with algorithm recognition to the ext... When using the current popular text recognition algorithms such as optical character recognition(OCR)algorithm for text images,the presence of watermarks in text images interferes with algorithm recognition to the extent of fuzzy font,which is not conducive to the improvement of the recognition rate.In order to pursue fast and high recognition rate,watermark removal has become a critical problem to be solved.This work studies the watermarking algorithm based on morphological algorithm set and classic image algorithm in computer images.It can not only remove the watermark in a short time,but also keep the form and clarity of the text in the image.The algorithm also meets the requirements that the higher the clarity of image and text,the better the processing effect.It can process the Chinese characters with complex structure,complicated radicals or other characters well.In addition,the algorithm can basically process ordinary size images in 1 s,the efficiency is relatively high. 展开更多
关键词 de-watermark text recognition character recognition optical character recognition(ocr)application
下载PDF
Denoising Letter Images from Scanned Invoices Using Stacked Autoencoders
6
作者 Samah Ibrahim Alshathri Desiree Juby Vincent V.S.Hari 《Computers, Materials & Continua》 SCIE EI 2022年第4期1371-1386,共16页
Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In ... Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In this paper,letter data obtained from images of invoices are denoised using a modified autoencoder based deep learning method.A stacked denoising autoencoder(SDAE)is implemented with two hidden layers each in encoder network and decoder network.In order to capture the most salient features of training samples,a undercomplete autoencoder is designed with non-linear encoder and decoder function.This autoencoder is regularized for denoising application using a combined loss function which considers both mean square error and binary cross entropy.A dataset consisting of 59,119 letter images,which contains both English alphabets(upper and lower case)and numbers(0 to 9)is prepared from many scanned invoices images and windows true type(.ttf)files,are used for training the neural network.Performance is analyzed in terms of Signal to Noise Ratio(SNR),Peak Signal to Noise Ratio(PSNR),Structural Similarity Index(SSIM)and Universal Image Quality Index(UQI)and compared with other filtering techniques like Nonlocal Means filter,Anisotropic diffusion filter,Gaussian filters and Mean filters.Denoising performance of proposed SDAE is compared with existing SDAE with single loss function in terms of SNR and PSNR values.Results show the superior performance of proposed SDAE method. 展开更多
关键词 Stacked denoising autoencoder(SDAE) optical character recognition(ocr) signal to noise ratio(SNR) universal image quality index(UQ1)and structural similarity index(SSIM)
下载PDF
关于影像传输系统在异地业务中的应用
7
作者 周承林 《中国科技期刊数据库 科研》 2017年第1期0045-0046,共2页
各个行业都会有一些远程的或者异地的业务开展的时候,原始的OA系统如果没有关于远程业务的相关功能,或者远程业务功能的不可查看细节的局限性,则可能在执行某些应用的时候,会有不清晰,不明确的缺点。本文将讨论在各行业的OA系统在遇到... 各个行业都会有一些远程的或者异地的业务开展的时候,原始的OA系统如果没有关于远程业务的相关功能,或者远程业务功能的不可查看细节的局限性,则可能在执行某些应用的时候,会有不清晰,不明确的缺点。本文将讨论在各行业的OA系统在遇到以上缺陷的时候,影像传输系统对于这种缺陷的补全。 展开更多
关键词 影像传输系统 ocr(Optical character recognition光学字符识别)系统 CMS(Content Management System)
下载PDF
A CCD based machine vision system for real-time text detection
8
作者 Shihua ZHAO Lipeng SUN +2 位作者 Gang LI Yun LIU Binbing LIU 《Frontiers of Optoelectronics》 EI CSCD 2020年第4期418-424,共7页
Text detection and recognition is a hot topic in computer vision,which is considered to be the further development of the traditional optical character recognition(OCR)technology.With the rapid development of machine ... Text detection and recognition is a hot topic in computer vision,which is considered to be the further development of the traditional optical character recognition(OCR)technology.With the rapid development of machine vision system and the wide application of deep learning algorithms,text recognition has achieved excellent performance.In contrast,detecting text block from complex natural scenes is still a challenging task.At present,many advanced natural scene text detection algorithms have been proposed,but most of them run slow due to the complexity of the detection pipeline and can not be applied to industrial scenes.In this paper,we proposed a CCD based machine vision system for realtime text detection in invoice images.In this system,we applied optimizations from several aspects including the optical system,the hardware architecture,and the deep learning algorithm to improve the speed performance of the machine vision system.The experimental data confirms that the optimization methods can significantly improve the running speed of the machine vision system and make it meeting the real-time text detection requirements in industrial scenarios. 展开更多
关键词 machine vision text detection optical character recognition(ocr) deep learning
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部