期刊文献+
共找到27篇文章
< 1 2 >
每页显示 20 50 100
Arabic Optical Character Recognition:A Review
1
作者 Salah Alghyaline 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第6期1825-1861,共37页
This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingl... This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingly.The study describes the characteristics of the Arabic language,different types of OCR systems,different stages of the Arabic OCR system,the researcher’s contributions in each step,and the evaluationmetrics for OCR.The study reviews the existing datasets for the Arabic OCR and their characteristics.Additionally,this study implemented some preprocessing and segmentation stages of Arabic OCR.The study compares the performance of the existing methods in terms of recognition accuracy.In addition to researchers’OCRmethods,commercial and open-source systems are used in the comparison.The Arabic language is morphologically rich and written cursive with dots and diacritics above and under the characters.Most of the existing approaches in the literature were evaluated on isolated characters or isolated words under a controlled environment,and few approaches were tested on pagelevel scripts.Some comparative studies show that the accuracy of the existing Arabic OCR commercial systems is low,under 75%for printed text,and further improvement is needed.Moreover,most of the current approaches are offline OCR systems,and there is no remarkable contribution to online OCR systems. 展开更多
关键词 Arabic optical character Recognition(OCR) Arabic OCR software Arabic OCR datasets Arabic OCR evaluation
下载PDF
Review of Optical Character Recognition for Power System Image Based on Artificial Intelligence Algorithm
2
作者 Xun Zhang Wanrong Bai Haoyang Cui 《Energy Engineering》 EI 2023年第3期665-679,共15页
Optical Character Recognition(OCR)refers to a technology that uses image processing technology and character recognition algorithms to identify characters on an image.This paper is a deep study on the recognition effe... Optical Character Recognition(OCR)refers to a technology that uses image processing technology and character recognition algorithms to identify characters on an image.This paper is a deep study on the recognition effect of OCR based on Artificial Intelligence(AI)algorithms,in which the different AI algorithms for OCR analysis are classified and reviewed.Firstly,the mechanisms and characteristics of artificial neural network-based OCR are summarized.Secondly,this paper explores machine learning-based OCR,and draws the conclusion that the algorithms available for this form of OCR are still in their infancy,with low generalization and fixed recognition errors,albeit with better recognition effect and higher recognition accuracy.Finally,this paper explores several of the latest algorithms such as deep learning and pattern recognition algorithms.This paper concludes that OCR requires algorithms with higher recognition accuracy. 展开更多
关键词 optical character recognition artificial intelligence power system image artificial neural network machine leaning deep learning
下载PDF
OPTICAL CHARACTERIZATION OF TiO_2 THIN FILM ON SILICON SUBSTRATE DEPOSITED BY DC REACTIVE MAGNETRON SPUTTERING 被引量:3
3
作者 H.Q.Wang H.Shen +3 位作者 D.C.Ba B.W.Wang L.S.Wen D.Chen 《Acta Metallurgica Sinica(English Letters)》 SCIE EI CAS CSCD 2005年第3期194-198,共5页
TiO2 thin film has attracted considerable attention in recent years, due to its different refractive index and transparency with amorphous and different crysta ls in the visible and near-infrared wavelength region, hi... TiO2 thin film has attracted considerable attention in recent years, due to its different refractive index and transparency with amorphous and different crysta ls in the visible and near-infrared wavelength region, high dielectric constant, wide band gap, high wear resistance and stability, etc, for which make it being used in many fields. This paper aims to investigate the optical characterizatio n of thin film TiO2 on silicon wafer. The TiO2 thin films were prepared by DC re active magnetron sputtering process from Ti target. The reflectivity of the film s was measured by UV-3101PC, and the index of refraction (n) and extinction coef ficient (k) were measured by n & k Analyzer 1200. 展开更多
关键词 optical characterization TiO2 thin film DC reactive magnetron sputtering n & k
下载PDF
Nonlinear optical characterization of single β-barium-borate nanocrystals using second-harmonic confocal microscopy
4
作者 Rodrigo G.dos Santos Lauro J.Q.Maia +1 位作者 Cid B.de Araújo Leonardo de S.Menezes 《Chinese Optics Letters》 SCIE EI CAS CSCD 2018年第4期81-85,共5页
The confocal microscopy technique was applied for nonlinear optical characterization of single β-barium-borate(β-BBO) nanocrystals. The experimental setup allows measurements of the laser polarization-selective se... The confocal microscopy technique was applied for nonlinear optical characterization of single β-barium-borate(β-BBO) nanocrystals. The experimental setup allows measurements of the laser polarization-selective secondharmonic(SH) generation, and the results can be used to determine the nanocrystals' c-axis orientation, as well as to obtain information about their second-order susceptibility χ^(2). The dependence of the SH signal on the laser polarization allowed the discrimination of individual particles from aggregates. The data were fitted using a model that takes into account the BBO properties and the experimental setup characteristics considering(i) the electrostatic approximation,(ii) the effects of the microscope objective used to focus the light on the sample in an epi-geometry configuration, and(iii) the symmetry of χ^(2) for the β-BBO nanocrystals. A signal at the third-harmonic frequency was also detected, but it was too weak to be studied in detail. 展开更多
关键词 SH BBO HR barium-borate nanocrystals using second-harmonic confocal microscopy Nonlinear optical characterization of single
原文传递
Green Method, Optical and Structural Characterization of ZnO Nanoparticles Synthesized Using Leaves Extract of M. oleifera
5
作者 Jose Vulfrano Gonzalez-Fernandez Diego David Pinzon-Moreno +1 位作者 Antony Alexander Neciosup-Puican Maria Veronica Carranza-Oropeza 《Journal of Renewable Materials》 SCIE EI 2022年第3期833-847,共15页
ZnO nanoparticles(ZnO-NP)present innovative optical,electrical,and magnetic properties that depend on spe-cific characteristics,e.g.,size,distribution,and morphology.Thus,these properties are essential to address vari... ZnO nanoparticles(ZnO-NP)present innovative optical,electrical,and magnetic properties that depend on spe-cific characteristics,e.g.,size,distribution,and morphology.Thus,these properties are essential to address various applications in areas such as electronics,medicine,energy,and others.In addition,the performance of this ZnO-NP depends of their preparation which can be done by chemical,physical,and biological methods.Meanwhile,nowadays,the main interest in developing ZnO-NP synthesis through biological methods bases on the decrease of use of toxic chemicals or energy applied to the procedures,making the process more cost-effective and environ-mentally friendly.However,the large-scale production of nanoparticles by green synthesis remains a big challenge due to the complexity of the biological extracts used in chemical reactions.That being the case,the preparation of ZnO-NP using Moringa oleifera extract as an alternative biological agent for capping and reduction in synthesis was evaluated in this work.Then,the results based on the analysis of the optical and structural characterization of the ZnO-NP obtained by employing UV-Vis,DLS,zeta potential,XRD,ATR-FTIR,and FE-SEM indicate mostly the presence of spherical nanosized material with a mean hydrodynamic diameter of 47.2 nm measured by DLS and a mean size diameter of 25 nm observed with FE-SEM technique.Furthermore,in FE-SEM images a homo-geneous dispersion and distribution is observed in the absence of agglutination,agglomeration,or generation of significant lumps of the ZnO-NP.The XRD analysis showed that heat annealing induced the crystallite size favor-ing their monocrystallinity.Those obtained data confirm the synthesis of ZnO-NP and the absence of impurities associated with organic compounds in the annealed samples.Finally,those results and low-cost production pre-sent to the synthesized ZnO-NP by this biological method as a useful material in several applications. 展开更多
关键词 ZnO nanoparticles green chemistry Moringa oleifera extract optical characterization structural characterization
下载PDF
New method for fast morphological characterization of organic polycrystalline films by polarized optical microscopy
6
作者 何小川 杨建兵 +1 位作者 闫东航 翁羽翔 《Chinese Physics B》 SCIE EI CAS CSCD 2015年第7期396-400,共5页
A new method to visualize the large-scale crystal grain morphology of organic polycrystalline films is proposed. First,optical anisotropic transmittance images of polycrystalline zinc phthalocyanine(Zn Pc) films vac... A new method to visualize the large-scale crystal grain morphology of organic polycrystalline films is proposed. First,optical anisotropic transmittance images of polycrystalline zinc phthalocyanine(Zn Pc) films vacuum deposited by weak epitaxial growth(WEG) method were acquired with polarized optical microscopy(POM). Then morphology properties including crystal grain size, distribution, relative orientation, and crystallinity were derived from these images by fitting with a transition dipole model. At last, atomic force microscopy(AFM) imaging was carried out to confirm the fitting and serve as absolute references. This method can be readily generalized to other organic polycrystalline films, thus providing an efficient way to access the large-scale morphologic properties of organic polycrystalline films, which may prove to be useful in industry as a film quality monitoring method. 展开更多
关键词 organic polycrystalline films morphology characterization polarized optical microscopy
下载PDF
Automated Extraction and Analysis of CBC Test from Scanned Images
7
作者 Iman S. Alansari 《Journal of Software Engineering and Applications》 2024年第2期129-141,共13页
Health care is an important part of human life and is a right for everyone. One of the most basic human rights is to receive health care whenever they need it. However, this is simply not an option for everyone due to... Health care is an important part of human life and is a right for everyone. One of the most basic human rights is to receive health care whenever they need it. However, this is simply not an option for everyone due to the social conditions in which some communities live and not everyone has access to it. This paper aims to serve as a reference point and guide for users who are interested in monitoring their health, particularly their blood analysis to be aware of their health condition in an easy way. This study introduces an algorithmic approach for extracting and analyzing Complete Blood Count (CBC) parameters from scanned images. The algorithm employs Optical Character Recognition (OCR) technology to process images containing tabular data, specifically targeting CBC parameter tables. Upon image processing, the algorithm extracts data and identifies CBC parameters and their corresponding values. It evaluates the status (High, Low, or Normal) of each parameter and subsequently presents evaluations, and any potential diagnoses. The primary objective is to automate the extraction and evaluation of CBC parameters, aiding healthcare professionals in swiftly assessing blood analysis results. The algorithmic framework aims to streamline the interpretation of CBC tests, potentially improving efficiency and accuracy in clinical diagnostics. 展开更多
关键词 Image Processing optical character Recognition Tesseract OCR Health Care Application
下载PDF
Support Vector Machine Based Handwritten Hindi Character Recognition and Summarization
8
作者 Sunil Dhankhar Mukesh Kumar Gupta +3 位作者 Fida Hussain Memon Surbhi Bhatia Pankaj Dadheech Arwa Mashat 《Computer Systems Science & Engineering》 SCIE EI 2022年第10期397-412,共16页
In today’s digital era,the text may be in form of images.This research aims to deal with the problem by recognizing such text and utilizing the support vector machine(SVM).A lot of work has been done on the English l... In today’s digital era,the text may be in form of images.This research aims to deal with the problem by recognizing such text and utilizing the support vector machine(SVM).A lot of work has been done on the English language for handwritten character recognition but very less work on the under-resourced Hindi language.A method is developed for identifying Hindi language characters that use morphology,edge detection,histograms of oriented gradients(HOG),and SVM classes for summary creation.SVM rank employs the summary to extract essential phrases based on paragraph position,phrase position,numerical data,inverted comma,sentence length,and keywords features.The primary goal of the SVM optimization function is to reduce the number of features by eliminating unnecessary and redundant features.The second goal is to maintain or improve the classification system’s performance.The experiment included news articles from various genres,such as Bollywood,politics,and sports.The proposed method’s accuracy for Hindi character recognition is 96.97%,which is good compared with baseline approaches,and system-generated summaries are compared to human summaries.The evaluated results show a precision of 72%at a compression ratio of 50%and a precision of 60%at a compression ratio of 25%,in comparison to state-of-the-art methods,this is a decent result. 展开更多
关键词 Support vector machine(SVM) optimization PRECISION Hindi character recognition optical character recognition(OCR) automatic summarization and compression ratio
下载PDF
CHARACTER DETECTION AND RECOGNITION SYSTEM OF BEER BOTTLES 被引量:1
9
作者 Shen Bangxing Wu Wenjun +2 位作者 Zhang Yepeng Shen Gang Yang Liangen 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2005年第3期467-469,共3页
An optical imaging system and a configuration characteristic algorithm are presented to reduce the difficulties in extracting intact characters image with weak contrast, in recognizing characters on fast moving beer b... An optical imaging system and a configuration characteristic algorithm are presented to reduce the difficulties in extracting intact characters image with weak contrast, in recognizing characters on fast moving beer bottles. The system consists of a hardware subsystem, including a rotating device, CCD, 16 mm focus lens, a frame grabber card, a penetrating lighting and a computer, and a software subsystem. The software subsystem performs pretreatment, character segmentation and character recognition. In the pretreatment, the original image is filtered with preset threshold to remove isolated spots. Then the horizontal projection and the vertical projection are used respectively to retrieve the character segmentation. Subsequently, the configuration characteristic algorithm is applied to recognize the characters. The experimental results demonstrate that this system can recognize the characters on beer bottles accurately and effectively; the algorithm is proven fast, stable and robust, making it suitable in the industrial environment. 展开更多
关键词 optical imaging system Raised character recognition Configuration characteristic algorithm
下载PDF
Annealing behaviour of structure and morphology and its effects on the optical gain of Er^3+/yb^3+ co-doped Al2O3 planar waveguide amplifier
10
作者 谭娜 张庆瑜 《Chinese Physics B》 SCIE EI CAS CSCD 2006年第9期2165-2169,共5页
Using transmission electron microscopy (TEM) and x-ray diffraction analysis, we have studied the structural and morphological evolution of highly Er/Yb co-doped A1203 films in the temperature range from 600℃-900℃.... Using transmission electron microscopy (TEM) and x-ray diffraction analysis, we have studied the structural and morphological evolution of highly Er/Yb co-doped A1203 films in the temperature range from 600℃-900℃. By comparison with TEM observation, the annealing behaviours of photoluminescence (PL) emission and optical loss were found to have relation to the structure and morphology. The increase of PL intensity and optical loss above 800℃ might result from the crystallization of amorphous Al2O3 films. Based on the study on the structure and morphology, a rate equation propagation model of a multilevel system was used to calculate the optical gains of Er-doped Al2O3 planar waveguide amplifiers involving the variation of PL efficiency and optical loss with annealing temperature. It was found that the amplifiers had an optimized optical gain at the temperature corresponding to the minimum of optical loss, rather than at the temperature corresponding to the maximum of PL efficiency, suggesting that the optical loss is a key factor for determining the optical gain of an Er-doped Al2O3 planar waveguide amplifier. 展开更多
关键词 Er-doped waveguide amplifier annealing behaviour structural characterization optical gain
下载PDF
Visual News Ticker Surveillance Approach from Arabic Broadcast Streams
11
作者 Moeen Tayyab Ayyaz Hussain +2 位作者 Usama Mir M.Aqeel Iqbal Muhammad Haneef 《Computers, Materials & Continua》 SCIE EI 2023年第3期6177-6193,共17页
The news ticker is a common feature of many different news networks that display headlines and other information.News ticker recognition applications are highly valuable in e-business and news surveillance for media r... The news ticker is a common feature of many different news networks that display headlines and other information.News ticker recognition applications are highly valuable in e-business and news surveillance for media regulatory authorities.In this paper,we focus on the automatic Arabic Ticker Recognition system for the Al-Ekhbariya news channel.The primary emphasis of this research is on ticker recognition methods and storage schemes.To that end,the research is aimed at character-wise explicit segmentation using a semantic segmentation technique and words identification method.The proposed learning architecture considers the grouping of homogeneousshaped classes.This incorporates linguistic taxonomy in a unified manner to address the imbalance in data distribution which leads to individual biases.Furthermore,experiments with a novel ArabicNews Ticker(Al-ENT)dataset that provides accurate character-level and character components-level labeling to evaluate the effectiveness of the suggested approach.The proposed method attains 96.5%,outperforming the current state-of-the-art technique by 8.5%.The study reveals that our strategy improves the performance of lowrepresentation correlated character classes. 展开更多
关键词 Arabic text recognition optical character recognition deep convolutional network SegNet LeNet
下载PDF
Baseline Isolated Printed Text Image Database for Pashto Script Recognition
12
作者 Arfa Siddiqu Abdul Basit +3 位作者 Waheed Noor Muhammad Asfandyar Khan M.Saeed H.Kakar Azam Khan 《Intelligent Automation & Soft Computing》 SCIE 2023年第7期875-885,共11页
The optical character recognition for the right to left and cursive languages such as Arabic is challenging and received little attention from researchers in the past compared to the other Latin languages.Moreover,the... The optical character recognition for the right to left and cursive languages such as Arabic is challenging and received little attention from researchers in the past compared to the other Latin languages.Moreover,the absence of a standard publicly available dataset for several low-resource lan-guages,including the Pashto language remained a hurdle in the advancement of language processing.Realizing that,a clean dataset is the fundamental and core requirement of character recognition,this research begins with dataset generation and aims at a system capable of complete language understanding.Keeping in view the complete and full autonomous recognition of the cursive Pashto script.The first achievement of this research is a clean and standard dataset for the isolated characters of the Pashto script.In this paper,a database of isolated Pashto characters for forty four alphabets using various font styles has been introduced.In order to overcome the font style shortage,the graphical software Inkscape has been used to generate sufficient image data samples for each character.The dataset has been pre-processed and reduced in dimensions to 32×32 pixels,and further converted into the binary format with a black background and white text so that it resembles the Modified National Institute of Standards and Technology(MNIST)database.The benchmark database is publicly available for further research on the standard GitHub and Kaggle database servers both in pixel and Comma Separated Values(CSV)formats. 展开更多
关键词 Text-image database optical character recognition(OCR) pashto isolated characters visual recognition autonomous language understanding deep learning convolutional neural network(CNN)
下载PDF
Vehicle Density Prediction in Low Quality Videos with Transformer Timeseries Prediction Model(TTPM)
13
作者 D.Suvitha M.Vijayalakshmi 《Computer Systems Science & Engineering》 SCIE EI 2023年第1期873-894,共22页
Recent advancement in low-cost cameras has facilitated surveillance in various developing towns in India.The video obtained from such surveillance are of low quality.Still counting vehicles from such videos are necess... Recent advancement in low-cost cameras has facilitated surveillance in various developing towns in India.The video obtained from such surveillance are of low quality.Still counting vehicles from such videos are necessity to avoid traf-fic congestion and allows drivers to plan their routes more precisely.On the other hand,detecting vehicles from such low quality videos are highly challenging with vision based methodologies.In this research a meticulous attempt is made to access low-quality videos to describe traffic in Salem town in India,which is mostly an un-attempted entity by most available sources.In this work profound Detection Transformer(DETR)model is used for object(vehicle)detection.Here vehicles are anticipated in a rush-hour traffic video using a set of loss functions that carry out bipartite coordinating among estimated and information acquired on real attributes.Every frame in the traffic footage has its date and time which is detected and retrieved using Tesseract Optical Character Recognition.The date and time extricated and perceived from the input image are incorporated with the length of the recognized objects acquired from the DETR model.This furnishes the vehicles report with timestamp.Transformer Timeseries Prediction Model(TTPM)is proposed to predict the density of the vehicle for future prediction,here the regular NLP layers have been removed and the encoding temporal layer has been modified.The proposed TTPM error rate outperforms the existing models with RMSE of 4.313 and MAE of 3.812. 展开更多
关键词 Detection transformer self-attention tesseract optical character recognition transformer timeseries prediction model time encoding vector
下载PDF
智能移动终端涉密信息监测系统 被引量:3
14
作者 王本钰 顾益军 彭舒凡 《科学技术与工程》 北大核心 2022年第6期2317-2325,共9页
网络高度发达的信息时代,防止涉密信息被泄露是一件非常重要的任务,尤其是对于政府、军队、公安等重点单位。传统的涉密信息监测系统往往是安装在主机等终端中,无法对于通过手机等智能移动终端偷拍涉密图片或者通过聊天软件上传涉密图... 网络高度发达的信息时代,防止涉密信息被泄露是一件非常重要的任务,尤其是对于政府、军队、公安等重点单位。传统的涉密信息监测系统往往是安装在主机等终端中,无法对于通过手机等智能移动终端偷拍涉密图片或者通过聊天软件上传涉密图片的行为无法进行有效的制止。针对这个问题,设计了一种将CTPN文本检测算法、光学字符识别技术(optical character recognition,OCR)与场景识别、图片传输监控相结合的智能移动终端涉密信息监测系统,可广泛应用于Android移动平台中。该系统通过全局扫描,实时相机监察,社交管控三防一体对失泄密行为进行监控监察,有效防止失泄密事故案件的发生。测试结果显示,该系统不仅可以准确识别涉密图片、监测涉密行为并且处理速度快、占用内存空间小,可以满足涉密单位的基本需求。 展开更多
关键词 CTPN文本检测算法 光学字符识别技术(optical character recognition OCR) 智能移动终端 监控监察
下载PDF
Libyan Licenses Plate Recognition Using Template Matching Method 被引量:1
15
作者 Alla A. El. Senoussi Abdella 《Journal of Computer and Communications》 2016年第7期62-71,共10页
License plate recognition (LPR) applies image processing and character recognition technology to identify vehicles by automatically reading their license plates. The work presented in this paper aims to create a compu... License plate recognition (LPR) applies image processing and character recognition technology to identify vehicles by automatically reading their license plates. The work presented in this paper aims to create a computer vision system capable of taking real-time input image from a static camera and identifying the license plate from extracted image. This problem is examined in two stages: First the license plate region detection and extraction from background and plate segmentation to sub-images, and second the character recognition stage. The method used for the license plate region detection is based on the assumption that the license plate area is a high concentration of smaller details, making it a region of high intensity of edges. The Sobel filter and their vertical and horizontal projections are used to identify the plate region. The result of testing this stage was an accuracy of 67.5%. The final stage of the LPR system is optical character recognition (OCR). The method adopted for this stage is based on template matching using correlation. Testing the performance of OCR resulted in an overall recognition rate of 87.76%. 展开更多
关键词 License Plate Recognition optical character Recognition Computer Vision System
下载PDF
ERCS: An Efficient and Robust Card Recognition System for Camera-Based Image
16
作者 Zhonghong Ou Baiqiao Xiong +1 位作者 Fenrui Xiao Meina Song 《China Communications》 SCIE CSCD 2020年第12期247-264,共18页
Cards Recognition Systems,(CRSs)are representative computer vision-based applications.They have a broad range of usage scenarios.For example,they can be used to recognize images containing business cards,personal iden... Cards Recognition Systems,(CRSs)are representative computer vision-based applications.They have a broad range of usage scenarios.For example,they can be used to recognize images containing business cards,personal identification cards,and bank cards etc.Even though CRSs have been studied for many years,it is still difficult to recognize cards in camera-based images taken by ordinary devices,e.g.,mobile phones.Diversity of viewpoints and complex backgrounds in the images make the recognition task challenging.Existing systems employing traditional image processing schemes are not robust to varied environment,and are inefficient in dealing with natural images,e.g.,taken by mobile phones.To tackle the problem,we propose a novel framework for card recognition by employing a Convolutional Neutral Network(CNN)based approach.The system localizes the foreground of the image by utilizing a Fully Convolutional Network(FCN).With the help of the foreground map,the system localizes the corners of the card region and employs perspective transformation to alleviate the effects from distortion.Text lines in the card region are detected and recognized by utilizing CNN and Long Short Term Memory,(LSTM).To evaluate the proposed scheme,we collect a large dataset which contains 4,065 images in a variety of shooting scenarios.Experimental results demonstrate the efficacy of the proposed scheme.Specifically,it is able to achieve an accuracy of 90.62%in the end-toend test,outperforming the state-of-the-art. 展开更多
关键词 card localization card recognition optical character recognition CNN
下载PDF
AI Cannot Understand Memes:Experiments with OCR and Facial Emotions
17
作者 Ishaani Priyadarshini Chase Cotton 《Computers, Materials & Continua》 SCIE EI 2022年第1期781-800,共20页
The increasing capabilities of Artificial Intelligence(AI),has led researchers and visionaries to think in the direction of machines outperforming humans by gaining intelligence equal to or greater than humans,which m... The increasing capabilities of Artificial Intelligence(AI),has led researchers and visionaries to think in the direction of machines outperforming humans by gaining intelligence equal to or greater than humans,which may not always have a positive impact on the society.AI gone rogue,and Technological Singularity are major concerns in academia as well as the industry.It is necessary to identify the limitations of machines and analyze their incompetence,which could draw a line between human and machine intelligence.Internet memes are an amalgam of pictures,videos,underlying messages,ideas,sentiments,humor,and experiences,hence the way an internet meme is perceived by a human may not be entirely how a machine comprehends it.In this paper,we present experimental evidence on how comprehending Internet Memes is a challenge for AI.We use a combination of Optical Character Recognition techniques like Tesseract,Pixel Link,and East Detector to extract text from the memes,and machine learning algorithms like Convolutional Neural Networks(CNN),Region-based Convolutional Neural Networks(RCNN),and Transfer Learning with pre-trained denseNet for assessing the textual and facial emotions combined.We evaluate the performance using Sensitivity and Specificity.Our results show that comprehending memes is indeed a challenging task,and hence a major limitation of AI.This research would be of utmost interest to researchers working in the areas of Artificial General Intelligence and Technological Singularity. 展开更多
关键词 Technological singularity optical character recognition transfer learning convolutional neural networks(CNN) region-based convolutional neural networks(RCNN)
下载PDF
Denoising Letter Images from Scanned Invoices Using Stacked Autoencoders
18
作者 Samah Ibrahim Alshathri Desiree Juby Vincent V.S.Hari 《Computers, Materials & Continua》 SCIE EI 2022年第4期1371-1386,共16页
Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In ... Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In this paper,letter data obtained from images of invoices are denoised using a modified autoencoder based deep learning method.A stacked denoising autoencoder(SDAE)is implemented with two hidden layers each in encoder network and decoder network.In order to capture the most salient features of training samples,a undercomplete autoencoder is designed with non-linear encoder and decoder function.This autoencoder is regularized for denoising application using a combined loss function which considers both mean square error and binary cross entropy.A dataset consisting of 59,119 letter images,which contains both English alphabets(upper and lower case)and numbers(0 to 9)is prepared from many scanned invoices images and windows true type(.ttf)files,are used for training the neural network.Performance is analyzed in terms of Signal to Noise Ratio(SNR),Peak Signal to Noise Ratio(PSNR),Structural Similarity Index(SSIM)and Universal Image Quality Index(UQI)and compared with other filtering techniques like Nonlocal Means filter,Anisotropic diffusion filter,Gaussian filters and Mean filters.Denoising performance of proposed SDAE is compared with existing SDAE with single loss function in terms of SNR and PSNR values.Results show the superior performance of proposed SDAE method. 展开更多
关键词 Stacked denoising autoencoder(SDAE) optical character recognition(OCR) signal to noise ratio(SNR) universal image quality index(UQ1)and structural similarity index(SSIM)
下载PDF
Study on the de-watermark algorithm based on grayscale text
19
作者 黄国权 Chen Zhipeng Sun Xiaocui 《High Technology Letters》 EI CAS 2021年第1期95-102,共8页
When using the current popular text recognition algorithms such as optical character recognition(OCR)algorithm for text images,the presence of watermarks in text images interferes with algorithm recognition to the ext... When using the current popular text recognition algorithms such as optical character recognition(OCR)algorithm for text images,the presence of watermarks in text images interferes with algorithm recognition to the extent of fuzzy font,which is not conducive to the improvement of the recognition rate.In order to pursue fast and high recognition rate,watermark removal has become a critical problem to be solved.This work studies the watermarking algorithm based on morphological algorithm set and classic image algorithm in computer images.It can not only remove the watermark in a short time,but also keep the form and clarity of the text in the image.The algorithm also meets the requirements that the higher the clarity of image and text,the better the processing effect.It can process the Chinese characters with complex structure,complicated radicals or other characters well.In addition,the algorithm can basically process ordinary size images in 1 s,the efficiency is relatively high. 展开更多
关键词 de-watermark text recognition character recognition optical character recognition(OCR)application
下载PDF
Research on the Early Warning System of Cold Chain Cargo Based on OCR Technology
20
作者 Jiaxuan Meng 《World Journal of Engineering and Technology》 2022年第3期527-538,共12页
This paper designs a set of semi-automatic intelligent cold chain cargo proximity warning system with wireless data transmission, lightweight Optical Character Recognition identification algorithm framework and electr... This paper designs a set of semi-automatic intelligent cold chain cargo proximity warning system with wireless data transmission, lightweight Optical Character Recognition identification algorithm framework and electronic label automatic warning as the core technology for cold chain dairy Fast Moving Consumer Goods contractors. In terms of hardware, Pulse Frequency Modulation modulation and demodulation are used as the main technology to realize wireless transmission and reception of equipment, and digital electronic tags are added to warn the same batch of upcoming goods. In terms of software, based on Chinese-ocr algorithm, image preprocessing and recognition methods are studied, and an early warning system is designed. So as to realize semi-automatic early warning of cold chain logistics goods. 展开更多
关键词 optical character Recognition Wireless Signal Transmission Image Processing Cold Chain Logistics Managemen Automatic Early Warning System
下载PDF
上一页 1 2 下一页 到第
使用帮助 返回顶部