APPROACH TO TEXT EXTRACTION FROM IMAGE

Disha Bhat; Charitha D; Dimple M K; Amruthashree R V; Shruthi G

doi:10.26483/ijarcs.v9i0.6270

PDF

Published: Aug 8, 2018

DOI: https://doi.org/10.26483/ijarcs.v9i0.6270

Keywords:

image, text, extraction

Disha Bhat

Charitha D

Dimple M K

Amruthashree R V

Shruthi G

Abstract

The multimedia resources in a database and on the web are increasing. The multimedia resources can be images and videos. It has become a very difficult task to develop the effective methods to manage as well as to retrieve these resources by their content. The Text which is an important object which carries high-level semantic information which is useful for this task. The current technologyis optical character recognition (OCR) is used to convert machine generated text which is printed against clean background to computer readable form (ASCII). But, text isoften printed against shaded or textured backgrounds or is embedded in images.Examples include maps, photographs, advertisements, videos etc. Current document segmentation and recognition technologies cannot handle these situations well. Our system takes advantages of the distinctive characteristics of text that make it stand out from other image material that is, text possesses certain frequency and orientation information. We will first clean the image by changing the contrast and gradient of the image. Now the objects in the images are identified and numbered. Further in the text recognition process, these numbered objects are segregated into text and non-text. Later the recognised text is reconstructed to form a meaningful text present in the image. Also we are focusing on extracting the text such that certain portion of the images such as logos etc is retained. This is done by calculating the pixels of the required portion of the image to be retained and then training the system in such a way that it extracts all the text except the portion of the image to be retained.

Downloads

Download data is not yet available.

Issue

2018: Volume 9 Special Issue No. 3, May 2018

Section

Articles

COPYRIGHT

Submission of a manuscript implies: that the work described has not been published before, that it is not under consideration for publication elsewhere; that if and when the manuscript is accepted for publication, the authors agree to automatic transfer of the copyright to the publisher.

Authors who publish with this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work
The journal allows the author(s) to retain publishing rights without restrictions.
The journal allows the author(s) to hold the copyright without restrictions.

References

Deepayan Sarkar "Optical Character Recognition using Neural Networks" University of Wisconsin MadisonECE 539 Project, Fall 2003. [2] â€œEvaluation of OCR Algorithms for Images with Different Spatial Resolutions and Noisesâ€ School of Information Technology and Engineering Faculty of Engineering University of OttawaÂ©Qing Chen, Ottawa, Canada, 2003. [3] â€œA Neural Network Implementation of Optical Character Recognitionâ€ Technical Report Number CSSE10-05 COMP 6600 â€“ Artificial Intelligence Spring 2009. [4] Sukhpreet Singh M.tech Student â€œOptical Character Recognition Techniques: A Surveyâ€, Dept. of Computer Engineering, YCOE Talwandi Sabo BP. India. [5] Amarjot Singh, ketanbacchuwar, Akshaybhasinâ€œSurvey of OCR Applicationsâ€.International Journal of Machine Learning and Computing , June 2012 [6] M.D. Ganis, C.L. Wilson, J.L. Blue, â€œNeural network-based systems for handprint OCR applicationsâ€ in IEEE Transactions on Image Processing, 1998, Vol: 7, Issue: 8, p.p. 1097 â€“ 1112. [7] Sadagopan Srinivasan, Li Zhao, Lin Sun, Zhen Fang, Peng Li, Tao Wang,RavishankarIyer, Ramesh Illikkal ,â€œPerformance Characterization and Acceleration of Optical Character Recognition on Handheld Platformsâ€, IEEE December 2010, DOI: 10.1109/IISWC.2010.5648852 [8] Sonia Bhaskar, Nicholas Lavassar, Scott GreenEE â€œImplementing Optical Character Recognition on the Android Operating System for Business Cardsâ€ 368 Digital Image Processing, 2010. [9] Anitha Mary M.O. Chacko and P.M. Dhanyaâ€œA Comparative Study of Different Feature Extraction Techniques for Offline Malayalam Character Recognitionâ€ Springer India 2015, DOI 10.1007/978-81-322-2208-8_2

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details

References

Similar Articles