Demystification of Bilingual Optical Character Recognition System for Devanagari and English Scripts
Rohit Verma1, Jahid Ali2
1Rohit Verma, Research Scholar, IKG Punjab Technical University, Jalandhar, India.
2Dr. Jahid Ali, Director, Shri Sai Iqbal College of Management & IT, Pathankot, India.
Manuscript received on September 23, 2019. | Revised Manuscript received on October 15, 2019. | Manuscript published on October 30, 2019. | PP: 5180-5185 | Volume-9 Issue-1, October 2019 | Retrieval Number: A1856109119/2019©BEIESP | DOI: 10.35940/ijeat.A1856.109119
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Research is deliberately going on in the field of pattern recognition. New ideas are developed and implemented in this field throughout the globe. Optical Character Recognition (OCR) is one of the inseparable applications of Pattern Recognition. Though extensive research is already reported in this field, but multilingual Optical Character Recognition is the most challenging aspect which is still, the need of the hour. Myriads of researchers are digging the information to gather the best solutions for the recognition purpose. In this research paper, we are purposing the steps for the recognition of Devanagari and English scripts simultaneously occurring in the documents. A new approach of segmentation and splitting the characters of both the scripts is also introduced for the benefits of researchers. Most commonly in the documents containing English and Devanagari scripts, English characters are already separated, the challenge is to separate the Devanagari characters. Algorithm to implement the challenging aspect to segment the Devanagari and Roman scripts simultaneously is also implemented in the present paper.
Keywords: Optical Character Recognition, Segmentation, Feature extraction, Dataset, pattern recognition, image processing.