Character Recognition and Language Translation with Optical Character Recognition |
Author(s): |
| Dnyanada Mangesh Padwal, Theem College Of Engineering; Tejashree Mahtre, Theem College Of Engineering; Roshan Chavan, Theem College Of Engineering |
Keywords: |
| Tesseract, OCR, optical character recognition, character recognition, document |
Abstract |
|
In day-to-day life, people often face problems understanding languages they do not know. For example, people in different states speak different languages and may not understand or speak the language of another state; this OCR website is intended to help them. Existing systems rely on a separate application for each step, such as the camera, Google Translator, and an Optical Character Recognition (OCR) text scanner, whereas users expect a single application that combines all three. The proposed web application therefore lets users translate text in an unfamiliar language into a language they know. It works in three steps: (1) the user chooses an image of the printed text to be translated; (2) Tesseract, an open-source OCR engine, extracts the text from the image, and the Google API and Bing API are used to translate it; (3) the translated text is generated in PDF format. This paper presents the details of this web application, which accepts as input a user-defined image file containing text in any language available in the Python-tesseract library and translates it into any language supported by Google Translator (i.e., Googletrans). Using computational power, individual elements such as text, images, and special characters can be distinguished; this is the work done by the Optical Character Recognizer (OCR). |
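The three steps described in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`ocr_image`, `translate_text`, `write_pdf`, `translate_image_to_pdf`) are hypothetical, the PDF step assumes the `fpdf` package (the abstract does not name a PDF library), and the translation step assumes a Googletrans release whose `translate` call is synchronous. The Bing API path mentioned in the abstract is not shown.

```python
from typing import Callable


def ocr_image(image_path: str, lang: str = "eng") -> str:
    """Step 2a: extract text from an image with Tesseract (via pytesseract)."""
    # Imported lazily so the pipeline can be exercised without these packages.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as img:
        # Tesseract uses three-letter language codes such as "eng" or "hin".
        return pytesseract.image_to_string(img, lang=lang)


def translate_text(text: str, dest: str = "en") -> str:
    """Step 2b: translate text with Googletrans.

    Assumption: a Googletrans release where translate() is synchronous;
    releases where it is a coroutine would need to be awaited instead.
    """
    from googletrans import Translator

    # Googletrans uses short language codes such as "en" or "hi".
    return Translator().translate(text, dest=dest).text


def write_pdf(text: str, pdf_path: str) -> None:
    """Step 3: write the translated text to a PDF file.

    Assumption: the fpdf package. Its built-in Helvetica font only covers
    Latin-1 text; other scripts would need a Unicode font to be registered.
    """
    from fpdf import FPDF

    pdf = FPDF()
    pdf.add_page()
    pdf.set_font("Helvetica", size=12)
    pdf.multi_cell(0, 10, text)
    pdf.output(pdf_path)


def translate_image_to_pdf(
    image_path: str,
    pdf_path: str,
    src_lang: str = "eng",
    dest_lang: str = "en",
    ocr: Callable[[str, str], str] = ocr_image,
    translate: Callable[[str, str], str] = translate_text,
    pdf_writer: Callable[[str, str], None] = write_pdf,
) -> str:
    """Run the image -> OCR -> translation -> PDF pipeline; return the translation."""
    extracted = ocr(image_path, src_lang).strip()
    if not extracted:
        raise ValueError("no text was recognised in the image")
    translated = translate(extracted, dest_lang)
    pdf_writer(translated, pdf_path)
    return translated
```

A call such as `translate_image_to_pdf("scan.png", "translated.pdf", src_lang="hin", dest_lang="en")` would run all three steps, provided Tesseract and its language data for the source language are installed. Passing the three stages in as parameters is a design choice made for this sketch: it keeps each stage replaceable, for example to swap in a different translation service.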
Other Details |
|
Paper ID: IJSRDV9I30021; Published in: Volume 9, Issue 3; Publication Date: 01/06/2021; Page(s): 23-25 |
