

Transcription of Text
Abstract
Optical Character Recognition (OCR) is a technology that recognizes text in documents and converts it into an editable machine-readable format. Feature extraction, recognition, and classification into appropriate labels are the primary components of an OCR system. This paper uses image processing and (OCR) based architecture to segment, recognize, and identify documents. Moreover, handwriting has evolved, as evidenced by the different types of handwritten characters such as digit, numeral, cursive script, and symbols English and other languages. The automatic recognition of text can be an extremely useful application in healthcare, education and personal care. The construction of text recognition systems is possible with the help of Easy OCR which has some basic components of data acquisition, processing and Query and visualization. As the name implies, Easy OCR is a Python package that allows computer vision professionals to do Optical Character Recognition with ease. Additionally, our model “Text Transcription” also translates the detected text into user desirable language and generates an audio output to ease out the reading from user’s end by making it more functional and user-friendly. It is two-way system which also does the conversion of speech to text.
References
Patel, S. R., & Jha, J. (2015, January). Notice of Removal: Handwritten character recognition using machine learning approach-A survey. In 2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO) (pp. 1-5). IEEE.
Ghosh, M. M. A., & Maghari, A. Y. (2017, October). A comparative study on handwriting digit recognition using neural networks. In 2017 international conference on promising electronic technologies (ICPET) (pp. 77-81). IEEE.
Shi, B., Bai, X., & Yao, C. (2016). An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE transactions on pattern analysis and machine intelligence, 39(11), 2298-2304.
Refaey, M. A. (2015, April). Ruled lines detection and removal in grey level handwritten image documents. In 2015 6th International Conference on Information and Communication Systems (ICICS) (pp. 218-221). IEEE.
Zhang, D. K. (2008, December). An extended opening operation and its application in image processing. In 2008 International Conference on MultiMedia and Information Technology (pp. 377-380). IEEE.
Scheidl, H., Fiel, S., & Sablatnig, R. (2018, August). Word beam search: A connectionist temporal classification decoding algorithm. In 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR) (pp. 253-258). IEEE.
Islam, K. T., Mujtaba, G., Raj, R. G., & Nweke, H. F. (2017, September). Handwritten digits recognition with artificial neural network. In 2017 International Conference on Engineering Technology and Technopreneurship (ICE2T) (pp. 1-4). IEEE.
Yousaf, A., Khan, M. J., Khan, M. J., Javed, N., Ibrahim, H., Khurshid, K., & Khurshid, K. (2019, January). Size invariant handwritten character recognition using single layer feedforward backpropagation neural networks. In 2019 2nd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET) (pp. 1-7). IEEE.
Lakdawala, B., Khan, F., Khan, A., Tomar, Y., Gupta, R., & Shaikh, A. (2018, April). Voice to Text transcription using CMU Sphinx A mobile application for healthcare organization. In 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT) (pp. 749-753). IEEE.
Carbonell, M., Mas, J., Villegas, M., Fornés, A., & Lladós, J. (2019, September). End-to-end handwritten text detection and transcription in full pages. In 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW) (Vol. 5, pp. 29-34). IEEE..
Refbacks
- There are currently no refbacks.