Open Access Open Access  Restricted Access Subscription Access

Voice & Text Translator

Adiba Noor, Krish Chopra, Ayush Minj, Priyesh Kumar, Kauleshwar Prasad, Dinesh Kumar Bhawnani

Abstract


The Voice and Text Translator is about translating one form of speech to another. This is a multi-tasking project which can perform various tasks at the same time. The definition of voice recognition should be taken into account, as it is often correlated with the process of identifying a person from his or her voice, i.e., the recognition of a speaker. The main aim of this app is to provide a mechanism for Speech to Speech. It further provides the mechanism for Speech to Text, Text to Speech, and also Text to Text in various languages. This software which deals with speech recognition has to get adapted to the unpredictable and highly variable nature of the human race. Every algorithm that is involved in the speech recognition process is tested and trained on different speaking styles, languages, accents, phrasings, or speaking patterns. Moreover, all these softwares also have to separate the actually spoken speech(audio) from the unwanted background noise that often accompanies these signals.

This also contains some advanced features like voice recognition, understanding, and conversion, which are difficult for a machine to perform. But nowadays AI and ML are dominating the Tech. At its core, this project is built in the python programming language. Python is known for its rich and vast library and its usage in almost all kinds of projects. So, this Translator also contains some very precious modules of Python like Google-Trans, gTTs, etc. A broad array and series of research in the fields of computer science and linguistics are used in the speech recognition process. To have an easier life or to take part in the trending technologies that include hands-free use of any device, almost all the modern devices are adapting and shifting towards integrating speech recognition functions into their device and making up with the new trending technologies.

After the implementation of this project, there would be no need for any intermediate person who conveys a message from source to target, like in earlier days.


Full Text:

PDF

References


Arora, K., Arora, S., & Roy, M. K. (2013). Speech to speech translation: a communication boon. CSI transactions on ICT, 1(3), 207-213.

Bano, S., Jithendra, P., Niharika, G. L., & Sikhi, Y. (2020, November). Speech to text translation enabling multilingualism. In 2020 IEEE International Conference for Innovation in Technology (INOCON) (pp. 1-4). IEEE.

Limbu, S. H. (2020). Direct Speech to Speech Translation Using Machine Learning.

Bahar, P., Bieschke, T., & Ney, H. (2019, December). A comparative study on end-to-end speech to text translation. In 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp. 792-799). IEEE.

Patil, S., Pradeep, N.. Speech translation system for language barrier reduction. International Research Journal of Engineering and Technology (IRJET)


Refbacks

  • There are currently no refbacks.