Language Interpreter and Speaker

Ruchi Bari; Mrunmayee Apte; Aakanksha Mohite; Sainath Patil

doi:10.17010/ijcs/2022/v7/i2/169682

Language Interpreter and Speaker

Ruchi Bari ¹, Mrunmayee Apte ¹, Aakanksha Mohite ¹, Sainath Patil ²

Affiliations
1 Student, Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India
2 Assistant Professor (Guide), Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India

Language Interpreter and Speaker is a device for identifying the language of the written image text and then converting the same text to speech format. This device would surely be useful for blind and visually impaired people. Language identification (LI) is the method in which we identify the natural language of the given content. It is the process of categorizing a document on the basis of its language. In this generation, we are heading towards a phase where computers would be capable of doing all things that humans can do. Recognition of language used is the initial requirement before reading or learning. To start with any of the tasks, humans first try to understand the task and then process the task. Similarly, for language identification, the machine needs to learn the language and once learning is complete, it should be able to recognize the language. The project is divided into three parts. Initially, the handwritten image text would be converted to normal text. In the second part, the language would be identified from the converted text and last, the text would be converted to audio format. This paper discusses the implementation of this idea, gives an approach to problems and challenges that we came across, and some solutions.

Keywords

AlexNet, CNN (Convolution Neural Network), gTTS (google-text-to-speech), Image Processing.

I-Scholar

Journal Help

User

Subscription Login to verify subscription

Notifications

Journal Content
Browse

Font Size

Information

V. V. Mainkar, J. A. Katkar, A. B. Upade, and P. R. Pednekar, ”Handwritten character recognition to obtain editable text,” in 2020 Int. Conf. Electronics Sustainable Communication Syst., 2020, pp. 599–602, doi: 10.1109/ICESC48915.2020.9155786

N. Jayanthi, H. Harsha, N. Jain, and I. S. Dhingra, "Language detection of text document image," in 2020 7th Int. Conf. Signal Process. Integr. Networks (SPIN), 2020, pp. 647–653, doi: 10.1109/SPIN48934.2020.9071167

S. C. Madre and S. B. Gundre, “OCR based image text to speech conversion using MATLAB,” in 2018 2nd Int. Conf. Intelligent Computing Control Systems, 2018, pp. 858–861, doi: 10.1109/ICCONS.2018.8663023

M. B. Bora, D. Daimary, K. Amitab, and D. Kandar, “Handwritten character recognition from images using CNN-ECOC,” Procedia Comput. Sci., vol. 167, 2020, pp. 2403–2409, doi: 10.1016/j.procs.2020.03.293

A. Choudhary, R. Rishi, and S. Ahlawat, “Off-line handwritten character recognition using features extracted from Binarization technique,” AASRI Procedia, vol. 4, pp. 306–312, 2013, doi: 10.1016/j.aasri.2013.10.045

“Handwritten character,” [Online]. Available: https://www.kaggle.com/vaibhao/handwritten-characters

“IAM-Dataset,” [Online]. Available: https://www.kaggle.com/datasets/naderabdalghani/iam- handwritten-forms-dataset

Abstract Views: 175

PDF Views: 0

Language Interpreter and Speaker

Abstract Views: 175 | PDF Views: 0

Authors

Ruchi Bari
Student, Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India

Mrunmayee Apte
Student, Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India

Aakanksha Mohite
Student, Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India

Sainath Patil
Assistant Professor (Guide), Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India

Abstract

Keywords

AlexNet, CNN (Convolution Neural Network), gTTS (google-text-to-speech), Image Processing.

References

DOI: https://doi.org/10.17010/ijcs%2F2022%2Fv7%2Fi2%2F169682

Username
Password
Remember me

Username
Password
Remember me

Indian Journal of Computer Science

Indian Journal of Computer Science

Language Interpreter and Speaker

Subscribe/Renew Journal

Keywords

Language Interpreter and Speaker

Authors

Abstract

Keywords

References