Open Access Open Access  Restricted Access Subscription Access

Language Interpreter and Speaker


Affiliations
1 Student, Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India
2 Assistant Professor (Guide), Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India

   Subscribe/Renew Journal


Language Interpreter and Speaker is a device for identifying the language of the written image text and then converting the same text to speech format. This device would surely be useful for blind and visually impaired people. Language identification (LI) is the method in which we identify the natural language of the given content. It is the process of categorizing a document on the basis of its language. In this generation, we are heading towards a phase where computers would be capable of doing all things that humans can do. Recognition of language used is the initial requirement before reading or learning. To start with any of the tasks, humans first try to understand the task and then process the task. Similarly, for language identification, the machine needs to learn the language and once learning is complete, it should be able to recognize the language. The project is divided into three parts. Initially, the handwritten image text would be converted to normal text. In the second part, the language would be identified from the converted text and last, the text would be converted to audio format. This paper discusses the implementation of this idea, gives an approach to problems and challenges that we came across, and some solutions.

Keywords

AlexNet, CNN (Convolution Neural Network), gTTS (google-text-to-speech), Image Processing.
User
Subscription Login to verify subscription
Notifications
Font Size

  • V. V. Mainkar, J. A. Katkar, A. B. Upade, and P. R. Pednekar, ”Handwritten character recognition to obtain editable text,” in 2020 Int. Conf. Electronics Sustainable Communication Syst., 2020, pp. 599–602, doi: 10.1109/ICESC48915.2020.9155786
  • N. Jayanthi, H. Harsha, N. Jain, and I. S. Dhingra, "Language detection of text document image," in 2020 7th Int. Conf. Signal Process. Integr. Networks (SPIN), 2020, pp. 647–653, doi: 10.1109/SPIN48934.2020.9071167
  • S. C. Madre and S. B. Gundre, “OCR based image text to speech conversion using MATLAB,” in 2018 2nd Int. Conf. Intelligent Computing Control Systems, 2018, pp. 858–861, doi: 10.1109/ICCONS.2018.8663023
  • M. B. Bora, D. Daimary, K. Amitab, and D. Kandar, “Handwritten character recognition from images using CNN-ECOC,” Procedia Comput. Sci., vol. 167, 2020, pp. 2403–2409, doi: 10.1016/j.procs.2020.03.293
  • A. Choudhary, R. Rishi, and S. Ahlawat, “Off-line handwritten character recognition using features extracted from Binarization technique,” AASRI Procedia, vol. 4, pp. 306–312, 2013, doi: 10.1016/j.aasri.2013.10.045
  • “Handwritten character,” [Online]. Available: https://www.kaggle.com/vaibhao/handwritten-characters
  • “IAM-Dataset,” [Online]. Available: https://www.kaggle.com/datasets/naderabdalghani/iam- handwritten-forms-dataset

Abstract Views: 175

PDF Views: 0




  • Language Interpreter and Speaker

Abstract Views: 175  |  PDF Views: 0

Authors

Ruchi Bari
Student, Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India
Mrunmayee Apte
Student, Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India
Aakanksha Mohite
Student, Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India
Sainath Patil
Assistant Professor (Guide), Vidyavardhini College of Engineering and Technology, K.T. Marg, Vartak College Campus, Vasai Road (W), District Palghar, Vasai, Maharashtra - 401 202, India

Abstract


Language Interpreter and Speaker is a device for identifying the language of the written image text and then converting the same text to speech format. This device would surely be useful for blind and visually impaired people. Language identification (LI) is the method in which we identify the natural language of the given content. It is the process of categorizing a document on the basis of its language. In this generation, we are heading towards a phase where computers would be capable of doing all things that humans can do. Recognition of language used is the initial requirement before reading or learning. To start with any of the tasks, humans first try to understand the task and then process the task. Similarly, for language identification, the machine needs to learn the language and once learning is complete, it should be able to recognize the language. The project is divided into three parts. Initially, the handwritten image text would be converted to normal text. In the second part, the language would be identified from the converted text and last, the text would be converted to audio format. This paper discusses the implementation of this idea, gives an approach to problems and challenges that we came across, and some solutions.

Keywords


AlexNet, CNN (Convolution Neural Network), gTTS (google-text-to-speech), Image Processing.

References





DOI: https://doi.org/10.17010/ijcs%2F2022%2Fv7%2Fi2%2F169682