Open Access Open Access  Restricted Access Subscription Access

Text to Speech Conversion


Affiliations
1 Department of CSE, K L University, Vaddeswarm, Guntur – 522502, Andhra Pradesh, India
2 Department of ECM, K L University, Vaddeswarm, Guntur – 522502, Andhra Pradesh, India
 

The present paper has introduced an innovative, efficient and real-time cost beneficial technique that enables user to hear the contents of text images instead of reading through them. It combines the concept of Optical Character Recognition (OCR) and Text to Speech Synthesizer (TTS) in Raspberry pi. This kind of system helps visually impaired people to interact with computers effectively through vocal interface. Text Extraction from color images is a challenging task in computer vision. Text-to-Speech conversion is a method that scans and reads English alphabets and numbers that are in the image using OCR technique and changing it to voices. This paper describes the design, implementation and experimental results of the device. This device consists of two modules, image processing module and voice processing module. The device was developed based on Raspberry Pi v2 with 900 MHz processor speed.

Keywords

Image Processing, OCR, Text Extraction, Text-to-speech, Voice Processing.
User

Abstract Views: 225

PDF Views: 0




  • Text to Speech Conversion

Abstract Views: 225  |  PDF Views: 0

Authors

S. Venkateswarlu
Department of CSE, K L University, Vaddeswarm, Guntur – 522502, Andhra Pradesh, India
D. B. K. Kamesh
Department of CSE, K L University, Vaddeswarm, Guntur – 522502, Andhra Pradesh, India
J. K. R. Sastry
Department of ECM, K L University, Vaddeswarm, Guntur – 522502, Andhra Pradesh, India
Radhika Rani
Department of CSE, K L University, Vaddeswarm, Guntur – 522502, Andhra Pradesh, India

Abstract


The present paper has introduced an innovative, efficient and real-time cost beneficial technique that enables user to hear the contents of text images instead of reading through them. It combines the concept of Optical Character Recognition (OCR) and Text to Speech Synthesizer (TTS) in Raspberry pi. This kind of system helps visually impaired people to interact with computers effectively through vocal interface. Text Extraction from color images is a challenging task in computer vision. Text-to-Speech conversion is a method that scans and reads English alphabets and numbers that are in the image using OCR technique and changing it to voices. This paper describes the design, implementation and experimental results of the device. This device consists of two modules, image processing module and voice processing module. The device was developed based on Raspberry Pi v2 with 900 MHz processor speed.

Keywords


Image Processing, OCR, Text Extraction, Text-to-speech, Voice Processing.



DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i38%2F127096