Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Acoustic Speech Recognition for Marathi Language Using Sphinx


Affiliations
1 Department of Computer Engineering, Dr. D.Y. Patil College of Engineering, India
     

   Subscribe/Renew Journal


Speech recognition or speech to text processing, is a process of recognizing human speech by the computer and converting into text. In speech recognition, transcripts are created by taking recordings of speech as audio and their text transcriptions. Speech based applications which include Natural Language Processing (NLP) techniques are popular and an active area of research. Input to such applications is in natural language and output is obtained in natural language. Speech recognition mostly revolves around three approaches namely Acoustic phonetic approach, Pattern recognition approach and Artificial intelligence approach. Creation of acoustic model requires a large database of speech and training algorithms. The output of an ASR system is recognition and translation of spoken language into text by computers and computerized devices. ASR today finds enormous application in tasks that require human machine interfaces like, voice dialing, and etc. Our key contribution in this paper is to create corpora for Marathi language and explore the use of Sphinx engine for automatic speech recognition.

Keywords

Automatic Speech Recognition (ASR), Speech Analysis, Modeling Techniques, Acoustic-Phonetic Approach, Pattern Recognition Techniques, Language Modeling, Hidden Markov Model, Sphinx Engine, Phonemes, Lexicons.
Subscription Login to verify subscription
User
Notifications
Font Size

Abstract Views: 229

PDF Views: 2




  • Acoustic Speech Recognition for Marathi Language Using Sphinx

Abstract Views: 229  |  PDF Views: 2

Authors

Aman Ankit
Department of Computer Engineering, Dr. D.Y. Patil College of Engineering, India
Sonu Kumar Mishra
Department of Computer Engineering, Dr. D.Y. Patil College of Engineering, India
Rinaz Shaikh
Department of Computer Engineering, Dr. D.Y. Patil College of Engineering, India
Chandraketu Kumar Gupta
Department of Computer Engineering, Dr. D.Y. Patil College of Engineering, India
Prakhar Mathur
Department of Computer Engineering, Dr. D.Y. Patil College of Engineering, India
Soudamini Pawar
Department of Computer Engineering, Dr. D.Y. Patil College of Engineering, India
Anil Cherukuri
Department of Computer Engineering, Dr. D.Y. Patil College of Engineering, India

Abstract


Speech recognition or speech to text processing, is a process of recognizing human speech by the computer and converting into text. In speech recognition, transcripts are created by taking recordings of speech as audio and their text transcriptions. Speech based applications which include Natural Language Processing (NLP) techniques are popular and an active area of research. Input to such applications is in natural language and output is obtained in natural language. Speech recognition mostly revolves around three approaches namely Acoustic phonetic approach, Pattern recognition approach and Artificial intelligence approach. Creation of acoustic model requires a large database of speech and training algorithms. The output of an ASR system is recognition and translation of spoken language into text by computers and computerized devices. ASR today finds enormous application in tasks that require human machine interfaces like, voice dialing, and etc. Our key contribution in this paper is to create corpora for Marathi language and explore the use of Sphinx engine for automatic speech recognition.

Keywords


Automatic Speech Recognition (ASR), Speech Analysis, Modeling Techniques, Acoustic-Phonetic Approach, Pattern Recognition Techniques, Language Modeling, Hidden Markov Model, Sphinx Engine, Phonemes, Lexicons.