ICTACT Journal on Communication Technology (Sep 2016)

ACOUSTIC SPEECH RECOGNITION FOR MARATHI LANGUAGE USING SPHINX

  • Aman Ankit,
  • Sonu Kumar Mishra,
  • Rinaz Shaikh,
  • Chandraketu Kumar Gupta,
  • Prakhar Mathur,
  • Soudamini Pawar,
  • Anil Cherukuri

Journal volume & issue
Vol. 7, no. 3
pp. 1361 – 1365

Abstract

Speech recognition, or speech-to-text processing, is the process by which a computer recognizes human speech and converts it into text. In speech recognition, transcripts are created from audio recordings of speech and their text transcriptions. Speech-based applications that incorporate Natural Language Processing (NLP) techniques are popular and an active area of research; both the input to and the output of such applications are in natural language. Speech recognition mostly revolves around three approaches: the acoustic-phonetic approach, the pattern recognition approach, and the artificial intelligence approach. Creating an acoustic model requires a large speech database and training algorithms. The output of an automatic speech recognition (ASR) system is the recognition and translation of spoken language into text by computers and computerized devices. ASR today finds wide application in tasks that require human-machine interfaces, such as voice dialing. Our key contribution in this paper is to create corpora for the Marathi language and to explore the use of the Sphinx engine for automatic speech recognition.

Keywords