AUTOMATIC SPEECH TO TEXT USING ACOUSTIC MODELLING AND DEEP LEARNING

  • Unique Paper ID: 159982
  • Volume: 9
  • Issue: 12
  • PageNo: 966-970
  • Abstract:
  • Language is the most important means of communication and speech is its main medium. Many research activities are being conducted on Automatic Speech Recognition. But, ASR systems have a major drawback in their performance i.e., efficiency. Improving the efficiency in an ASR system is quite difficult. Currently, research is being carried out to the finding of the next state of the world using Hidden Markow Model (HMM). Our Study concludes that for ASR systems, Deep Learning techniques is a more suitable application, because it increases the efficiency of the whole process. We are going to represent our work on building a speaker independent, large vocabulary continuous speech recognition system for English and Hindi. Our Study concludes that for ASR systems, Deep Learning techniques is a more suitable application, because it increases the efficiency of the whole process. Based on the conclusion, we will utilize readily available language models to build an offline live Speech recognition to Text System and Translation. Using offline speech recognition toolkits like VOSK and Kaldi an offline model is developed. Argos translate an open-source library is used to translate English speech to Hindi text. Raspberry Pi 4 is used to implement the offline speech recognition module

Copyright & License

Copyright © 2025 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{159982,
        author = {Dr.R.Jayalakshmi and Sagar D and Somalaraju Yashwanth Varma and Syed Mohammad Musharraf and TEJASWIN R M and N Venkata Vamshi and Yelugubanti Manikanta},
        title = {AUTOMATIC SPEECH TO TEXT USING ACOUSTIC MODELLING AND DEEP LEARNING},
        journal = {International Journal of Innovative Research in Technology},
        year = {},
        volume = {9},
        number = {12},
        pages = {966-970},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=159982},
        abstract = {Language is the most important means of communication and speech is its main medium. Many research activities are being conducted on Automatic Speech Recognition. But, ASR systems have a major drawback in their performance i.e., efficiency. Improving the efficiency in an ASR system is quite difficult. Currently, research is being carried out to the finding of the next state of the world using Hidden Markow Model (HMM). Our Study concludes that for ASR systems, Deep Learning techniques is a more suitable application, because it increases the efficiency of the whole process. We are going to represent our work on building a speaker independent, large vocabulary continuous speech 


recognition system for English and Hindi. Our Study concludes that for ASR systems, Deep Learning techniques is a more suitable application, because it increases the efficiency of the whole process. Based on the conclusion, we will utilize readily available language models to build an offline live Speech recognition to Text System and Translation. Using offline speech recognition toolkits like VOSK and Kaldi an offline model is developed. Argos translate an open-source library is used to translate English speech to Hindi text. Raspberry Pi 4 is used to implement the offline speech recognition module
},
        keywords = {Speech recognition system, speech processing, Feature extraction techniques, modelling techniques, applications of SRS, NLP and ASR system, Word Error Rate(WER), CNN and RNN .},
        month = {},
        }

Cite This Article

  • ISSN: 2349-6002
  • Volume: 9
  • Issue: 12
  • PageNo: 966-970

AUTOMATIC SPEECH TO TEXT USING ACOUSTIC MODELLING AND DEEP LEARNING

Related Articles