Text to speech conversion using google Vision Api

  • Unique Paper ID: 154247
  • Volume: 8
  • Issue: 7
  • PageNo: 140-144
  • Abstract: Building on recent advances in technology, we implement an assistive device that captures a picture from a camera, extracts the text from the captured image, and converts that text to speech as voice output to assist people. The captured image is analyzed using the Google Cloud Vision API's optical character recognition (OCR). Before text extraction, we apply image preprocessing methods to remove noise and blur from the captured image, which often improves recognition accuracy. A software-based text-to-speech component then converts the extracted text to voice output. The Google Cloud Speech API integrates with Google Cloud Storage for data storage.
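
The pipeline described in the abstract (preprocess the image, run OCR, convert the text to speech) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the `Pillow`, `google-cloud-vision`, and `gTTS` packages, and the function names (`denoise_image`, `ocr_with_vision`, `speak_with_gtts`, `normalize_text`, `image_to_speech`) are hypothetical. The median filter is only a stand-in for whatever preprocessing the paper actually uses.

```python
import io


def denoise_image(image_bytes: bytes) -> bytes:
    """Grayscale plus median filter: an assumed, simple stand-in for noise removal."""
    from PIL import Image, ImageFilter  # assumed dependency: Pillow

    img = Image.open(io.BytesIO(image_bytes)).convert("L")
    img = img.filter(ImageFilter.MedianFilter(size=3))
    buf = io.BytesIO()
    img.save(buf, format="PNG")
    return buf.getvalue()


def ocr_with_vision(image_bytes: bytes) -> str:
    """Send the image to Google Cloud Vision text detection and return the full text."""
    from google.cloud import vision  # assumed dependency: google-cloud-vision

    client = vision.ImageAnnotatorClient()
    response = client.text_detection(image=vision.Image(content=image_bytes))
    if response.error.message:
        raise RuntimeError(response.error.message)
    annotations = response.text_annotations
    # The first annotation, when present, holds the whole detected text.
    return annotations[0].description if annotations else ""


def speak_with_gtts(text: str, out_path: str) -> None:
    """Synthesize speech with gTTS (named in the keywords) and save it as MP3."""
    from gtts import gTTS  # assumed dependency: gTTS

    gTTS(text=text, lang="en").save(out_path)


def normalize_text(text: str) -> str:
    """Collapse line breaks and repeated whitespace so the speech reads smoothly."""
    return " ".join(text.split())


def image_to_speech(image_bytes, out_path,
                    preprocess=denoise_image,
                    ocr=ocr_with_vision,
                    tts=speak_with_gtts) -> str:
    """Run preprocess -> OCR -> TTS; each stage is swappable for testing."""
    text = normalize_text(ocr(preprocess(image_bytes)))
    if text:  # skip synthesis when no text was detected
        tts(text, out_path)
    return text
```

Calling `image_to_speech(open("photo.jpg", "rb").read(), "speech.mp3")` would need Google Cloud credentials available to the client library (for example through `GOOGLE_APPLICATION_CREDENTIALS`). Passing each stage as a parameter keeps the sketch testable without network access.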

Copyright & License

Copyright © 2025. The authors retain the copyright of this article. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{154247,
        author = {Karun Somasunder M and Amal J.S and Gopal Gopakumar and Suraj V Thomas and Keerthi Krishnan},
        title = {Text to speech conversion using google Vision Api},
        journal = {International Journal of Innovative Research in Technology},
        year = {},
        volume = {8},
        number = {7},
        pages = {140--144},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=154247},
        abstract = {Building on recent advances in technology, we implement an assistive device that captures a picture from a camera, extracts the text from the captured image, and converts that text to speech as voice output to assist people. The captured image is analyzed using the Google Cloud Vision API's optical character recognition (OCR). Before text extraction, we apply image preprocessing methods to remove noise and blur from the captured image, which often improves recognition accuracy. A software-based text-to-speech component then converts the extracted text to voice output. The Google Cloud Speech API integrates with Google Cloud Storage for data storage.},
        keywords = {OCR, Google Cloud Vision API, Text to Speech Conversion, gTTS, Flask},
        month = {},
        }

