Text to speech conversion using google Vision Api
Author(s):
Karun Somasunder M, Amal J.S, Gopal Gopakumar, Suraj V Thomas, Keerthi Krishnan
Keywords:
OCR, Google Cloud Vision API, Text to Speech Conversion, gTTS, Flask
Abstract
With recent advancement within the technology, we shall implement an assistive device that's capable of capturing a picture from a camera and extracting the text from the captured image and further to convert the text to speech as voice-based output to assist the people. The captured image is analyzed using Google Cloud Vision API Optical Character recognition (OCR). So as to extract text, we use image preprocessing methods to obviate any noise or blur within the captured image so that the accuracy is often increased. Further, we include software-based text to speech to convert the text to speech as voice output. The Google Cloud Speech API integrates with Google Cloud Storage for data storage
Article Details
Unique Paper ID: 154247

Publication Volume & Issue: Volume 8, Issue 7

Page(s): 140 - 144
Article Preview & Download


Share This Article

Join our RMS

Conference Alert

NCSEM 2024

National Conference on Sustainable Engineering and Management - 2024

Last Date: 15th March 2024

Call For Paper

Volume 10 Issue 10

Last Date for paper submitting for March Issue is 25 June 2024

About Us

IJIRT.org enables door in research by providing high quality research articles in open access market.

Send us any query related to your research on editor@ijirt.org

Social Media

Google Verified Reviews