Text Reader for Visually Impaired Using Google Cloud Vision API

  • Unique Paper ID: 146508
  • PageNo: 869-873
  • Abstract: Visually impaired people face many challenges every day, from reading the label on a frozen dinner to working out whether they are at the right bus stop. Existing solutions include Braille, in which information is conveyed through tactile patterns. Other aids include liquid level indicators, coin sorters, and large-button telephones for daily living, as well as technological aids such as electronic magnifiers, audio books, and text-to-speech technology. This paper proposes a system that facilitates reading for blind people. The system extracts text from images using the Google Cloud Vision API. The approach is capable of recognizing text in challenging conditions where traditional OCR systems fail, such as blur, low resolution, low contrast, high image noise, and distortion. The extracted text is then converted into audio output as synthetic speech. The proposed system should therefore be very helpful to visually impaired people.

Copyright & License

Copyright © 2026 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{146508,
        author = {Yash Shirke and Paras Doshi and Tejas Hegde and Pranav Dhanvij},
        title = {Text Reader for Visually Impaired Using Google Cloud Vision API},
        journal = {International Journal of Innovative Research in Technology},
        year = {},
        volume = {4},
        number = {12},
        pages = {869-873},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=146508},
        abstract = {Visually impaired people face many challenges every day, from reading the label on a frozen dinner to working out whether they are at the right bus stop. Existing solutions include Braille, in which information is conveyed through tactile patterns. Other aids include liquid level indicators, coin sorters, and large-button telephones for daily living, as well as technological aids such as electronic magnifiers, audio books, and text-to-speech technology. This paper proposes a system that facilitates reading for blind people. The system extracts text from images using the Google Cloud Vision API. The approach is capable of recognizing text in challenging conditions where traditional OCR systems fail, such as blur, low resolution, low contrast, high image noise, and distortion. The extracted text is then converted into audio output as synthetic speech. The proposed system should therefore be very helpful to visually impaired people.},
        keywords = {OCR, Google Cloud Vision API, Text to Speech, Raspberry Pi},
        month = {},
        }

Cite This Article

Shirke, Y., Doshi, P., Hegde, T., & Dhanvij, P. (n.d.). Text Reader for Visually Impaired Using Google Cloud Vision API. International Journal of Innovative Research in Technology (IJIRT), 4(12), 869–873.
