AI Powered Visual Assistant Using Object Detection And Text Recognition

  • Unique Paper ID: 186824
  • Volume: 12
  • Issue: 6
  • PageNo: 2336-2343
  • Abstract:
  • The “AI-Powered Visual Assistant Using Object Detection and Text Recognition” project aims to help individuals with visual impairments by providing real-time awareness of their surroundings. The system uses a mobile camera to capture live images, which are processed through on-device machine learning models for object detection and Optical Character Recognition (OCR). Android’s Text-to-Speech (TTS) engine then converts the identified objects and recognized text into audible speech, offering users immediate and clear audio feedback. The application, developed using Java and Kotlin in the Android Studio environment, ensures smooth, lag-free operation without relying on external servers, thereby preserving both efficiency and privacy. The system delivers reliable performance under varying lighting and environmental conditions through optimized model integration and image preprocessing. By leveraging artificial intelligence and computer vision, this solution enhances accessibility, mobility, and independence for visually challenged users. Furthermore, it demonstrates how AI-driven mobile applications can contribute to inclusive technology development, empowering users to interact confidently with their environment in daily life.

Copyright & License

Copyright © 2025 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{186824,
        author = {Pavithra A and Poornima S M and Nisha M and Anitha R},
        title = {AI Powered Visual Assistant Using Object Detection And Text Recognition},
        journal = {International Journal of Innovative Research in Technology},
        year = {2025},
        volume = {12},
        number = {6},
        pages = {2336-2343},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=186824},
        abstract = {The “AI-Powered Visual Assistant Using Object Detection and Text Recognition” project aims to help individuals with visual impairments by providing real-time awareness of their surroundings. The system uses a mobile camera to capture live images, which are processed through on-device machine learning models for object detection and Optical Character Recognition (OCR). Android’s Text-to-Speech (TTS) engine then converts the identified objects and recognized text into audible speech, offering users immediate and clear audio feedback. The application, developed using Java and Kotlin in the Android Studio environment, ensures smooth, lag-free operation without relying on external servers, thereby preserving both efficiency and privacy. The system delivers reliable performance under varying lighting and environmental conditions through optimized model integration and image preprocessing. By leveraging artificial intelligence and computer vision, this solution enhances accessibility, mobility, and independence for visually challenged users. Furthermore, it demonstrates how AI-driven mobile applications can contribute to inclusive technology development, empowering users to interact confidently with their environment in daily life.},
        keywords = {Artificial Intelligence (AI), Computer Vision, Object Detection, Optical Character Recognition (OCR), Text-to-Speech (TTS), Assistive Technology, Android Application, Visually Impaired.},
        month = {November},
        }

Cite This Article

  • ISSN: 2349-6002
  • Volume: 12
  • Issue: 6
  • PageNo: 2336-2343

AI Powered Visual Assistant Using Object Detection And Text Recognition

Related Articles