JARVIS: A Virtual Speech Assistant with Image Recognition
Viren Tamhane, Vedant Tapadia, Atharva Gonjari, Tanishq Valyal
A virtual speech assistant (VSA) is an application that performs tasks on behalf of an individual. Also referred to as intelligent virtual assistants, such systems interpret human speech and respond with synthesized voices. Virtual assistants are used in applications such as home automation, to-do list management, and voice-controlled media playback. In this project we combine image recognition with a virtual speech assistant. We use Convolutional Neural Networks (CNNs) for image classification and recognition, and the gTTS (Google Text-to-Speech) library to synthesize the spoken responses. This report discusses ways in which these technologies can be harnessed to create an intelligent virtual speech assistant with a focus on user-based information. It is suggested that new technologies may soon make such assistants a practical reality. Experiments conducted on this system, combined with user testing, provide evidence that a basic VSA built on CNN algorithms and gTTS can function without the need for any other type of human input.
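The way a CNN classifier and a text-to-speech step might be joined can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the label list, the 0.5 confidence threshold, and the output file name are all assumptions made for illustration. The classifier itself is left out, and the sketch starts from the raw scores such a network would produce for one image.

```python
import math
from typing import Callable, List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Turn raw classifier scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def describe_prediction(logits: Sequence[float],
                        labels: Sequence[str],
                        threshold: float = 0.5) -> str:
    """Build the sentence the assistant would say for one image.

    The threshold is a hypothetical cut-off: below it the assistant
    admits uncertainty instead of naming a class.
    """
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    if probs[best] < threshold:
        return "I am not sure what this is."
    return f"This looks like a {labels[best]}."


def speak_with_gtts(text: str, path: str = "reply.mp3") -> None:
    """Save the sentence as speech using the gTTS package.

    Assumes gTTS is installed and a network connection is available;
    the import is local so the rest of the sketch runs without it.
    """
    from gtts import gTTS
    gTTS(text=text, lang="en").save(path)


def respond(logits: Sequence[float],
            labels: Sequence[str],
            speak: Callable[[str], None]) -> str:
    """Classify, phrase the result, and pass it to a speech back end."""
    sentence = describe_prediction(logits, labels)
    speak(sentence)
    return sentence
```

Passing `speak_with_gtts` as the `speak` argument would write the reply to an audio file, while passing `print` or a list's `append` method lets the same logic be exercised without audio or network access.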
Article Details
Unique Paper ID: 154959

Publication Volume & Issue: Volume 8, Issue 12

Page(s): 888 - 891