GenVox: A Voice-Activated Multi-Modal Generative AI Companion

SIDDAMSETTI VEERA VENKATA PRASANNA KUMAR; LOPINTI GANESH KUMAR; PANCHALA MANIKANTA; JAMPANA MAHALAKSHMI; Dr. D. Anusha

Unique Paper ID: 175182
Volume: 11
Issue: 11
PageNo: 2051-2057

Abstract:
This paper describes GenVox, a multimodal, voice-driven generative AI assistant that brings together recent advances in natural language processing (NLP), image generation, and sound generation. GenVox lets users interact through voice commands and receive responses in several forms, including text, images, and sound. Designed as a personal assistant, a creative companion, and a learning aid, GenVox uses generative AI models to tailor its responses to users' requirements. Major features include voice-guided text generation, AI-based story creation, content generation, question answering, summarization, voice-activated image generation, and interactive cross-modal content generation. The project builds on large language models (LLMs), Google gTTS and pyttsx3 for text-to-speech synthesis, Python 3 for coding, Kivy for building the Android app, and APIs from Together AI and Hugging Face for access to existing generative models.
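
The cross-modal behaviour summarized above can be pictured as a small dispatcher that takes an already-transcribed voice command and routes it to a text, image, or audio backend. The following is only a minimal sketch under stated assumptions, not the authors' implementation: the keyword sets, `pick_modality`, `handle_command`, and the stub handlers are all hypothetical names introduced for illustration. In a real build, the stubs would presumably be replaced by calls to an LLM API, an image-generation API, and a text-to-speech library.

```python
import re
from typing import Callable, Dict, Tuple

Handler = Callable[[str], str]

# Hypothetical trigger words; the paper does not specify how commands are routed.
IMAGE_WORDS = {"draw", "image", "picture", "paint"}
AUDIO_WORDS = {"speak", "narrate", "aloud"}


def pick_modality(command: str) -> str:
    """Choose an output modality from the words in a transcribed command."""
    words = set(re.findall(r"[a-z]+", command.lower()))
    if words & IMAGE_WORDS:
        return "image"
    if words & AUDIO_WORDS:
        return "audio"
    return "text"  # default: answer with generated text


def handle_command(command: str, handlers: Dict[str, Handler]) -> Tuple[str, str]:
    """Route the command to the matching handler and return (modality, result)."""
    modality = pick_modality(command)
    return modality, handlers[modality](command)


# Stub backends (assumptions): stand-ins for an LLM, an image model, and a TTS engine.
stub_handlers: Dict[str, Handler] = {
    "text": lambda prompt: f"[LLM reply to: {prompt}]",
    "image": lambda prompt: f"[image generated for: {prompt}]",
    "audio": lambda prompt: f"[speech synthesized for: {prompt}]",
}

if __name__ == "__main__":
    for spoken in ("Draw a castle at sunset",
                   "Summarize this article",
                   "Narrate a bedtime story"):
        print(handle_command(spoken, stub_handlers))
```

Passing the handlers in as a dictionary keeps the routing logic independent of any particular model provider, so a backend can be swapped without touching the dispatcher.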

ISSN: 2349-6002
Available: https://ijirt.org/Article?manuscript=175182

Impact Factor
8.01 (Year 2024)

UGC Approved
Journal no 47859
