In the contemporary era, the rapid advancements in artificial intelligence and natural language processing have paved the way for intelligent virtual assistants, revolutionizing the way humans interact with computers. This paper presents MARSAI, a sophisticated ChatGPT clone integrated with cutting-edge voice assistance technology, designed to elevate the user experience in human-computer interactions. MARSAI, short for Multimodal AI-based Responsive Speech Assistant and Interpreter, combines the power of text-based chatbots with the intuitiveness of voice-enabled assistants, creating a seamless and interactive communication platform
The proposed system utilizes OpenAI's GPT-3.5 architecture, enhancing it with custom-trained algorithms to comprehend and respond to user queries in natural language. Moreover, MARSAI incorporates automatic speech recognition (ASR) and text-to-speech (TTS) technologies, enabling users to interact with the system through spoken language. The integration of ASR and TTS is achieved using state-of-the-art neural networks, ensuring high accuracy and naturalness in speech interactions.
Article Details
Unique Paper ID: 162057
Publication Volume & Issue: Volume 10, Issue 7
Page(s): 319 - 322
Article Preview & Download
Share This Article
Join our RMS
Conference Alert
NCSEM 2024
National Conference on Sustainable Engineering and Management - 2024