Voice-Activated and Gesture-Controlled Intelligent Chatbot with Integrated Task Automation

  • Unique Paper ID: 176052
  • PageNo: 5116-5122
  • Abstract:
  • As technology progresses, advanced HCI subclasses such as multimodal systems impact and improve usability and accessibility. This paper introduces a new type of Voice-Activated and Gesture-Controlled Intelligent Chatbot with system automation features. It seeks to mitigate the problems posed by the conventional single-modal interaction systems. The suggested model implements a hybrid execution strategy using both voice and hand command modulations. System controls such as media playback, volume adjustment, file management, and window operations serve as functionalities of a user-friendly system. For real time hand gesture recognition, MediaPipe is used. SpeechRecognition captures voice input and processes them, while context-aware responses are provided by Google Gemini AI. Experiment validation reveals the proposed model performs well with a gesture recognition accuracy of 99.2%, AI response accuracy of 98.5%, and voice recognition accuracy of 97.1%. In comparison to other systems, this one stands out because of its hybrid execution, versatility, and real time capabilities. This model facilitates total hands-free command control for smart environments and is ideal for accessibility and automation technology, broadening the scope of interaction within the realm of intelligent environments

Copyright & License

Copyright © 2026 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{176052,
        author = {Kandregula Nuraj Mani Sai and Dr. D. Sirisha and E Sampath Kumar and K Kartheek and G Sujeevan Rao and B Akshay Kumar},
        title = {Voice-Activated and Gesture-Controlled Intelligent Chatbot with Integrated Task Automation},
        journal = {International Journal of Innovative Research in Technology},
        year = {2025},
        volume = {11},
        number = {11},
        pages = {5116-5122},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=176052},
        abstract = {As technology progresses, advanced HCI subclasses such as multimodal systems impact and improve usability and accessibility. This paper introduces a new type of Voice-Activated and Gesture-Controlled Intelligent Chatbot with system automation features. It seeks to mitigate the problems posed by the conventional single-modal interaction systems. The suggested model implements a hybrid execution strategy using both voice and hand command modulations. System controls such as media playback, volume adjustment, file management, and window operations serve as functionalities of a user-friendly system. For real time hand gesture recognition, MediaPipe is used. SpeechRecognition captures voice input and processes them, while context-aware responses are provided by Google Gemini AI. Experiment validation reveals the proposed model performs well with a gesture recognition accuracy of 99.2%, AI response accuracy of 98.5%, and voice recognition accuracy of 97.1%. In comparison to other systems, this one stands out because of its hybrid execution, versatility, and real time capabilities. This model facilitates total hands-free command control for smart environments and is ideal for accessibility and automation technology, broadening the scope of interaction within the realm of intelligent environments},
        keywords = {Gesture Recognition, Voice-Activated Chatbot, AI Chatbot, System Automation, Human-Computer Interaction (HCI), MediaPipe, Speech Recognition, Google Gemini AI, Hybrid Execution Model, Smart Systems.},
        month = {April},
        }

Cite This Article

Sai, K. N. M., & Sirisha, D. D., & Kumar, E. S., & Kartheek, K., & Rao, G. S., & Kumar, B. A. (2025). Voice-Activated and Gesture-Controlled Intelligent Chatbot with Integrated Task Automation. International Journal of Innovative Research in Technology (IJIRT), 11(11), 5116–5122.

Related Articles