Real Time Language Translation for Video Calls

  • Unique Paper ID: 188190
  • Volume: 12
  • Issue: 7
  • PageNo: 881-889
  • Abstract:
  • Communicating effectively in global video calls is crucial in today's interconnected world, yet traditional video conferencing applications often lack real-time translation, leading to language barriers and miscommunication. This project is an intelligent video call application designed to enhance communication by integrating real-time translation and captioning, ensuring that participants can understand each other regardless of their native language. At its core, this application allows users to create and join video rooms with advanced functionalities such as real-time multilingual transcription, live captioning, real-time synchronization (via Pusher), and high-quality video streaming (via WebRTC / LiveKit).Built on the T3 Stack (Next.js).ts, the application leverages a PostgreSQL database with Prisma ORM for cloud storage, tRPC for a type-safe API, and LiveKit for scalable video services. By leveraging cutting-edge NLP techniques and AI services, this project enhances user interactions through Speech-to-Text (via the Web Speech API), Machine Translation (via Microsoft Translator), Text-to-Speech, and AI-driven Summarization. This approach not only removes the manual effort of translation but also supports users in fully understanding the conversation, enabling better collaboration and meeting outcomes.

Copyright & License

Copyright © 2025 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{188190,
        author = {Nagarjun Nayak and Aprameya P and Karthik R and Kaushal C Acharya and Rashmi P C},
        title = {Real Time Language Translation for Video Calls},
        journal = {International Journal of Innovative Research in Technology},
        year = {2025},
        volume = {12},
        number = {7},
        pages = {881-889},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=188190},
        abstract = {Communicating effectively in global video calls is crucial in today's interconnected world, yet traditional video conferencing applications often lack real-time translation, leading to language barriers and miscommunication. This project is an intelligent video call application designed to enhance communication by integrating real-time translation and captioning, ensuring that participants can understand each other regardless of their native language. At its core, this application allows users to create and join video rooms with advanced functionalities such as real-time multilingual transcription, live captioning, real-time synchronization (via Pusher), and high-quality video streaming (via WebRTC / LiveKit).Built on the T3 Stack (Next.js).ts, the application leverages a PostgreSQL database with Prisma ORM for cloud storage, tRPC for a type-safe API, and LiveKit for scalable video services. By leveraging cutting-edge NLP techniques and AI services, this project enhances user interactions through Speech-to-Text (via the Web Speech API), Machine Translation (via Microsoft Translator), Text-to-Speech, and AI-driven Summarization. This approach not only removes the manual effort of translation but also supports users in fully understanding the conversation, enabling better collaboration and meeting outcomes.},
        keywords = {Video Conferencing; Real-time Translation; Machine Translation; Live Captions; Web Application; Next.js; tRPC; LiveKit; WebRTC; Web Speech API; Prisma ORM; Artificial Intelligence; NLP (Natural Language Processing)},
        month = {December},
        }

Cite This Article

  • ISSN: 2349-6002
  • Volume: 12
  • Issue: 7
  • PageNo: 881-889

Real Time Language Translation for Video Calls

Related Articles