Analysis of Speech Emotion: A Survey of SER and FER with Multilingual Dialects

  • Unique Paper ID: 169426
  • PageNo: 994-1001
  • Abstract: With applications in customer service, healthcare, and entertainment, emotion identification is essential to enhancing human-computer interaction. Either Speech Emotion Recognition (SER) or Facial Emotion Recognition (FER) systems have been a major component of traditional approaches to emotion analysis. However, more resilient, multi-modal techniques that integrate speech and facial expressions for a thorough comprehension of human emotions are required due to the growing complexity of real-world applications. With an emphasis on how these systems might be combined for improved emotion recognition, this survey investigates the synergies between SER and FER technologies. An overview of the state of research in SER and FER is given in this study, with a focus on multi-modal systems that take dialect and cultural quirks into account. We hope to provide future research directions that could result in more precise and culturally sensitive emotion identification systems by reviewing current developments, difficulties, and possible solutions.

Copyright & License

Copyright © 2026. Authors retain the copyright of this article. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{169426,
        author = {Arundhati Wani and Rujuta Kulkarni and Ananthan Nair and Vishvjita Savkare and Punam Chavan},
        title = {Analysis of Speech Emotion: A Survey of SER and FER with Multilingual Dialects},
        journal = {International Journal of Innovative Research in Technology},
        year = {2024},
        volume = {11},
        number = {6},
        pages = {994--1001},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=169426},
        abstract = {With applications in customer service, healthcare, and entertainment, emotion identification is essential to enhancing human-computer interaction. Either Speech Emotion Recognition (SER) or Facial Emotion Recognition (FER) systems have been a major component of traditional approaches to emotion analysis. However, more resilient, multi-modal techniques that integrate speech and facial expressions for a thorough comprehension of human emotions are required due to the growing complexity of real-world applications. With an emphasis on how these systems might be combined for improved emotion recognition, this survey investigates the synergies between SER and FER technologies. An overview of the state of research in SER and FER is given in this study, with a focus on multi-modal systems that take dialect and cultural quirks into account. We hope to provide future research directions that could result in more precise and culturally sensitive emotion identification systems by reviewing current developments, difficulties, and possible solutions.},
        keywords = {emotion recognition; facial emotion recognition; dialects; multi-modal system},
        month = {November},
        }

Cite This Article

Wani, A., Kulkarni, R., Nair, A., Savkare, V., & Chavan, P. (2024). Analysis of Speech Emotion: A Survey of SER and FER with Multilingual Dialects. International Journal of Innovative Research in Technology (IJIRT), 11(6), 994–1001.
