Advanced Feature Extraction for Robust Speech Recognition Based on Maximizing the Sharpness of the Power Distribution

  • Unique Paper ID: 154606
  • Volume: 8
  • Issue: 11
  • PageNo: 341-343
  • Abstract:
  • This paper presents a new robust feature extraction algorithm based on a modified approach to power bias subtraction combined with applying a threshold to the power spectral density. Power bias level is selected as a level above which the signal power distribution is sharpest. The sharpness is measured using the ratio of arithmetic mean to the geometric mean of medium-duration power. When subtracting this bias level, power flooring is applied to enhance robustness. These new ideas are employed to enhance our recently introduced feature extraction algorithm PNCC (Power Normalized Cepstral Coefficient). While simpler than our previous PNCC, experimental results show that this new PNCC is showing better performance than our previous implementation.

Copyright & License

Copyright © 2025 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{154606,
        author = {Dr. B. C. Premkumar and Dr. Prasanna Kumar. C and Dr. Suresh D},
        title = {Advanced Feature Extraction for Robust Speech Recognition Based on Maximizing the Sharpness of the Power Distribution},
        journal = {International Journal of Innovative Research in Technology},
        year = {},
        volume = {8},
        number = {11},
        pages = {341-343},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=154606},
        abstract = {This paper presents a new robust feature extraction algorithm based on a modified approach to power bias subtraction combined with applying a threshold to the power spectral density. Power bias level is selected as a level above which the signal power distribution is sharpest. The sharpness is measured using the ratio of arithmetic mean to the geometric mean of medium-duration power. When subtracting this bias level, power flooring is applied to enhance robustness. These new ideas are employed to enhance our recently introduced feature extraction algorithm PNCC (Power Normalized Cepstral Coefficient). While simpler than our previous PNCC, experimental results show that this new PNCC is showing better performance than our previous implementation.},
        keywords = {Robust speech recognition, physiological modeling, sharpness of power distribution, power flooring, auditory threshold.},
        month = {},
        }

Related Articles