Nithin Kumar B, Manoj K R, Nagendra N S, Deepa Angadi
Keywords:
Random Forest, K Nearest Neighbors, XGBoost, MFCC features.
Abstract
Gender, Age and Accent recognition that is solely achieved by human speech which is an interesting subject in the field of Automatic Speech Recognition systems. In this project, we have introduced a way of classifying a human’s speech into different classes of Gender, Age and Accent. The classification used in this project is mainly based on Machine Learning Models. The Models are systematically trained on the audio data that is obtained from Mozilla Common Voice. The dataset only contained the audio files with the respective labels associated with it. The noisy data from the dataset have been filtered out before it is fed into the Machine Learning models. In our project, we are using Mel Frequency Cepstral Coefficients (MFCC). We are extending Accent classification into American, European and Indian. But, in our project “Voice based Age, Accent and Gender Recognition” we have extended our work to the classification of speech into Gender, Age and Accent.
Article Details
Unique Paper ID: 152296
Publication Volume & Issue: Volume 8, Issue 2
Page(s): 841 - 845
Article Preview & Download
Share This Article
Conference Alert
NCSST-2021
AICTE Sponsored National Conference on Smart Systems and Technologies
Last Date: 25th November 2021
SWEC- Management
LATEST INNOVATION’S AND FUTURE TRENDS IN MANAGEMENT