Audio Classification of Cats and Dogs using Python

  • Unique Paper ID: 179134
  • Volume: 11
  • Issue: 12
  • PageNo: 5366-5371
  • Abstract:
  • This paper presents a novel edge-optimized deep learning framework for real-time classification of cat and dog vocalizations, addressing key challenges in residential audio monitoring. Leveraging a MobileNetV2-inspired CNN architecture trained on MFCC features (Davis & Mermelstein, 1980), our solution achieves 94.32% accuracy (F1=0.93) while reducing model size by 43% compared to ResNet-18 baselines. The pipeline incorporates: • Robust preprocessing: Noise filtering + adaptive segmentation • Targeted augmentation: Time stretching (±20%) and pitch shifting (±2 semitones) • Edge deployment: <3s inference on Raspberry Pi (validated via stratified cross-validation) Outperforming SVM approaches by 12.7% (p<0.01), this work enables practical applications in smart pet care and veterinary acoustics. Future extensions will explore IoT integration and multi-species classification.

Cite This Article

  • ISSN: 2349-6002
  • Volume: 11
  • Issue: 12
  • PageNo: 5366-5371

Audio Classification of Cats and Dogs using Python

Related Articles