email spam filtering with machine learning

  • Unique Paper ID: 166692
  • Volume: 11
  • Issue: 2
  • PageNo: 1638-1646
  • Abstract:
  • Email spam, often known as junk email, comprises unsolicited and irrelevant messages sent in bulk. These emails can range from promotional content to malicious links and phishing attempts, posing significant risks to recipients. The adverse effects of spam are both social and economic, impacting individuals and organizations by reducing productivity and increasing security threats. It explores the application of machine learning techniques for effective spam detection. Machine learning, a subset of artificial intelligence, has demonstrated superior capabilities in identifying spam through pattern recognition and adaptation to new spam tactics. The study leverages various algorithms, including Naive Bayes, Support Vector Machines (SVMs), decision trees, and deep learning approaches, to enhance the accuracy and scalability of spam filters. The methodology involves collecting and preprocessing data from multiple sources, including the Enron Email Dataset and the SpamAssassin Public Corpus. Feature extraction techniques such as Term Frequency-Inverse Document Frequency (TF-IDF) and N-grams are employed to distinguish spam from legitimate emails. The research addresses class imbalance through techniques like oversampling and Synthetic Minority Oversampling Technique (SMOTE). Evaluation of the developed models highlights Logistic Regression as an effective tool for binary classification in spam filtering. The results demonstrate a high accuracy rate, with significant potential for reducing false negatives and improving email security. This study underscores the importance of advanced machine learning approaches in mitigating the pervasive issue of email spam, aiming to enhance user experience and organizational productivity.

Cite This Article

  • ISSN: 2349-6002
  • Volume: 11
  • Issue: 2
  • PageNo: 1638-1646

email spam filtering with machine learning

Related Articles