A Study on Data Cleaning Techniques for Large Datasets

  • Unique Paper ID: 180995
  • Volume: 12
  • Issue: 1
  • PageNo: 3513-3514
  • Abstract:
  • Abstract—Data cleaning is an essential phase in the data preparation process, especially when working with large datasets. These datasets include often missing value, duplicate records, noise and anomalies that must be addressed for reliable analysis and decision making. This research examines several types of data cleaning techniques, including missing value copying, deduction, external identification and generalization, and evaluating their application on a large-scale dataset. The paper also reviews modern equipment and outlines that facilitates automatic and scalable data cleaning

Cite This Article

  • ISSN: 2349-6002
  • Volume: 12
  • Issue: 1
  • PageNo: 3513-3514

A Study on Data Cleaning Techniques for Large Datasets

Related Articles