A Study on Data Cleaning Techniques for Large Datasets

  • Unique Paper ID: 180995
  • Volume: 12
  • Issue: 1
  • PageNo: 3513-3514
  • Abstract:
  • Abstract—Data cleaning is an essential phase in the data preparation process, especially when working with large datasets. These datasets include often missing value, duplicate records, noise and anomalies that must be addressed for reliable analysis and decision making. This research examines several types of data cleaning techniques, including missing value copying, deduction, external identification and generalization, and evaluating their application on a large-scale dataset. The paper also reviews modern equipment and outlines that facilitates automatic and scalable data cleaning

Copyright & License

Copyright © 2025 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{180995,
        author = {Divya Kashid},
        title = {A Study on Data Cleaning Techniques for Large Datasets},
        journal = {International Journal of Innovative Research in Technology},
        year = {2025},
        volume = {12},
        number = {1},
        pages = {3513-3514},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=180995},
        abstract = {Abstract—Data cleaning is an essential phase in the data preparation process, especially when working with large datasets. These datasets include often missing value, duplicate records, noise and anomalies that must be addressed for reliable analysis and decision making. This research examines several types of data cleaning techniques, including missing value copying, deduction, external identification and generalization, and evaluating their application on a large-scale dataset. The paper also reviews modern equipment and outlines that facilitates automatic and scalable data cleaning},
        keywords = {},
        month = {June},
        }

Cite This Article

  • ISSN: 2349-6002
  • Volume: 12
  • Issue: 1
  • PageNo: 3513-3514

A Study on Data Cleaning Techniques for Large Datasets

Related Articles