Air Quality Prediction Using Machine Learning Algorithm in Maharashtra

  • Unique Paper ID: 182540
  • Volume: 12
  • Issue: 2
  • PageNo: 2440-2442
  • Abstract:
  • Air pollution poses a severe and multifaceted risk to public health, economic productivity, and environmental sustainability, particularly in rapidly urbanizing regions such as Maharashtra, India. This research presents a comprehensive comparison of classical and ensemble machine learning algorithms—for example, Linear Regression, Random Forest, and XGBoost—in forecasting the Air Quality Index (AQI) across three major metropolitan areas: Mumbai, Pune, and Nagpur. The dataset comprises hourly pollutant concentrations (PM2.5, PM10, O3, NO2, CO, SO2) and meteorological variables (temperature, humidity, wind speed, rainfall) collected via Mendeley Data and the CPCB API over a two-year period. Key steps include rigorous data preprocessing, advanced feature engineering—including lag and interaction terms—and systematic hyperparameter tuning with five-fold cross-validation. Model performance is evaluated using RMSE, MAE, and R² metrics. Results indicate that XGBoost consistently yields superior predictive accuracy (R² up to 0.92), while Random Forest offers robust interpretability through feature importance analysis. City-specific findings reveal that PM2.5 and NO2 are the dominant drivers of AQI variation in Mumbai, whereas meteorological factors play a larger role in Pune and Nagpur. These insights can guide targeted mitigation strategies and inform data-driven policy development.

Cite This Article

  • ISSN: 2349-6002
  • Volume: 12
  • Issue: 2
  • PageNo: 2440-2442

Air Quality Prediction Using Machine Learning Algorithm in Maharashtra

Related Articles