Khanaum, Mosammat Mustari and Borhan, Md Saidul and Ferdoush, Farzana and Russel, Mohammed Ali Nause and Murshed, Mustafa (2023) Unveiling the Predictive Capabilities of Machine Learning in Air Quality Data Analysis: A Comparative Evaluation of Different Regression Models. Open Journal of Air Pollution, 12 (04). pp. 142-159. ISSN 2169-2653
ojap_2023120810325333.pdf - Published Version
Abstract
Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), an index widely adopted by the US Environmental Protection Agency (EPA), is a key metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We fitted all four models using a machine learning approach and compared them using confusion matrices and error percentages. The prediction error rates were 22%, 23%, 20%, and 27% for the LDA, QDA, logistic regression, and KNN models, respectively. With the lowest error rate, the logistic regression model outperformed the other three statistical models in predicting AQI. Understanding how these models perform can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders such as environmental regulators, healthcare professionals, urban planners, and researchers.
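As a rough illustration of the comparison described in the abstract, the sketch below derives a prediction error rate from a confusion matrix and picks the model with the lowest rate. It is not the authors' code: the function name `error_rate`, the model labels, and the counts are all hypothetical, chosen only to show the arithmetic (error rate = misclassified observations / total observations).

```python
def error_rate(confusion):
    """Return the misclassification rate of a square confusion matrix.

    Rows are true classes and columns are predicted classes, so the
    diagonal holds the correctly classified counts.
    """
    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix has no observations")
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return (total - correct) / total


# Hypothetical two-class counts for illustration only (NOT from the paper).
example_matrices = {
    "model_a": [[40, 10], [10, 40]],  # 20 of 100 misclassified
    "model_b": [[35, 15], [12, 38]],  # 27 of 100 misclassified
}

# Compute each model's error rate and select the smallest one.
rates = {name: error_rate(m) for name, m in example_matrices.items()}
best = min(rates, key=rates.get)

for name, rate in rates.items():
    print(f"{name}: {rate:.0%} error")
print("lowest error:", best)
```

Under the same logic, an error rate of 20% against 22%, 23%, and 27% is what would single out logistic regression as the best performer in the reported comparison.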
| Item Type: | Article |
|---|---|
| Subjects: | Pustakas > Geological Science |
| Depositing User: | Unnamed user with email support@pustakas.com |
| Date Deposited: | 13 Dec 2023 11:17 |
| Last Modified: | 13 Dec 2023 11:17 |
| URI: | http://archive.pcbmb.org/id/eprint/1754 |