Please use this identifier to cite or link to this item: http://hdl.handle.net/2122/14721
Authors: Campanile, Lelio* 
Cantiello, Pasquale* 
Iacono, Mauro* 
Lotito, Roberta* 
Marulli, Fiammetta* 
Mastroianni, Michele* 
Title: Applying Machine Learning to Weather and Pollution Data Analysis for a Better Management of Local Areas: The Case of Napoli, Italy
Journal: Proceedings of the 6th International Conference on Internet of Things, Big Data and Security 
Series/Report no.: / (2021)
Publisher: SciTePress
Issue Date: Apr-2021
DOI: 10.5220/0010540003540363
Keywords: Air Quality
Forecasting
Machine Learning
Regression
Data Analysis
Campania
Abstract: Local pollution is a problem that affects urban areas and has effects on the quality of life and on health conditions. In order to not develop strict measures and to better manage territories, the national authorities have applied a vast range of predictive models. Actually, the application of machine learning has been studied in the last decades in various cases with various declination to simplify this problem. In this paper, we apply a regression-based analysis technique to a dataset containing official historical local pollution and weather data to look for criteria that allow forecasting critical conditions. The methods was applied to the case study of Napoli, Italy, where the local environmental protection agency manages a set of fixed monitoring stations where both chemical and meteorological data are recorded. The joining of the two raw dataset was overcome by the use of a maximum inclusion strategy as performing the joining action with ”outer” mode. Among the four different r egression models applied, namely the Linear Regression Model calculated with Ordinary Least Square (LN-OLS), the Ridge regression Model (Ridge), the Lasso Model (Lasso) and Supervised Nearest Neighbors Regression (KNN), the Ridge regression model was found to better perform with an R2 (Coefficient of Determination) value equal to 0.77 and low value for both MAE (Mean Absolute Error) and MSE (Mean Squared Error), equal to 0.12 and 0.04 respectively.
Appears in Collections:Article published / in press

Files in This Item:
File Description SizeFormat
2021_IoTBDS_AI_envsensing_CR.pdfFull text paper4.04 MBAdobe PDFView/Open
Show full item record

Page view(s)

263
checked on Apr 17, 2024

Download(s)

149
checked on Apr 17, 2024

Google ScholarTM

Check

Altmetric