Theses

A COMPARATIVE STUDY ON PERFORMANCE OF IOT-DRIVEN ML-ENABLED FORECASTING MODELS FOR EFFICIENT AIR QUALITY MONITORING

Bara Ksiksi, United Arab Emirates UniversityFollow

Date of Award

4-2026

Document Type

Thesis

DOI

https://doi.org/10.83048/enhy-2714

Degree Name

Master of Science in Internet of Things

Department

Computer and Network Engineering

First Advisor

Bassem Mokhtar

Second Advisor

Parag Kulkarni

Abstract

Air pollution is one of the most critical environmental challenges affecting public health globally, responsible for approximately 4.2 million premature deaths annually according to the World Health Organisation. This thesis presents a comparative study of IoT-driven machine learning forecasting models for air quality monitoring in Abu Dhabi, UAE, introducing a zonal approach combined with satellite-based spatial validation. The primary objective is to evaluate forecasting performance across three distinct activity zones using ground station data from the Environment Agency Abu Dhabi (EAD), and to incorporate a spatial validation component using satellite imagery to assess the consistency of ground-based predictions at a larger scale. The pollution parameters examined include particulate matter (PM10) and gaseous pollutants comprising ozone (O3), nitrogen dioxide (NO2), sulfur dioxide (SO2), and carbon monoxide (CO). The study follows a four-scenario progressive framework applied across three monitoring zones: Hamdan Street (urban), Liwa (regional), and Mussafah (industrial). Using the EAD pollution dataset spanning 2007 to 2012, each scenario introduces incremental improvements including a long-horizon baseline, seasonal hybridisation, horizon reduction, and multivariate optimisation. Four models were evaluated: Autoregressive Integrated Moving Average (ARIMA), Seasonal ARIMA (SARIMA), Long Short-Term Memory (LSTM), and SARIMA-LSTM hybrid. Results indicate that no single model was universally optimal, as performance was heavily dependent on the zone and pollutant combination. Using RMSE and R2 as primary evaluation metrics, the SARIMA-LSTM hybrid demonstrated superior performance for NO2 in the urban zone, while standalone ARIMA performed best for PM10 in the industrial zone. For certain pollutants, increased model complexity did not yield improved results, underscoring the importance of the zonal approach adopted in this study. Satellite imagery comparison revealed strong spatial correlation with ground station predictions, demonstrating the potential of integrating remote sensing with conventional ground-based approaches for more comprehensive air quality monitoring. The findings of this study contribute to the development of early warning systems and evidence-based policy decisions concerning air pollution through 24-hour ahead forecasting, while addressing a research gap in zone-specific forecasting frameworks for the UAE region.

Arabic Abstract

دراسة مقارنة في أداء نماذج التنبؤ بتعلم الآلة المدعومة بإنترنت الأشياء لمراقبة جودة الهواء بكفاءة

يُعد تلوث الهواء من أخطر القضايا البيئية التي تؤثر بشكل كبير على صحة الإنسان حول العالم، حيث يُقدَّر أنه يسبب حوالي 4.2 مليون حالة وفاة مبكرة سنويًا. تركز هذه الرسالة على تحسين نماذج التنبؤ بتلوث الهواء من خلال استخدام نهج قائم على تقسيم المناطق، بالإضافة إلى الاستفادة من بيانات الأقمار الصناعية للتحقق المكاني. الهدف الرئيسي هو دراسة هذا النهج باستخدام بيانات محطات الرصد الأرضية، وإضافة بُعد مكاني عبر صور الأقمار الصناعية من أجل تحسين دقة النتائج. تعتمد الدراسة على أربع مراحل رئيسية، مع التركيز على ثلاث مناطق مختلفة.

تشمل هذه المراحل: إنشاء نموذج أساسي للتنبؤ طويل المدى، ثم إدخال التأثيرات الموسمية باستخدام نماذج هجينة، ثم تقليل فترة التنبؤ، وأخيرًا تحسين النماذج باستخدام عدة متغيرات. وقد تم استخدام نماذج ARIMA وARIMA-LSTM وSARIMA-LSTM و LSTM، مع تطبيقها على مناطق شارع حمدان، وليوا، ومصفح. أظهرت النتائج أنه لا يوجد نموذج واحد هو الأفضل في جميع الحالات، بل يعتمد الأداء على نوع المنطقة ونوع الملوث. فعلى سبيل المثال، كان نموذج SARIMA-LSTM الأفضل لملوث ثاني أكسيد النيتروجين (NO₂) في المنطقة الحضرية، بينما كان نموذج ARIMA هو الأفضل لملوث PM₁₀ في المنطقة الصناعية.

كما أظهرت الدراسة أن تعقيد النموذج ليس دائمًا العامل الأهم، مما يؤكد أهمية تقسيم المناطق في تحسين النتائج. وعند مقارنة نتائج الأقمار الصناعية مع بيانات المحطات الأرضية، وُجد توافق كبير بينهما، مما يدل على إمكانية دمج الطريقتين للحصول على نتائج أكثر دقة في المستقبل.

تُسهم هذه الرسالة في تحسين أنظمة الإنذار المبكر ودعم اتخاذ القرار في ما يتعلق بتلوث الهواء، خاصة من خلال التنبؤ لمدة 24 ساعة مسبقًا. كما تعالج الدراسة بعض الفجوات البحثية، حيث إن العديد من الدراسات تركز على مناطق حضرية عامة، دون الاهتمام باختلاف طبيعة المناطق أو دمج البيانات المكانية مع البيانات الأرضية.

Recommended Citation

Ksiksi, Bara, "A COMPARATIVE STUDY ON PERFORMANCE OF IOT-DRIVEN ML-ENABLED FORECASTING MODELS FOR EFFICIENT AIR QUALITY MONITORING" (2026). Theses. 1422.
https://scholarworks.uaeu.ac.ae/all_theses/1422

Download

Included in

Computer Sciences Commons, Electrical and Computer Engineering Commons

COinS