Logo do repositório
 
A carregar...
Miniatura
Publicação

Sharing ML models using IoT Communities of Interest based on Data Similarity and Location

Utilize este identificador para referenciar este registo.
Nome:Descrição:Tamanho:Formato: 
TGEO297_H.pdf3.53 MBAdobe PDF Ver/Abrir

Resumo(s)

The increasing number of Internet of Things devices, propelled by technological advancements and widespread Internet coverage, has led to an unprecedented surge in data generation. This influx of data, often characterized as Big Data, poses significant challenges in terms of handling, processing, and extracting meaningful insights. The rise of Artificial Intelligence is crucial for the managment of Big Data, establishing a foundation for the perfect symbiosis of Internet of Things and Artificial Intelligence. Internet of Things devices generate enough data to feed Machine Learning a subset of Artificial Intelligence. Machine Learning specializes in recognizing pat terns, discerning intricate trends and anomalies, streamlining data analysis through automation, and exhibiting scalability to effortlessly accommodate expanding data quantities. Machine Learning’s strengths in predictive analytics and real-time processing makes it highly suitable for prompt decision-making. Despite promising prospects, creating and deploying Machine Learning models in numerous heterogeneous Internet of Things devices and ecosystems presents a formidable and competitive task. Similarly, developing a universal ML model capable of encompassing all IoT devices worldwide is an impractical endeavor since each IoT device comes with its unique characteristics and functionalities, making a one-size fits-all model unfeasible. Therefore, this study proposes a novel solution in which Machine Learning models are shared among Internet of Things devices based on their similarity in purpose, domain, and context. This strategy leverages the concept of Communities of Interest within the Social Internet of Things framework. The main goal of this master thesis is to develop an efficient method for sharing Machine Learning models across Internet of Things devices. To achieve this, the research work proposes a novel approach focused on distributing Machine Learning models among Internet of Things Communities of Interest based on the similarity of Internet of Things data streams and geospatial components such as location and elevation. To validate this approach, the study adopted a cluster-based strategy to form Internet of Things Communities of Interest. Initially, a thorough similarity analysis of IoT weather sensor data streams was conducted using both Dynamic Time Warping and Spearman’s correlation methods. Evaluation of the similarity results revealed that Spearman’s correlation performed better than Dynamic Time Warping, producing higher-quality and more coherent clusters. Thus, the study proceeded with K-means clustering using the outcomes of Spearman’s correlation analysis and goespatial data to form clusters, guided by the optimal number of clusters, four, determined through the elbow method. These clusters formed the foundation for Internet of Things - Communities of Interest, essential for the development, validation, testing, and sharing of Machine Learning models. Evaluation of Machine Learning model performance during the sharing and testing phases revealed that the majority of the Machine Learning models performed better when trained, tested, and shared within the same Community of Interest dataset. On the contrary, models trained on a different Community of Interest exhibited poorer performance when tested on members of another Community of Interest. The findings of this study demonstrate that it is possible to delineate geospatial zones based on the inherent similarity of Internet of Things data streams, and to craft and validate Machine Learning models tailored to the unique characteristics of each zone. It also establishes that it possible to leverage geospatial components for sharing and reusing pre-trained Machine Learning models among Internet of Things devices.

Descrição

Dissertation submitted in partial fulfilment of the requirements for the Degree of Master of Science in Geospatial Technologies

Palavras-chave

Contexto Educativo

Citação

Projetos de investigação

Unidades organizacionais

Fascículo