Seminar Topics

NB! Seminar topics will be updated here at the start of the first seminar

Topics of Chinmaya Dehury

email: chinmaya.dehury@ut.ee

C1: A short review on clustering algorithms for edge computing environment

Edge computing refers to bringing computing capacity to the edge of the network. Edge devices are equipped with limited computing capacity. When we talk about edge computing, we need to think about the characteristics of edge data, edge resources, edge devices and more. Clustering strategies in IoT and wireless communications, on the other hand, are applied mostly to reduce energy consumption and improve network resource utilisation. In this topic, the student needs to go through the algorithms/strategies that are specifically designed for clustering edge data, edge devices or users.

The student needs to answer the following:

Which clustering algorithms exist in computer science, e.g. K-Means, Affinity Propagation (AP), Mean-shift, Spectral clustering, etc.?
What are the general working principles of some (2-3) popular clustering algorithms?
How do the clustering algorithms compare (features, applicability, limitations, advantages, whether they are lightweight, scalable, reliable, adaptive and dynamic, etc.)?
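
As a starting point for the second question, the working principle of K-Means can be sketched in a few lines of plain Python. This is only an illustrative sketch of Lloyd's algorithm and is not taken from the articles listed below; the function names (`sq_dist`, `nearest`, `kmeans`), the parameters and the toy points are assumptions made for the example.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(point, centroids):
    """Index of the centroid closest to the given point."""
    return min(range(len(centroids)), key=lambda i: sq_dist(point, centroids[i]))


def kmeans(points, k, max_iter=100, seed=0):
    """Cluster points into k groups; returns (centroids, labels)."""
    rng = random.Random(seed)
    # Initialisation: pick k of the input points as starting centroids.
    centroids = rng.sample(points, k)
    for _ in range(max_iter):
        # Assignment step: attach every point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Update step: move each centroid to the mean of its members
        # (an empty cluster keeps its previous centroid).
        updated = [
            tuple(sum(axis) / len(members) for axis in zip(*members))
            if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
        if updated == centroids:  # converged: nothing moved
            break
        centroids = updated
    labels = [nearest(p, centroids) for p in points]
    return centroids, labels


# Toy data: two visibly separated groups of 2-D points.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, labels = kmeans(points, k=2)
print(labels)  # one cluster index per input point
```

The sketch uses no external libraries, which makes it easy to reason about memory and CPU cost when discussing whether an algorithm is lightweight enough for edge devices.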

Some articles to start investigating:

Clustering Algorithms on Low-Power and High-Performance Devices for Edge Computing Environments, https://www.mdpi.com/1424-8220/21/16/5395
Task Classification and Scheduling Based on K-Means Clustering for Edge Computing, https://doi.org/10.1007/s11277-020-07343-w
A Comprehensive Survey of Clustering Algorithms, https://doi.org/10.1007/s40745-015-0040-1

C2: [TAKEN] X Discovery: A short survey

A discoverability mechanism lets an entity X on a network be discovered. The entity X can be a service, a device, a network or knowledge. For instance, a service discovery protocol is used by a client device to find out about the services it can use on a server device. Edge computing, on the other hand, allows data generated by IoT devices or clients to be processed by the nearest computing device at the periphery of the network. This topic focuses on the study of discovery systems in edge computing, especially device, service, and knowledge discovery at the edge. Some of the questions the student needs to focus on are:

What is meant by X discoverability?
How does X discovery work at the edge, where X can be a device, a service, or knowledge?

Some of the references to start with:

Device Discovery in D2D Communication: A Survey (https://ieeexplore.ieee.org/abstract/document/8835011)
Service Discovery (https://www.dfki.de/~klusch/i2s/SD_essay_klusch2013.pdf)
Towards Service Discovery and Invocation in Data-Centric Edge Networks (https://ieeexplore.ieee.org/abstract/document/8888081)
Collaborative Learning-Based Industrial IoT API Recommendation for Software-Defined Devices: The Implicit Knowledge Discovery Perspective (https://ieeexplore.ieee.org/abstract/document/9208715)

C3: [TAKEN] A tutorial on service discovery mechanisms with Serf

Serf, from HashiCorp, is a tool for cluster membership, failure detection, and orchestration that is decentralized, fault-tolerant and highly available. It is extremely lightweight: it uses 5 to 10 MB of resident memory and primarily communicates using infrequent UDP messages. In this topic, the student creates a tutorial on how to use Serf. The tutorial should cover the following steps:

Create a Flask app that simply prints "hello world" together with the container ID/IP
Containerize the Flask app
- inside the container, also install Serf
Run multiple such containers
Demonstrate the use of Serf in forming a cluster
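
The first step could look roughly like the sketch below. It assumes that the container keeps Docker's default hostname, which is the short container ID; the file name `hello.py`, the function names and the port are assumptions for illustration only. Flask is imported inside the factory so that the greeting helper can be tried out even where Flask is not installed.

```python
import socket


def hello_message():
    """Build the greeting with the container ID (hostname) and IP address."""
    # Inside a Docker container the hostname defaults to the short container ID.
    hostname = socket.gethostname()
    try:
        ip = socket.gethostbyname(hostname)
    except OSError:
        # Name resolution can fail in minimal environments.
        ip = "unknown"
    return f"hello world from {hostname} ({ip})"


def create_app():
    """Application factory returning the Flask app."""
    from flask import Flask  # lazy import: only needed when serving

    app = Flask(__name__)

    @app.route("/")
    def index():
        return hello_message()

    return app
```

If the file is saved as `hello.py`, the app can be started with `flask --app hello run --host 0.0.0.0`, so that it is reachable from outside the container. Running several such containers then gives each one a different greeting, which makes it easy to see which cluster member answered.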

The best place to start is the official documentation: https://www.serf.io/intro/index.html

C4: [TAKEN] What is observability and a comparison of observability tools

Observability primarily deals with measuring and understanding the internal states of a system. For this purpose, administrators primarily rely on "logs", "metrics" and "traces". In this topic, the student's responsibility is to dig further into the concept of observability. The student also needs to find the tools that are used for observability.

Research Questions:

What are the different observability techniques?
How can such techniques be optimized for resource-constrained devices?

Some references to start with:

A Survey on Observability of Distributed Edge & Container-Based Microservices, 10.1109/ACCESS.2022.3193102, https://ieeexplore.ieee.org/document/9837035

C5: [TAKEN] What is Edge Intelligence for a non-AI expert?

Edge Intelligence (EI) is often seen as the intersection of Edge Computing (EC) and Artificial Intelligence (AI), leading to two primary interpretations: (a) the deployment of AI algorithms or models on edge infrastructure, and (b) the use of AI/ML techniques to manage the resources of edge infrastructure. However, for someone without expertise in AI, or for those seeking to understand EI beyond the simple convergence of EC and AI, we need to dig deeper. What does “intelligence” mean in a broader sense? What does it signify in the context of edge infrastructure, such as in the smart city framework? Can we say edge devices are intelligent, and if so, what justifies this characterization? These are the questions that the student needs to explore to gain a comprehensive understanding of Edge Intelligence.

Some articles to start investigating:

Edge Intelligence: The Confluence of Edge Computing and Artificial Intelligence, 10.1109/JIOT.2020.2984887, https://ieeexplore.ieee.org/document/9052677/
Disclosing Edge Intelligence: A Systematic Meta-Survey, 10.3390/bdcc7010044, https://www.mdpi.com/2504-2289/7/1/44

C6: Light-weight open-source edge orchestrator

An edge orchestrator is responsible for managing and coordinating tasks, applications, and resources within the edge infrastructure. For instance, KubeEdge allows for managing the lifecycle of containerized applications on edge devices. In this topic, students should explore existing open-source orchestration tools, compare them, assess their capabilities and limitations, and identify any missing features.

Research Questions to investigate:

What are the essential capabilities/features expected from an edge orchestrator?
Which open-source orchestration tools are available?
What capabilities do these tools offer?
What capabilities are missing or need improvement in these tools?

Some references to start with:

To understand edge computing:
- Management and Orchestration of Edge Computing for IoT: A Comprehensive Survey, https://doi.org/10.1109/JIOT.2023.3245611, https://ieeexplore.ieee.org/abstract/document/10045724
Some edge computing-related tools:
- https://github.com/qijianpeng/awesome-edge-computing
- Oakestra: A Lightweight Hierarchical Orchestration Framework for Edge Computing, https://www.usenix.org/conference/atc23/presentation/bartolomeo

Your topic

You may come up with your own topic.

Topics of Pelle Jakovits

email: jakovits@ut.ee

Topic P.1. Synthetic data generators for generic real-time realistic Smart City data

The goal of this topic is to study the approaches for synthetic IoT data generation for emulating the behavior of industrial and Smart City IoT devices. The second goal is to evaluate existing state-of-the-art synthetic data generation tools and IoT data anonymization approaches.

Research questions:

What are the scalable solutions for generating IoT data that can mimic the behavior of real-world IoT data streams?
Are there any automated solutions for generating synthetic IoT data based on real data that do not expose the content of the real data?
What are the limitations of existing tools and approaches? For example: can they be used on all possible types of IoT or Smart City data?
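
To make the idea of synthetic IoT data concrete, the sketch below shows about the simplest possible rule-based generator: a daily sine cycle plus Gaussian noise, standing in for an outdoor temperature sensor. It is not one of the state-of-the-art tools this topic asks about; the function name and all parameter values are assumptions chosen for illustration.

```python
import math
import random


def synthetic_temperature(n, seed=42, base=15.0, amplitude=5.0,
                          noise_sd=0.5, period=24):
    """Return n hourly readings: base level + daily cycle + sensor noise."""
    rng = random.Random(seed)  # fixed seed makes the stream reproducible
    readings = []
    for hour in range(n):
        # Smooth daily pattern with the given period (in hours).
        daily = amplitude * math.sin(2 * math.pi * hour / period)
        # Gaussian noise imitates measurement jitter.
        readings.append(round(base + daily + rng.gauss(0, noise_sd), 2))
    return readings


# Two simulated days of hourly readings.
print(synthetic_temperature(48)[:5])
```

A generator like this scales trivially but captures none of the correlations, outages or drift seen in real data streams, which is exactly the gap the tools studied in this topic try to close.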