DARO: An Auction-Based Multi-Agent Reinforcement Learning Framework for Task Scheduling in the Cloud Continuum

Journal

Published on:

Syed Mafooq Ul Hassan, Marios Touloupou and Herodotos Herodotou (Department of Electrical Engineering and Computer Science and Engineering, Cyprus University of Technology, Limassol, Cyprus);
Jacopo Castellini, Pablo Strasser and Alexandros Kalousis (Haute Ecole de Gestion de Genève, University of Applied Sciences and Arts Western Switzerland (HES-SO), Carouge, Geneva, Switzerland)

The increasing heterogeneity and dynamism of cloud–edge-IoT infrastructures demand scalable, intelligent scheduling strategies that go beyond traditional centralised approaches. This paper presents DARO, a distributed, asynchronous scheduling framework that leverages multi-agent reinforcement learning (MARL) to enable autonomous, cooperative task allocation across the cloud continuum. DARO integrates natively with Kubernetes, introducing a decentralised, auction-based mechanism in which node-local agents submit bids for incoming tasks based on partial observations and a learned policy. The agents are trained using a value decomposition method (QMIX) under the centralised training-decentralised execution paradigm. We formalise the problem as a decentralised partially observable Markov decision process (Dec-POMDP), design a multi-factor reward function to guide learning, and implement the system within a high-fidelity Kubernetes simulator. The experimental evaluation demonstrates that the agents effectively learn balanced and resource-efficient task placement strategies. DARO demonstrates strong potential to serve as a robust scheduling layer for dynamic and large-scale distributed environments.

 
Hassan, S. M. U., Touloupou, M., Castellini, J., Strasser, P., Kalousis, A. and Herodotou, H. (2026). DARO: An Auction-Based Multi-Agent Reinforcement Learning Framework for Task Scheduling in the Cloud Continuum. In Proceedings of the 16th International Conference on Cloud Computing and Services Science - CLOSER; ISBN 978-989-758-829-7; ISSN 2184-5042, SciTePress, pages 293-300.

DOI: 10.5220/0014920000004039