Scalable Load Balancing with Distributed Algorithms and the Packing Model
Vinicius Freitas
17 December 2019, 10:30 Salle/Bat : 465/PCRI-N
Contact :
Activités de recherche : High-performance computing
Résumé :
Load imbalance is a recurring problem in High Performance Computing (HPC), which leads to suboptimal performance via the under-use of available resources. As computing systems grow larger, resource management and load balancing become a costly process, especially for dynamic applications that demand periodical workload balance. With this in mind, we believe that future generation load balancing algorithms should look towards scaling along computing systems. In order to express solutions to the aforementioned issues, we propose a distributed scheduling model based on large-scale parallel machines and HPC systems. Additionally, we present a task packing model to minimize the decision costs of distributed algorithms, and present two scheduling strategies implemented in the Charm++ runtime system that use said model.