TY - GEN
T1 - Efficient Scheduling for Multi-Job Federated Learning Systems with Client Sharing
AU - Fu, Boqian
AU - Chen, Fahao
AU - Li, Peng
AU - Su, Zhou
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Federated Learning (FL) has emerged as a promising learning approch for data distributed across edge devices. Existing research mainly focuses on single-job FL systems. However, in practical scenarios, multiple FL jobs are often submitted simultaneously. Simply applying single-job optimizations to multi-job FL systems results in sub-optimal system performance. Specifically, we find considerably low resource utilization on the client side due to device heterogeneity. In this paper, we exploit opportunities in multi-job FL systems to improve resource utilization by client sharing: (1) clients not selected for one FL job could be allocated to another FL job, and (2) clients that complete their tasks early in one FL job could be preemptively assigned to another job. We propose an efficient scheduling algorithm for multi-job FL systems, namely GMFL. This scheduling algorithm promptly assigns an available job to a client as soon as it becomes available. To ensure training convergence, we carefully select jobs for each client while considering several constraints. We conduct experiments using four popular models across four different datasets to evaluate the performance of the proposed scheduling algorithm. Experimental results show that our proposed scheduling algorithm significantly outperforms existing methods, with a performance improvement of up to 2.03×.
AB - Federated Learning (FL) has emerged as a promising learning approch for data distributed across edge devices. Existing research mainly focuses on single-job FL systems. However, in practical scenarios, multiple FL jobs are often submitted simultaneously. Simply applying single-job optimizations to multi-job FL systems results in sub-optimal system performance. Specifically, we find considerably low resource utilization on the client side due to device heterogeneity. In this paper, we exploit opportunities in multi-job FL systems to improve resource utilization by client sharing: (1) clients not selected for one FL job could be allocated to another FL job, and (2) clients that complete their tasks early in one FL job could be preemptively assigned to another job. We propose an efficient scheduling algorithm for multi-job FL systems, namely GMFL. This scheduling algorithm promptly assigns an available job to a client as soon as it becomes available. To ensure training convergence, we carefully select jobs for each client while considering several constraints. We conduct experiments using four popular models across four different datasets to evaluate the performance of the proposed scheduling algorithm. Experimental results show that our proposed scheduling algorithm significantly outperforms existing methods, with a performance improvement of up to 2.03×.
KW - Federated learning
KW - scheduling algorithm
UR - https://www.scopus.com/pages/publications/85182607782
U2 - 10.1109/DASC/PiCom/CBDCom/Cy59711.2023.10361429
DO - 10.1109/DASC/PiCom/CBDCom/Cy59711.2023.10361429
M3 - 会议稿件
AN - SCOPUS:85182607782
T3 - 2023 IEEE International Conference on Dependable, Autonomic and Secure Computing, International Conference on Pervasive Intelligence and Computing, International Conference on Cloud and Big Data Computing, International Conference on Cyber Science and Technology Congress, DASC/PiCom/CBDCom/CyberSciTech 2023
SP - 891
EP - 898
BT - 2023 IEEE International Conference on Dependable, Autonomic and Secure Computing, International Conference on Pervasive Intelligence and Computing, International Conference on Cloud and Big Data Computing, International Conference on Cyber Science and Technology Congress, DASC/PiCom/CBDCom/CyberSciTech 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE International Conference on Dependable, Autonomic and Secure Computing, 2023 International Conference on Pervasive Intelligence and Computing, 2023 International Conference on Cloud and Big Data Computing, 2023 International Conference on Cyber Science and Technology Congress, DASC/PiCom/CBDCom/CyberSciTech 2023
Y2 - 14 November 2023 through 17 November 2023
ER -