← all papers · overview

Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices

Abstract

Federated fine-tuning offers a promising approach for tuning Large Language Models (LLMs) on edge devices while preserving data privacy. However, fine-tuning these models on edge devices remains challenging due to high memory, communication, and computational demands. Zero-order optimization with task alignment provides a potential solution, enabling fine-tuning with inference-level memory requirements but requires a longer convergence time. In this paper, we propose \ac{METHOD} that divides the network into two blocks, applying a different number of perturbations per block in a computationally effective way, achieving faster convergence. Our evaluation shows a 1.63×1.6-3\times reduction in computation overhead compared to zero-order state of the art techniques in federated learning.

Related papers

Ranked by semantic similarity — how closely each paper's abstract matches this one (100% = near-identical topic).