The FederatedAveraging Algorithm

The FederatedAveraging Algorithm#

The \(\texttt{FederatedSGD}\) Algorithm#

\[ w^k_{t+1} \leftarrow w_t - \alpha \nabla \ell(w_t; \mathcal{D}_k) \]

\[ w_{t+1} \leftarrow \sum_{k=1}^{K} \frac{n_k}{n} w^k_{t+1} \]

\[ n_k = |\mathcal{D}_k| \]

\[ n = \sum_{k=1}^{K} n_k \]

The \(\texttt{FederatedAveraging}\) Algorithm#

Algorithm 25 (Federated Averaging)

Input: \(K\) clients, the number of local epochs \(E\), the learning rate \(\alpha\), the batch size \(B\), fraction \(C\) of clients to sample, and the number of communication rounds \(T\)
Output: output a global model \(w\)

Server executes:

Initialize global model \(w_0\)
for each round \(t = 1, 2, \dots, T\) do
1. \(m \leftarrow \max(C \cdot K, 1)\)
2. \(S_t \leftarrow\) (random set of \(m\) clients)
3. for each client \(k \in S_t\) do
  1. \(w^k_{t+1} \leftarrow \text{ClientUpdate}(k, w_t)\)
4. \(n \leftarrow \sum_{k \in S_t} n_k\)
5. \(w_{t+1} \leftarrow \sum_{k \in S_t} \frac{n_k}{n} w^k_{t+1}\)

ClientUpdate(\(k, w\)):

\(\mathcal{B} \leftarrow\) (split \(\mathcal{P}_k\) into batches of size \(B\))
for \(e = 1, 2, \dots, E\) do
1. for batch \(b \in \mathcal{B}\) do
  1. \(w \leftarrow w - \alpha \nabla \ell(w; b)\)
return \(w\)

The FederatedAveraging Algorithm

Contents

The FederatedAveraging Algorithm#

The \(\texttt{FederatedSGD}\) Algorithm#

The \(\texttt{FederatedAveraging}\) Algorithm#