signSGD with majority vote is communication efficient and Byzantine fault tolerant
Jeremy Bernstein, Jiawei Zhao, Kamyar Azizzadenesheli & Anima Anandkumar
under review for ICLR '19
We show that when the parameter server aggregates gradient signs by majority vote, the resulting distributed optimisation scheme is both communication efficient and adversarially robust.
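As a rough illustration of the aggregation rule described above, here is a minimal sketch in plain Python. It is not the authors' implementation: the function names `sign`, `majority_vote` and `signsgd_step`, the learning rate, the toy gradients, and the choice to treat a tied vote as zero (no update) are all assumptions made for this example.

```python
def sign(x):
    """Return +1, -1 or 0 according to the sign of x."""
    return (x > 0) - (x < 0)


def majority_vote(worker_grads):
    """Elementwise majority vote over the signs of the workers' gradients."""
    # Each worker would send only the sign of every gradient component,
    # i.e. one bit per component instead of a full-precision float.
    sign_vectors = [[sign(g) for g in grad] for grad in worker_grads]
    # The server sums the signs per coordinate and keeps the sign of the sum.
    # Assumption: a tie yields 0, so that coordinate is left unchanged.
    return [sign(sum(column)) for column in zip(*sign_vectors)]


def signsgd_step(params, worker_grads, lr=0.01):
    """One parameter update using the voted sign vector (hypothetical helper)."""
    vote = majority_vote(worker_grads)
    return [p - lr * v for p, v in zip(params, vote)]


# Toy data (made up): three honest workers and one worker sending
# large gradients of the opposite sign.
honest = [[0.5, -2.0, 0.1], [0.3, -1.0, 0.2], [0.9, -0.4, 0.3]]
adversary = [[-100.0, 100.0, -100.0]]

vote = majority_vote(honest + adversary)
new_params = signsgd_step([1.0, 1.0, 1.0], honest + adversary, lr=0.1)
```

In this toy run the voted direction matches the honest workers' signs: because only signs are counted, the magnitude of the outlying worker's gradient gives it no more weight than any other single vote.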
signSGD: compressed optimisation for non-convex problems
Jeremy Bernstein, Yu-Xiang Wang, Kamyar Azizzadenesheli & Anima Anandkumar
ICML '18 long talk
We exploit the natural geometry of neural net error landscapes to develop an optimiser that converges as fast as SGD whilst providing cheap gradient communication for distributed training.