SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge
https://doi.org/10.48550/arXiv.2302.04852
[Preprint]
Conference Paper | Published | English | Scopus indexed
Author
Nikdan, Mahdi (ISTA);
Pegolotti, Tommaso;
Iofinova, Eugenia (ISTA);
Kurtic, Eldar (ISTA);
Alistarh, Dan-Adrian (ISTA)
Series Title
PMLR
Abstract
We provide an efficient implementation of the backpropagation algorithm, specialized to the case where the weights of the neural network being trained are sparse. Our algorithm is general, as it applies to arbitrary (unstructured) sparsity and common layer types (e.g., convolutional or linear). We provide a fast vectorized implementation on commodity CPUs, and show that it can yield speedups in end-to-end runtime experiments, both in transfer learning using already-sparsified networks, and in training sparse networks from scratch. Thus, our results provide the first support for sparse training on commodity hardware.
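The abstract's central idea lends itself to a short illustration. Below is a minimal sketch in Python/SciPy (our own illustration, not the paper's vectorized CPU implementation) of sparse backpropagation through a linear layer: with the weight matrix W stored in CSR format, the input gradient is a sparse transposed matrix-vector product, and the weight gradient is materialized only at W's nonzero positions, so both steps cost O(nnz(W)) rather than O(rows × cols). All function and variable names below are ours, not the paper's.

```python
# Minimal sketch of sparse backpropagation for a linear layer y = W @ x,
# where W is stored in CSR format. Illustrative only: SparseProp itself is
# a fast vectorized CPU implementation; the names here are hypothetical.
import numpy as np
import scipy.sparse as sp

def sparse_linear_backward(W_csr, x, grad_y):
    """Backward pass restricted to W's nonzero entries.

    grad_x = W^T @ grad_y             -- sparse mat-vec, O(nnz(W))
    grad_W[i, j] = grad_y[i] * x[j]   -- only where W[i, j] != 0
    """
    grad_x = W_csr.T @ grad_y
    # Weight gradient in the same CSR layout as W: for each row i, fill
    # the entries at W's nonzero columns with grad_y[i] * x[those columns].
    grad_W_data = np.empty_like(W_csr.data)
    for i in range(W_csr.shape[0]):
        start, end = W_csr.indptr[i], W_csr.indptr[i + 1]
        grad_W_data[start:end] = grad_y[i] * x[W_csr.indices[start:end]]
    grad_W = sp.csr_matrix((grad_W_data, W_csr.indices, W_csr.indptr),
                           shape=W_csr.shape)
    return grad_x, grad_W

# Toy usage: a 95%-sparse 256x512 layer.
rng = np.random.default_rng(0)
W = sp.random(256, 512, density=0.05, format="csr", random_state=0)
x = rng.standard_normal(512)
grad_y = rng.standard_normal(256)
grad_x, grad_W = sparse_linear_backward(W, x, grad_y)

# Sanity check against the dense computation.
assert np.allclose(grad_x, W.toarray().T @ grad_y)
assert np.allclose(grad_W.toarray(), np.outer(grad_y, x) * (W.toarray() != 0))
```

The per-row Python loop stands in for what the paper implements as vectorized CPU code; the point is that both backward products touch only the stored nonzeros, which is where the claimed end-to-end speedups on already-sparsified networks come from.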
Publishing Year
2023
Date Published
2023-07-30
Proceedings Title
Proceedings of the 40th International Conference on Machine Learning
Publisher
ML Research Press
Acknowledgement
We would like to thank Elias Frantar for his valuable assistance and support at the outset of this project, and the anonymous ICML and SNN reviewers for very constructive feedback. EI was supported in part by the FWF DK VGSCO, grant agreement number W1260-N35. DA acknowledges generous ERC support, via Starting Grant 805223 ScaleML.
Volume
202
Page
26215-26227
Conference
ICML: International Conference on Machine Learning
Conference Location
Honolulu, HI, United States
Conference Date
2023-07-23 – 2023-07-29
Cite this
Nikdan M, Pegolotti T, Iofinova EB, Kurtic E, Alistarh D-A. SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge. In: Proceedings of the 40th International Conference on Machine Learning. Vol 202. ML Research Press; 2023:26215-26227.
Nikdan, M., Pegolotti, T., Iofinova, E. B., Kurtic, E., & Alistarh, D.-A. (2023). SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge. In Proceedings of the 40th International Conference on Machine Learning (Vol. 202, pp. 26215–26227). Honolulu, HI, United States: ML Research Press.
Nikdan, Mahdi, Tommaso Pegolotti, Eugenia B. Iofinova, Eldar Kurtic, and Dan-Adrian Alistarh. “SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks at the Edge.” In Proceedings of the 40th International Conference on Machine Learning, 202:26215–27. ML Research Press, 2023.
M. Nikdan, T. Pegolotti, E. B. Iofinova, E. Kurtic, and D.-A. Alistarh, “SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge,” in Proceedings of the 40th International Conference on Machine Learning, Honolulu, HI, United States, 2023, vol. 202, pp. 26215–26227.
Nikdan M, Pegolotti T, Iofinova EB, Kurtic E, Alistarh D-A. 2023. SparseProp: Efficient sparse backpropagation for faster training of neural networks at the edge. Proceedings of the 40th International Conference on Machine Learning. ICML: International Conference on Machine Learning, PMLR, vol. 202, 26215–26227.
Nikdan, Mahdi, et al. “SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks at the Edge.” Proceedings of the 40th International Conference on Machine Learning, vol. 202, ML Research Press, 2023, pp. 26215–27.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Open Access
Sources
arXiv 2302.04852