TY - GEN
T1 - Logarithmic pruning is all you need
AU - Orseau, Laurent
AU - Hutter, Marcus
AU - Rivasplata, Omar
N1 - Publisher Copyright:
© 2020 Neural information processing systems foundation. All rights reserved.
PY - 2021/7/1
Y1 - 2021/7/1
N2 - The Lottery Ticket Hypothesis is a conjecture that every large neural network contains a subnetwork that, when trained in isolation, achieves comparable performance to the large network. An even stronger conjecture has been proven recently: Every sufficiently overparameterized network contains a subnetwork that, at random initialization, but without training, achieves comparable accuracy to the trained large network. This latter result, however, relies on a number of strong assumptions and guarantees a polynomial factor on the size of the large network compared to the target function. In this work, we remove the most limiting assumptions of this previous work while providing significantly tighter bounds: the overparameterized network only needs a logarithmic factor (in all variables but depth) number of neurons per weight of the target subnetwork.
AB - The Lottery Ticket Hypothesis is a conjecture that every large neural network contains a subnetwork that, when trained in isolation, achieves comparable performance to the large network. An even stronger conjecture has been proven recently: Every sufficiently overparameterized network contains a subnetwork that, at random initialization, but without training, achieves comparable accuracy to the trained large network. This latter result, however, relies on a number of strong assumptions and guarantees a polynomial factor on the size of the large network compared to the target function. In this work, we remove the most limiting assumptions of this previous work while providing significantly tighter bounds: the overparameterized network only needs a logarithmic factor (in all variables but depth) number of neurons per weight of the target subnetwork.
UR - http://www.scopus.com/inward/record.url?scp=85104145548&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85104145548
SN - 9781713829546
T3 - Advances in Neural Information Processing Systems
SP - 2925
EP - 2924
BT - Advances in Neural Information Processing Systems 33
A2 - Larochelle, H.
A2 - Ranzato, M.
A2 - Hadsell, R.
A2 - Balcan, M. F.
A2 - Lin, H.
CY - San Diego, CA
T2 - 34th Conference on Neural Information Processing Systems, NeurIPS 2020
Y2 - 6 December 2020 through 12 December 2020
ER -