Logarithmic pruning is all you need

Laurent Orseau, Marcus Hutter, Omar Rivasplata

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

The Lottery Ticket Hypothesis is a conjecture that every large neural network contains a subnetwork that, when trained in isolation, achieves comparable performance to the large network. An even stronger conjecture has been proven recently: Every sufficiently overparameterized network contains a subnetwork that, at random initialization, but without training, achieves comparable accuracy to the trained large network. This latter result, however, relies on a number of strong assumptions and guarantees a polynomial factor on the size of the large network compared to the target function. In this work, we remove the most limiting assumptions of this previous work while providing significantly tighter bounds: the overparameterized network only needs a logarithmic factor (in all variables but depth) number of neurons per weight of the target subnetwork.

Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems 33
Subtitle of host publication34th Conference on Neural Information Processing Systems (NeurIPS 2020)
EditorsH. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, H. Lin
Place of PublicationSan Diego, CA
Pages2925-2924
Number of pages10
Publication statusPublished - 1 Jul 2021
Event34th Conference on Neural Information Processing Systems, NeurIPS 2020 - Virtual, Online
Duration: 6 Dec 202012 Dec 2020

Publication series

Name Advances in Neural Information Processing Systems
PublisherNeural information processing systems foundation
Volume33
ISSN (Print)1049-5258

Conference

Conference34th Conference on Neural Information Processing Systems, NeurIPS 2020
CityVirtual, Online
Period6/12/2012/12/20

Fingerprint

Dive into the research topics of 'Logarithmic pruning is all you need'. Together they form a unique fingerprint.

Cite this