Most Activation Functions Can Win the Lottery Without Excessive Depth

Burkholz, Rebekka

(2022) Most Activation Functions Can Win the Lottery Without Excessive Depth.

In: NeurIPS 2022.

Conference: NeurIPS Conference on Neural Information Processing Systems

Abstract

The strong lottery ticket hypothesis has highlighted the potential for training deep neural networks by pruning, which has inspired interesting practical and theoretical insights into how neural networks can represent functions. For networks with ReLU activation functions, it has been proven that a target network of depth L can be approximated by a subnetwork of a randomly initialized neural network that has twice the target’s depth (2L) and is wider by a logarithmic factor. We show that a depth L + 1 network is sufficient. This result indicates that we can expect to find lottery tickets at realistic, commonly used depths while only requiring logarithmic overparametrization. Our novel construction approach applies to a large class of activation functions and is not limited to ReLUs.
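
As a toy illustration of what "approximation by pruning" can mean, the sketch below picks a subset of a few random weights whose sum comes close to a single target weight, with no training involved. It is a simplified assumption-laden example and not the construction from the paper: the brute-force search, the uniform initialization on [-1, 1], the pool size of 12, and the names `best_subset`, `random_weights`, and `target_weight` are all hypothetical choices made for illustration.

```python
import itertools
import random


def best_subset(weights, target):
    """Return (mask, error) for the subset of `weights` whose sum is closest to `target`.

    The mask is a tuple of 0/1 flags: 1 keeps a weight, 0 prunes it.
    Brute force over all 2**n masks, so only suitable for small n.
    """
    best_mask, best_err = None, float("inf")
    for mask in itertools.product((0, 1), repeat=len(weights)):
        total = sum(w for w, keep in zip(weights, mask) if keep)
        err = abs(total - target)
        if err < best_err:
            best_mask, best_err = mask, err
    return best_mask, best_err


# Hypothetical setup: 12 random weights (fixed seed) and one target weight.
rng = random.Random(0)
random_weights = [rng.uniform(-1.0, 1.0) for _ in range(12)]
target_weight = 0.5

mask, error = best_subset(random_weights, target_weight)
print(f"kept {sum(mask)} of {len(mask)} random weights, error = {error:.2e}")
```

The number of candidate subsets doubles with every extra random weight, which gives some intuition for why a modest amount of extra width can already offer many pruning choices. How the paper actually achieves depth L + 1 for general activation functions is not shown here.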

Item Type:	Conference or Workshop Item (A Paper) (Paper)
Divisions:	Rebekka Burkholz (RB)
Depositing User:	Rebekka Burkholz
Date Deposited:	12 Oct 2022 18:47
Last Modified:	13 Oct 2022 10:11
Primary Research Area:	NRA1: Trustworthy Information Processing
URI:	https://publications.cispa.saarland/id/eprint/3803
