Search
Workshop

Reparametrization invariance in Bayesian approximations

  • Søren Hauberg
E1 05 (Leibniz-Saal)

Abstract

Bayesian analysis of deep neural networks has largely been unsuccessful, often resulting in significant under- or over-fitting. We argue that this is due to inappropriate treatment of the overparametrization that trivially follows from increasing network depth. We give a geometric characterization of overparametrization and show that respecting this geometric structure results in significant improvements to Bayesian approximations. We further discuss the numerical aspects.

Antje Vandenberg

Max Planck Institute for Mathematics in the Sciences Contact via Mail

Michael Bleher

University of Heidelberg & STRUCTURES

Freya Jensen

University of Heidelberg & STRUCTURES

Levin Maier

University of Heidelberg & STRUCTURES

Diaaeldin Taha

Max Planck Institute for Mathematics in the Sciences

Anna Wienhard

Max Planck Institute for Mathematics in the Sciences