Function-Space Models for Deep Learning

Printable PDF

Department of Mathematics,
University of California San Diego

****************************

Math 278B: Mathematics of Information, Data, and Signals

Rahul Parhi

UCSD

Abstract:

Deep learning has been wildly successful in practice and most state-of-the-art artificial intelligence systems are based on neural networks. Lacking, however, is a rigorous mathematical theory that adequately explains the amazing performance of deep neural networks. In this talk, I present a new mathematical framework that provides the beginning of a deeper understanding of deep learning. This framework precisely characterizes the functional properties of trained neural networks. The key mathematical tools which support this framework include transform-domain sparse regularization, the Radon transform of computed tomography, and approximation theory. This framework explains the effect of weight decay regularization in neural network training, the importance of skip connections and low-rank weight matrices in network architectures, the role of sparsity in neural networks, and explains why neural networks can perform well in high-dimensional problems. At the end of the talk we shall conclude with a number of open problems and interesting research directions.

This talk is based on work done in collaboration with Rob Nowak, Ron DeVore, Jonathan Siegel, Joe Shenouda, and Michael Unser.

January 31, 2025

11:00 AM

APM 2402

Research Areas

Mathematics of Information, Data, and Signals

****************************

Department of Mathematics, University of California San Diego

Math 278B: Mathematics of Information, Data, and Signals

Rahul Parhi

UCSD