Principled Scaling of Deep Neural Networks

Event Start
Event End
Location
Building 9, Level 4, Room 4225

Abstract

Neural networks have achieved impressive performance in many applications, such as image and speech recognition and generation. State-of-the-art performance is usually achieved through a series of engineered modifications to existing neural architectures and their training procedures. A common feature of these systems is their large-scale nature: modern neural networks typically contain billions, if not tens of billions, of trainable parameters, and empirical evaluations generally support the claim that increasing the scale of a neural network (e.g. its width and depth) improves model performance when done correctly. However, given a neural network model, it is not straightforward to answer the crucial question: how do we scale the network? In this talk, I will show how we can leverage different mathematical results to efficiently scale neural networks, with empirically confirmed benefits.


Brief Biography


Soufiane Hayou obtained his PhD in statistics in 2021 from Oxford, where he was advised by Arnaud Doucet and Judith Rousseau. He graduated from Ecole Polytechnique in Paris before joining Oxford. During his PhD, he worked mainly on the theory of randomly initialized infinite-width neural networks, including the impact of hyperparameters on how 'geometric' information propagates inside the network. He is currently a Researcher at the Simons Institute for the Theory of Computing, on leave from his Peng Tsu Ann Assistant Professorship in mathematics at the National University of Singapore. His current research focuses on the theory and practice of scaling neural networks.