The information bottleneck principle, a mathematical formulation of Occam's Razor, aims to create latent representations that are sufficient for a task and maximally compressed – a minimal sufficient statistic. In this talk, we first critically reflect on the application of the information bottleneck principle in deep learning, addressing the question of whether and how compression can be connected to generalization performance. We discuss theoretical, experimental, and engineering evidence in the form of non-vacuous generalization bounds, information plane analyses, and neural classifiers successfully trained using the information bottleneck principle. Taken together, these three perspectives suggest that compressed representations help improve generalization and robustness.
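As background (a standard formulation from the information bottleneck literature, not taken from the talk itself), the principle can be written as a Lagrangian that trades off compression of the input X against sufficiency for the task variable Y:

```latex
% Information bottleneck Lagrangian (standard formulation):
% find a stochastic encoder p(z|x) that compresses X into a
% representation Z while keeping Z informative about Y.
\min_{p(z \mid x)} \; I(X;Z) \;-\; \beta \, I(Z;Y), \qquad \beta \ge 0
```

Here larger values of β favor sufficiency (keeping task-relevant information) over compression.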
In the second, shorter part of the talk, we argue that the (variational) approaches used to implement the intractable information bottleneck objective can also be successfully used to implement other information-theoretic objectives. We illustrate this with the example of invariant representation learning for fair classification. We show that the resulting method has interesting and desirable properties, suggesting that information-theoretic objectives can be useful ingredients for deep learning.
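To make the variational approach concrete, the sketch below shows a widely used variational bound on the information bottleneck objective, in the style of the Deep Variational Information Bottleneck (Alemi et al.). It is a generic illustration assuming a Gaussian encoder and a standard-normal prior, not the specific method presented in the talk; all names and the `beta` parameter are illustrative.

```python
import torch
import torch.nn.functional as F

def reparameterize(mu, logvar):
    # Sample z = mu + sigma * eps with eps ~ N(0, I),
    # so gradients flow through the encoder parameters.
    eps = torch.randn_like(mu)
    return mu + torch.exp(0.5 * logvar) * eps

def vib_loss(mu, logvar, logits, labels, beta=1e-3):
    """Variational information bottleneck loss (illustrative sketch).

    mu, logvar : parameters of the Gaussian encoder q(z|x)
    logits     : decoder predictions from a sample z ~ q(z|x)
    labels     : task labels
    beta       : trade-off between compression and task performance
    """
    # Variational upper bound on I(X;Z): KL(q(z|x) || N(0, I)),
    # available in closed form for a diagonal Gaussian encoder.
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=1).mean()
    # Variational lower bound on I(Z;Y): cross-entropy of the decoder.
    ce = F.cross_entropy(logits, labels)
    return ce + beta * kl
```

Swapping the cross-entropy term for a different variational bound is what allows the same machinery to implement other information-theoretic objectives, such as the invariance constraints used for fair classification.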
REGISTER HERE
Bernhard C. Geiger (Know-Center GmbH, Graz, Austria)
Bernhard C. Geiger received the Dipl.-Ing. degree in Electrical Engineering (with distinction), the Dr. techn. degree in Electrical and Information Engineering (with distinction), and the venia docendi in Theoretical Information Engineering from Graz University of Technology, Austria, in 2009, 2014, and 2023, respectively. In 2010, he joined the Signal Processing and Speech Communication Laboratory, Graz University of Technology, as a Research and Teaching Associate. He was a Senior Scientist and Erwin Schrödinger Fellow at the Institute for Communications Engineering, Technical University of Munich, Germany, from 2014 to 2017. He is currently a Key Researcher at Know-Center GmbH, Graz, Austria, where he leads the research area on Methods & Algorithms for Artificial Intelligence. His research interests cover information theory for signal processing and machine learning, theory-assisted machine learning, and information-theoretic model reduction for Markov chains and hidden Markov models.
*********
All the recordings of past AI-Cafés are available on this YouTube channel.
*********