Reading Group

Get ready for Winter Semester 2025/26!
In the second half of this semester, Craig Dickson will host a reading group on Technical Alignment in AI. Join us for discussions about state of the art research on aligning large language models, including RLHF, constitutional AI, adversarial training, goal misgeneralization, and more!

Past Papers

Variety of Machine Learning in Healthcare

by Tillmann Rheude und Leonhard Kohleick, 2025

A tour through the variety of machine learning in healthcare, from images and cells to genes and multimodal data.

Mechanistic Interpretability

by Lorenz Hufe, 2024 - 2025

Sometimes everybody needs to know what is going on inside a (language) model