machine learning researcher working on probabilistic methods, vision-language models and sample efficient reinforcement learning
Selected publications
Reinforcement Learning via Self-Distillation
Jonas Hübotter, Frederike Lübeck*, Lejs Behric*, Anton Baumann*, Marco Bagatella, Daniel Marta, Ido Hakimi, Idan Shenfeld, Thomas Kleine Buening, Carlos Guestrin, Andreas Krause
ICML 2026; *equal second authorship.
Best Paper Award at ICLR 2026 Workshop on Test-Time Updates and Oral Presentation at ICLR 2026 Workshop on Scaling Post-training for LLMs.