Deep Actor-Critics with Tight Risk Certificates
Preprint on arXiv, 2025
Recursive PAC-Bayesian bounds for actor-critic methods.
I'm a postdoctoral researcher at the University of Southern Denmark in Odense, where I work in the group of Melih Kandemir. I did my PhD at Heidelberg University with Fred Hamprecht and Melih, and spent two great years at Aalto University in Finland working with Samuel Kaski and Harri Lähdesmäki. My research focus is on Bayesian probabilistic machine learning with applications ranging from physics over health data to reinforcement learning. I am an action editor for TMLR, a reviewer for most major machine learning conferences (AAAI, AISTATS, ICLR, ICML, NeurIPS), and a member of the ICBINB initiative.
I'm on the job market starting from December this year.
I'm always interested in new collaborations. Feel free to send me an email!
Preprint on arXiv, 2025
Recursive PAC-Bayesian bounds for actor-critic methods.
Transactions on Machine Learning Research; published version, 2025
Machine Learning for Health Symposium; extended version, 2024
ICML Workshop on Structured Probabilistic Inference & Generative Modeling, 2024
A structured variational auto-encoder constrained with a mixed model for longitudinal data.
Preprint on arXiv, 2025
Fixing plasticity issues in PPO via evidential learning.
International Conference on Learning Representations, 2025
Improving Bayesian optimization by incorporating prior knowledge into the latent space.
Preprint on arXiv, 2024
An evaluation of uncertainty calibration with BNNs and ensembles on LHC physics data.
Preprint on arXiv, 2024
We propose a novel actor-critic algorithm for continuous control in sparse reward settings, using a critic ensemble and PAC-Bayesian bounds on Bellman operator error to enable deep exploration via posterior sampling.
Conference on Neural Information Processing Systems, 2024
We introduce a moment-matching based uncertainty penalty for the Bellman operator in an offline RL setting.
Advances in Approximate Bayesian Inference, 2024
We introduce the first PAC Bayesian bound trainable in a modern deep reinforcement learning pipeline to solve continuous control tasks.
International Conference on Machine Learning, 2024
A model for longitudinal data that can handle irregular observations and model missing not at random data.
International Conference on Artificial Intelligence and Statistics, 2024
We introduce a probabilistic model to combine different patient populations for trial effect estimation.
Nature Scientific Reports, 2024
An evaluation on the differences between a curated medical trial data set and uncurated electronic health records data.
Neural Information Processing Systems, 2023
We introduce a method to incorporate high-dimensional equivariances into (conditional) neural processes.
SciPost Physics, 2022
An evaluation of Bayesian normalizing flow for LHC events.
International Conference on Learning Representations, 2022
Evidential deep learning combined with concepts from neural Turing machines and neural processes.
International Conference on Artificial Intelligence and Statistics, 2021
We introduce a probabilistic method to learn dynamical systems that allows for partial input incorporation.
Advances in Approximate Bayesian Inference, 2021
We combine evidential deep learning and PAC-based regularization with moment-matching approaches from Bayesian deep learning.
SciPost Physics, 2020
We discuss Bayesian deep-learning methods for jet classification in particle physics.
Uncertainty in Artificial Intelligence Conference, 2020
A sampling-free approach to BNN learning via ReLU decomposition.
International Joint Conference on Artificial Intelligence, 2019
We introduce a way to adaptively learn the acquisition function for active learning with reinforcement feedback.
International Conference on Learning Representations, 2019
We present LeMoNADe, an end-to-end learned motif detection method directly operating on calcium imaging videos.
Conference on Computer Vision and Pattern Recognition, 2017
Instance level inference with closed-form updates in a Gaussian processed-based model for weakly supervised multiple instance learning and object detection.
British Machine Vision Conference, 2016
We introduce a model for multiple instance learning with Gaussian processes for bag-level predictions with closed-form updates.