I'm a Research Scientist in the vision team at the Technology Innovation Institute (TII), while finishing my PhD at the Tübingen AI Center, advised by Prof. Hilde Kuehne. I'm also participating in the MIT-IBM Watson Sight and Sound Project.
Prior to this, I finished my Master of Engineering in Applied Mathematics at ENSTA Paris in France and my MVA Master (Mathematics, Vision and Learning) at ENS Paris-Saclay.
Before starting my PhD, I worked for 3 years in industry research teams: at Credit Agricole's Datalab in Montrouge, France, where I mostly focused on NLP and research engines, and at Huawei's Noah's Ark Lab in Paris, working on optimization and Reinforcement Learning.
My current research focuses on multimodal models, especially Video Understanding, Video-Language modeling, Efficient video representation and efficient Multimodal models.
CVPR 2026 (Highlight) — Best Paper Award, A2A Multimodal Workshop
NeurIPS 2022, MetaLearn Workshop