Senior ML Engineer · WSD
Training reasoning LLMs with RL (GRPO, DPO) for document automation; optimizing inference pipelines; experimenting with SSMs, VLMs and long-context architectures.
About
ML engineer with a background in machine learning, statistics and engineering. I read research papers and ship them to production.
Focus areas
Large language models | training & inference | efficiency | state-space models | document automation
But also traveling the world, learning new languages, surfing...
Education
Career
Training reasoning LLMs with RL (GRPO, DPO) for document automation; optimizing inference pipelines; experimenting with SSMs, VLMs and long-context architectures.
Research on sub-quadratic architectures and efficient transformers. Implemented Mamba, Hyena, S4, H3 and RWKV.
Co-founded a sovereign LLM assistant using RAG; diffusion for satellite imagery; NLP tooling and semantic search on GCP.
Implemented computer vision systems using state-of-the-art deep learning architectures.