Victor-Alexandru Darvariu
I am a computer scientist working as a Postdoctoral Researcher at the Oxford Robotics Institute, University of Oxford, where I am a part of the GOALS group led by Nick Hawes. I am also a Retained Lecturer in Engineering Science at Jesus College and an Honorary Research Fellow at UCL Computer Science.
I am interested in reinforcement learning (RL) and artificial intelligence (AI) more broadly. The key insight behind my work is the ability of RL to discover, by trial-and-error, ways of solving decision-making problems that can outperform or complement traditional methods. My work develops rigorous RL methodologies, especially for graph-structured systems (Graph RL), and applies them to disciplines as diverse as robotics, operations research, and statistics (AI for Science).
News
[May 2026] Honoured to be serving the community as Reviewing Chair for the Learning on Graphs (LoG) 2026 Conference. We will be recruiting reviewers and area chairs soon, please look out for announcements!
[May 2026] Our latest work introduces FlowIQN, a theoretically grounded flow-matching critic for distributional RL. By aligning flow matching with optimal transport and Wasserstein geometry, we bridge recent ideas in generative models with the foundations of distributional RL. This improves the accuracy of the return distribution and downstream offline RL performance.
[Mar 2026] Our work Accelerating atomic fine structure determination with graph reinforcement learning, a collaboration with researchers at Imperial College, has been published in Nature Communication Physics. This paper proposes an AI framework to accelerate a fundamental discovery task in atomic physics that takes highly-trained human experts months or even years to complete.
[Feb 2026] My colleague Alex Schutz and I wrote a blog post on using graph neural networks in reinforcement learning. This work, which will be presented at the ICLR 2026 Blogposts Track, covers crucial design decisions and practical implementation concerns. It also includes an implementation example using well-known libraries.
For older news, see the archive.