Research output per year
Research output per year
G6 Kilburn Building
Accepting PhD Students
PhD projects
* Policy gradients in large-scale training;
* Transformers and generative models for decision-making tasks;
* Online learning from interactions and demonstrations.
My research concerns artificial intelligence, with a particular focus on machine learning techniques such as deep reinforcement learning and learning from demonstration. I am applying these methods to problems in video games, robotics and multi-agent systems.
Research output: Preprint/Working paper › Preprint
Research output: Preprint/Working paper › Preprint
Research output: Preprint/Working paper › Preprint
Research output: Preprint/Working paper › Preprint
Research output: Preprint/Working paper › Preprint
Kaski, S. (PI), Alvarez, M. (Researcher), Pan, W. (Researcher), Mu, T. (Researcher), Rivasplata, O. (PI), Sun, M. (Researcher), Mukherjee, A. (Researcher), Caprio, M. (PI), Sonee, A. (Researcher), Leroy, A. (Researcher), Wang, J. (Researcher), Lee, J. (Researcher), Parakkal Unni, M. (Researcher), Sloman, S. (Researcher), Menary, S. (Researcher), Quilter, T. (Researcher), Hosseinzadeh, A. (PGR student), Mousa, A. (PGR student), Glover, E. (PGR student), Das, A. (PGR student), DURSUN, F. (PGR student), Zhu, H. (PGR student), Abdi, H. (PGR student), Dandago, K. (PGR student), Piriyajitakonkij, M. (PGR student), Rachman, R. (PGR student), Shi, X. (PGR student), Keany, T. (PGR student), Liu, X. (PGR student), Jiang, Y. (PGR student), Wan, Z. (PGR student), Evans, I. (Support team), Harrison, M. (Support team) & Machado, M. (PI)
1/10/21 → 30/09/26
Project: Research
Sun, M. (Recipient), Devlin, S. (Recipient), Beck, J. (Recipient), Hofmann, K. (Recipient) & Whiteson, S. (Recipient), 1 Jun 2023
Prize: Prize (including medals and awards)