Miljan Martic
DeepMind
Verified email at google.com
Title
Cited by
Year
Deep reinforcement learning from human preferences
P Christiano, J Leike, TB Brown, M Martic, S Legg, D Amodei
arXiv preprint arXiv:1706.03741, 2017
Cited by: 454, Year: 2017
AI safety gridworlds
J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ...
arXiv preprint arXiv:1711.09883, 2017
Cited by: 182, Year: 2017
Scalable agent alignment via reward modeling: a research direction
J Leike, D Krueger, T Everitt, M Martic, V Maini, S Legg
arXiv preprint arXiv:1811.07871, 2018
Cited by: 67, Year: 2018
Penalizing side effects using stepwise relative reachability
V Krakovna, L Orseau, R Kumar, M Martic, S Legg
arXiv preprint arXiv:1806.01186, 2018
Cited by: 24, Year: 2018
Measuring and avoiding side effects using relative reachability
V Krakovna, L Orseau, M Martic, S Legg
arXiv preprint arXiv:1806.01186, 2018
Cited by: 14, Year: 2018
Deep reinforcement learning from human preferences, 2017
P Christiano, J Leike, TB Brown, M Martic, S Legg, D Amodei
arXiv preprint arXiv:1706.03741
Cited by: 7
Avoiding side effects by considering future tasks
V Krakovna, L Orseau, R Ngo, M Martic, S Legg
arXiv preprint arXiv:2010.07877, 2020
Cited by: 5, Year: 2020
Algorithms for causal reasoning in probability trees
T Genewein, T McGrath, G Delétang, V Mikulik, M Martic, S Legg, ...
arXiv preprint arXiv:2010.12237, 2020
Cited by: 3, Year: 2020
Meta-trained agents implement Bayes-optimal agents
V Mikulik, G Delétang, T McGrath, T Genewein, M Martic, S Legg, ...
arXiv preprint arXiv:2010.11223, 2020
Cited by: 3, Year: 2020
Scaling shared model governance via model splitting
M Martic, J Leike, A Trask, M Hessel, S Legg, P Kohli
arXiv preprint arXiv:1812.05979, 2018
Cited by: 2, Year: 2018
Causal Analysis of Agent Behavior for AI Safety
G Déletang, J Grau-Moya, M Martic, T Genewein, T McGrath, V Mikulik, ...
arXiv preprint arXiv:2103.03938, 2021
Year: 2021