Joel Z Leibo
Research scientist
Verified email at google.com
Title | Cited by | Year
Reinforcement learning with unsupervised auxiliary tasks
M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ...
arXiv preprint arXiv:1611.05397, 2016
Cited by: 464 | Year: 2016
Deep Q-learning from demonstrations
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Cited by: 276 | Year: 2018
Multi-agent reinforcement learning in sequential social dilemmas
JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent …, 2017
Cited by: 203 | Year: 2017
DeepMind Lab
C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ...
arXiv preprint arXiv:1612.03801, 2016
Cited by: 183 | Year: 2016
The dynamics of invariant object recognition in the human visual system
L Isik, EM Meyers, JZ Leibo, T Poggio
Journal of neurophysiology 111 (1), 91-102, 2013
Cited by: 152 | Year: 2013
Human-level performance in 3D multiplayer games with population-based reinforcement learning
M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ...
Science 364 (6443), 859-865, 2019
Cited by: 129* | Year: 2019
Model-free episodic control
C Blundell, B Uria, A Pritzel, Y Li, A Ruderman, JZ Leibo, J Rae, ...
arXiv preprint arXiv:1606.04460, 2016
Cited by: 123* | Year: 2016
Value-decomposition networks for cooperative multi-agent learning
P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ...
arXiv preprint arXiv:1706.05296, 2017
Cited by: 104* | Year: 2017
Prefrontal cortex as a meta-reinforcement learning system
JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala, H Soyer, JZ Leibo, ...
Nature neuroscience 21 (6), 860, 2018
Cited by: 92 | Year: 2018
Using fast weights to attend to the recent past
J Ba, GE Hinton, V Mnih, JZ Leibo, C Ionescu
Advances in Neural Information Processing Systems, 4331-4339, 2016
Cited by: 80 | Year: 2016
Unsupervised learning of invariant representations in hierarchical architectures
F Anselmi, JZ Leibo, L Rosasco, J Mutch, A Tacchetti, T Poggio
arXiv preprint arXiv:1311.4158, 2013
Cited by: 67 | Year: 2013
Unsupervised predictive memory in a goal-directed agent
G Wayne, CC Hung, D Amos, M Mirza, A Ahuja, A Grabska-Barwinska, ...
arXiv preprint arXiv:1803.10760, 2018
Cited by: 60 | Year: 2018
How important is weight symmetry in backpropagation?
Q Liao, JZ Leibo, T Poggio
Thirtieth AAAI Conference on Artificial Intelligence, 2016
Cited by: 60 | Year: 2016
A multi-agent reinforcement learning model of common-pool resource appropriation
J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel
Advances in Neural Information Processing Systems, 3643-3652, 2017
Cited by: 56 | Year: 2017
Learning invariant representations and applications to face verification
Q Liao, JZ Leibo, T Poggio
Advances in neural information processing systems, 3057-3065, 2013
Cited by: 56 | Year: 2013
Unsupervised learning of invariant representations
F Anselmi, JZ Leibo, L Rosasco, J Mutch, A Tacchetti, T Poggio
Theoretical Computer Science 633, 112-121, 2016
Cited by: 54 | Year: 2016
Learning and disrupting invariance in visual recognition with a temporal association rule
L Isik, JZ Leibo, T Poggio
Frontiers in computational neuroscience 6, 37, 2012
Cited by: 43 | Year: 2012
Why the brain separates face recognition from object recognition
JZ Leibo, J Mutch, T Poggio
Advances in neural information processing systems, 711-719, 2011
Cited by: 40 | Year: 2011
The computational magic of the ventral stream: sketch of a theory (and why some deep architectures work).
T Poggio, J Mutch, J Leibo, L Rosasco, A Tacchetti
Cited by: 35 | Year: 2012
Emergent communication through negotiation
K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark
arXiv preprint arXiv:1804.03980, 2018
Cited by: 33 | Year: 2018