Joel Z Leibo

Citeras av

	Alla	Sedan 2019
Citat	12899	10953
h-index	40	35
i10-index	64	54

2800

1400

700

2100

20132014201520162017201820192020202120222023202468 84 91 153 435 901 1280 1753 2133 2289 2731 751

Offentlig åtkomst

Visa alla

10 artiklar

2 artiklar

tillgänglig

inte tillgänglig

Enligt krav från finansiärer

Medförfattare

Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCLVerifierad e-postadress på ucl.ac.uk
TOMASO POGGIOMcDermott Professor in Brain Sciences, MITVerifierad e-postadress på ai.mit.edu
Edward HughesStaff Research Engineer, DeepMindVerifierad e-postadress på google.com
Marc LanctotResearch Scientist, Google DeepMindVerifierad e-postadress på google.com
Edgar A. Duéñez-GuzmánGoogle DeepMindVerifierad e-postadress på oeb.harvard.edu
Karl TuylsResearch Scientist, Google DeepMind and Professor of computer science, University of LiverpoolVerifierad e-postadress på google.com
Wojciech Marian Czarnecki.Verifierad e-postadress på google.com
Matthew BotvinickGoogle DeepMind, Yale Law School, University College LondonVerifierad e-postadress på google.com
Charlie BeattieSoftware Engineer, DeepMindVerifierad e-postadress på google.com
Peter SunehagGoogle - DeepMindVerifierad e-postadress på google.com
Tom SchaulSenior Staff Scientist, DeepMindVerifierad e-postadress på nyu.edu
Kevin R. McKeeStaff Research Scientist, Google DeepMindVerifierad e-postadress på deepmind.com
Audrūnas GruslysVerifierad e-postadress på gruslys.com
Raphael KösterGoogle DeepMindVerifierad e-postadress på google.com
Jane X. WangStaff Research Scientist, DeepMindVerifierad e-postadress på google.com
Max JaderbergChief AI Officer, Isomorphic LabsVerifierad e-postadress på robots.ox.ac.uk
Fabio AnselmiAssistant professor at University of Trieste, MIT affiliateVerifierad e-postadress på units.it
Vinicius ZambaldiGoogle DeepmindVerifierad e-postadress på google.com
Dharshan KumaranGoogle DeepMindVerifierad e-postadress på fil.ion.ucl.ac.uk
Lorenzo RosascoMaLGa Machine Learning Genoa Center - Università degli Studi di GenovaVerifierad e-postadress på unige.it

Följ

Joel Z Leibo

Research scientist

Verifierad e-postadress på google.com - Startsida

Cooperation in AI & Neuroscience Multi-Agent Reinforcement Learning Machine Learning


Titel Sortera efter citat Sortera efter år Sortera efter titel	Citeras av Citeras av	År
Value-decomposition networks for cooperative multi-agent learning P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... arXiv preprint arXiv:1706.05296, 2017	1576	2017
Reinforcement learning with unsupervised auxiliary tasks M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ... arXiv preprint arXiv:1611.05397, 2016	1350	2016
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1325*	2018
Learning to reinforcement learn JX Wang, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo, R Munos, ... arXiv preprint arXiv:1611.05763, 2016	1024	2016
Human-level performance in 3D multiplayer games with population-based reinforcement learning M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... Science 364 (6443), 859-865, 2019	907	2019
Multi-agent reinforcement learning in sequential social dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel arXiv preprint arXiv:1702.03037, 2017	894	2017
Prefrontal cortex as a meta-reinforcement learning system JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala, H Soyer, JZ Leibo, ... Nature neuroscience 21 (6), 860-868, 2018	606	2018
Deepmind lab C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ... arXiv preprint arXiv:1612.03801, 2016	585	2016
Social influence as intrinsic motivation for multi-agent deep reinforcement learning N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ... International conference on machine learning, 3040-3049, 2019	498*	2019
Model-free episodic control C Blundell, B Uria, A Pritzel, Y Li, A Ruderman, JZ Leibo, J Rae, ... arXiv preprint arXiv:1606.04460, 2016	288*	2016
The dynamics of invariant object recognition in the human visual system L Isik, EM Meyers, JZ Leibo, T Poggio Journal of neurophysiology 111 (1), 91-102, 2014	272	2014
Using fast weights to attend to the recent past J Ba, GE Hinton, V Mnih, JZ Leibo, C Ionescu Advances in neural information processing systems 29, 2016	257	2016
Inequity aversion improves cooperation in intertemporal social dilemmas E Hughes, JZ Leibo, M Phillips, K Tuyls, E Dueñez-Guzman, ... Advances in neural information processing systems 31, 2018	233	2018
A multi-agent reinforcement learning model of common-pool resource appropriation J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel Advances in neural information processing systems 30, 2017	209	2017
Unsupervised predictive memory in a goal-directed agent G Wayne, CC Hung, D Amos, M Mirza, A Ahuja, A Grabska-Barwinska, ... arXiv preprint arXiv:1803.10760, 2018	194	2018
Open problems in cooperative ai A Dafoe, E Hughes, Y Bachrach, T Collins, KR McKee, JZ Leibo, K Larson, ... arXiv preprint arXiv:2012.08630, 2020	184	2020
Emergent communication through negotiation K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark arXiv preprint arXiv:1804.03980, 2018	177	2018
How important is weight symmetry in backpropagation? Q Liao, J Leibo, T Poggio Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016	166	2016
Unsupervised learning of invariant representations F Anselmi, JZ Leibo, L Rosasco, J Mutch, A Tacchetti, T Poggio Theoretical Computer Science 633, 112-121, 2016	140	2016
Kickstarting deep reinforcement learning S Schmitt, JJ Hudson, A Zidek, S Osindero, C Doersch, WM Czarnecki, ... arXiv preprint arXiv:1803.03835, 2018	132	2018

Systemet kan inte utföra åtgärden just nu. Försök igen senare.

Artiklar 1–20

Citat per år

Dubblettcitat

Sammanfogade citat

Lägg till medförfattareMedförfattare

Följ

Citeras av

Medförfattare