Andrew Barto

Cited by

	All	Since 2019
Citations	122684	59363
h-index	92	58
i10-index	214	138

Citations per year

Year	Citations
1990	398
1991	481
1992	513
1993	502
1994	606
1995	591
1996	573
1997	759
1998	799
1999	873
2000	997
2001	1291
2002	1321
2003	1535
2004	1816
2005	2251
2006	2356
2007	2771
2008	2946
2009	2950
2010	3174
2011	3281
2012	3394
2013	3682
2014	3706
2015	3418
2016	3842
2017	4397
2018	5971
2019	7897
2020	9547
2021	11047
2022	11774
2023	12538
2024	6551

Public access (based on funding mandates)

11 articles available
0 articles not available

Co-authors

Name	Affiliation	Verified email at
Richard S. Sutton	Keen, Amii, and University of Alberta	richsutton.com
George Konidaris	Brown	cs.brown.edu
Charles Anderson	Professor of computer science, Colorado State University. Founder, Pattern Exploration, LLC.	colostate.edu
Roderic Grupen	University of Massachusetts	cs.umass.edu
Scott Kuindersma	Senior Director of Robotics Research at Boston Dynamics	seas.harvard.edu
Andrew Fagg	University of Oklahoma	cs.ou.edu
Scott Niekum	Associate Professor, University of Massachusetts Amherst	cs.umass.edu
Özgür Şimşek	Professor of Artificial Intelligence, University of Bath	bath.ac.uk
Steven Bradtke	Research Engineer, Vistronix, Inc.	asrcfederal.com
Sridhar Mahadevan	Director, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst	cs.umass.edu
Richard L. Lewis	Professor of Psychology, Linguistics and Cognitive Science, University of Michigan	umich.edu
Theodore J. Perkins	Ottawa Hospital Research Institute	ohri.ca
Gianluca Baldassarre	Senior Researcher, Institute of Cognitive Sciences and Technologies, Italian National Research	istc.cnr.it
Amy McGovern	University of Oklahoma	ou.edu
Balaraman Ravindran	Professor of Computer Science, Indian Institute of Technology Madras	cse.iitm.ac.in
Sarah Osentoski	Vinci4d	vinci4d.ai
Marco Mirolli	Researcher, Istituto di Scienze e Tecnologie della Cognizione, CNR, Italy	istc.cnr.it
Nuttapong Chentanez	NVIDIA, Chulalongkorn University	eecs.berkeley.edu
Alicia Wolfe	Postdoc, Wesleyan University	wesleyan.edu
Matthew Botvinick	Google DeepMind, Yale Law School, University College London	google.com


Andrew Barto

University of Massachusetts Amherst

Verified email at cs.umass.edu

Reinforcement learning


Title	Authors	Venue	Cited by	Year
Reinforcement learning: An introduction	RS Sutton, AG Barto	MIT press, 2018	72756	2018
Introduction to reinforcement learning	D Ernst, A Louette	Feuerriegel, S., Hartmann, J., Janiesch, C., and Zschech, P.(2024 …, 2024	5848	2024
Neuronlike adaptive elements that can solve difficult learning control problems	AG Barto, RS Sutton, CW Anderson	IEEE transactions on systems, man, and cybernetics, 834-846, 1983	5070	1983
Toward a modern theory of adaptive networks: expectation and prediction.	RS Sutton, AG Barto	Psychological review 88 (2), 135, 1981	1826	1981
Recent advances in hierarchical reinforcement learning	AG Barto, S Mahadevan	Discrete event dynamic systems 13, 341-379, 2003	1743	2003
Learning to act using real-time dynamic programming	AG Barto, SJ Bradtke, SP Singh	Artificial intelligence 72 (1-2), 81-138, 1995	1656	1995
Introduction to reinforcement learning. Vol. 135	RS Sutton, AG Barto	MIT press Cambridge 5, 21-22, 1998	1127	1998
Intrinsically motivated reinforcement learning	N Chentanez, A Barto, S Singh	Advances in neural information processing systems 17, 2004	1030	2004
Linear least-squares algorithms for temporal difference learning	SJ Bradtke, AG Barto	Machine learning 22 (1), 33-57, 1996	1005	1996
Handbook of learning and approximate dynamic programming	J Si, AG Barto, WB Powell, D Wunsch	John Wiley & Sons, 2004	974	2004
Improving elevator performance using reinforcement learning	R Crites, A Barto	Advances in neural information processing systems 8, 1995	899	1995
A model of how the basal ganglia generate and use neural signals that predict reinforcement	JC Houk, JL Adams, AG Barto		884	1994
Reinforcement learning is direct adaptive optimal control	RS Sutton, AG Barto, RJ Williams	IEEE control systems magazine 12 (2), 19-22, 1992	816	1992
Task decomposition through competition in a modular connectionist architecture: The what and where vision tasks	RA Jacobs, MI Jordan, AG Barto	Cognitive science 15 (2), 219-250, 1991	802	1991
Time-derivative models of Pavlovian reinforcement.	RS Sutton, AG Barto	The MIT Press, 1990	794	1990
Reinforcement Learning: An Introduction. By Richard’s Sutton	AG Barto	SIAM Rev 6 (2), 423, 2021	752	2021
Adaptive critics and the basal ganglia	AG Barto		728	1994
Reinforcement learning: an introduction MIT Press	RS Sutton, AG Barto	Cambridge, MA 22447, 10, 1998	663	1998
Learning and sequential decision making	AG Barto, RS Sutton, C Watkins	University of Massachusetts, 1989	662	1989
Automatic discovery of subgoals in reinforcement learning using diverse density	A McGovern, AG Barto		646	2001
