Följ
Andrew Barto
Andrew Barto
Verifierad e-postadress på cs.umass.edu - Startsida
Titel
Citeras av
Citeras av
År
Reinforcement learning: An introduction
RS Sutton, AG Barto
MIT press, 2018
727562018
Introduction to reinforcement learning
D Ernst, A Louette
Feuerriegel, S., Hartmann, J., Janiesch, C., and Zschech, P.(2024 …, 2024
58482024
Neuronlike adaptive elements that can solve difficult learning control problems
AG Barto, RS Sutton, CW Anderson
IEEE transactions on systems, man, and cybernetics, 834-846, 1983
50701983
Toward a modern theory of adaptive networks: expectation and prediction.
RS Sutton, AG Barto
Psychological review 88 (2), 135, 1981
18261981
Recent advances in hierarchical reinforcement learning
AG Barto, S Mahadevan
Discrete event dynamic systems 13, 341-379, 2003
17432003
Learning to act using real-time dynamic programming
AG Barto, SJ Bradtke, SP Singh
Artificial intelligence 72 (1-2), 81-138, 1995
16561995
Introduction to reinforcement learning. Vol. 135
RS Sutton, AG Barto
MIT press Cambridge 5, 21-22, 1998
11271998
Intrinsically motivated reinforcement learning
N Chentanez, A Barto, S Singh
Advances in neural information processing systems 17, 2004
10302004
Linear least-squares algorithms for temporal difference learning
SJ Bradtke, AG Barto
Machine learning 22 (1), 33-57, 1996
10051996
Handbook of learning and approximate dynamic programming
J Si, AG Barto, WB Powell, D Wunsch
John Wiley & Sons, 2004
9742004
Improving elevator performance using reinforcement learning
R Crites, A Barto
Advances in neural information processing systems 8, 1995
8991995
A model of how the basal ganglia generate and use neural signals that predict reinforcement
JC Houk, JL Adams, AG Barto
8841994
Reinforcement learning is direct adaptive optimal control
RS Sutton, AG Barto, RJ Williams
IEEE control systems magazine 12 (2), 19-22, 1992
8161992
Task decomposition through competition in a modular connectionist architecture: The what and where vision tasks
RA Jacobs, MI Jordan, AG Barto
Cognitive science 15 (2), 219-250, 1991
8021991
Time-derivative models of Pavlovian reinforcement.
RS Sutton, AG Barto
The MIT Press, 1990
7941990
Reinforcement Learning: An Introduction. By Richard’s Sutton
AG Barto
SIAM Rev 6 (2), 423, 2021
7522021
Adaptive critics and the basal ganglia
AG Barto
7281994
Reinforcement learning: an introduction MIT Press
RS Sutton, AG Barto
Cambridge, MA 22447, 10, 1998
6631998
Learning and sequential decision making
AG Barto, RS Sutton, C Watkins
University of Massachusetts, 1989
6621989
Automatic discovery of subgoals in reinforcement learning using diverse density
A McGovern, AG Barto
6462001
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20