Satinder Singh
Satinder Singh
Computer Science and Engineering, University of Michigan
Verified email at umich.edu - Homepage
Title
Cited by
Cited by
Year
Differential evolution: A survey of the state-of-the-art
S Das, PN Suganthan
IEEE transactions on evolutionary computation 15 (1), 4-31, 2010
37462010
Policy Gradient Methods for Reinforcement Learning with Function Approximation
R Sutton, D McAllester, S Singh, Y Mansour
Neural Information Processing Systems, 1999
32671999
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
RS Sutton, D Precup, S Singh
Artificial intelligence 112 (1-2), 181-211, 1999
24431999
Learning to act using real-time dynamic programming
AG Barto, SJ Bradtke, SP Singh
Artificial intelligence 72 (1-2), 81-138, 1995
14231995
Near-optimal reinforcement learning in polynomial time
M Kearns, S Singh
Machine learning 49 (2-3), 209-232, 2002
9112002
Convergence of stochastic iterative dynamic programming algorithms
T Jaakkola, MI Jordan, SP Singh
Advances in neural information processing systems, 703-710, 1994
8621994
Reinforcement learning with replacing eligibility traces
SP Singh, RS Sutton
Machine learning 22 (1-3), 123-158, 1996
8061996
Convergence results for single-step on-policy reinforcement-learning algorithms
S Singh, T Jaakkola, ML Littman, C Szepesvári
Machine learning 38 (3), 287-308, 2000
6912000
Graphical models for game theory
M Kearns, ML Littman, S Singh
arXiv preprint arXiv:1301.2281, 2013
6872013
Intrinsically motivated reinforcement learning
N Chentanez, AG Barto, SP Singh
Advances in neural information processing systems, 1281-1288, 2005
6412005
Action-conditional video prediction using deep networks in atari games
J Oh, X Guo, H Lee, RL Lewis, S Singh
Advances in neural information processing systems, 2863-2871, 2015
5662015
Predictive representations of state
ML Littman, RS Sutton
Advances in neural information processing systems, 1555-1561, 2002
5212002
Learning without state-estimation in partially observable Markovian decision processes
SP Singh, T Jaakkola, MI Jordan
Machine Learning Proceedings 1994, 284-292, 1994
4631994
Reinforcement learning algorithm for partially observable Markov decision problems
T Jaakkola, SP Singh, MI Jordan
Advances in neural information processing systems, 345-352, 1995
4451995
Intrinsically motivated learning of hierarchical collections of skills
AG Barto, S Singh, N Chentanez
Proceedings of the 3rd International Conference on Development and Learning …, 2004
4402004
Transfer of learning by composing solutions of elemental sequential tasks
SP Singh
Machine Learning 8 (3-4), 323-339, 1992
4171992
Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system
S Singh, D Litman, M Kearns, M Walker
Journal of Artificial Intelligence Research 16, 105-133, 2002
4102002
Eligibility traces for off-policy policy evaluation
D Precup
Computer Science Department Faculty Publication Series, 80, 2000
3962000
Reinforcement Learning with Soft State Aggregation
S Singh, T Jaakkola, M Jordan
Neural Information Processing Systems, 1995
3381995
Reinforcement learning for dynamic channel allocation in cellular telephone systems
SP Singh, DP Bertsekas
Advances in neural information processing systems, 974-980, 1997
3261997
The system can't perform the operation now. Try again later.
Articles 1–20