Michael Littman
Title
Cited by
Cited by
Year
Reinforcement learning: A survey
LP Kaelbling, ML Littman, AW Moore
Journal of artificial intelligence research 4, 237-285, 1996
80421996
Planning and acting in partially observable stochastic domains
LP Kaelbling, ML Littman, AR Cassandra
Artificial intelligence 101 (1-2), 99-134, 1998
40761998
Markov games as a framework for multi-agent reinforcement learning
ML Littman
Machine learning proceedings 1994, 157-163, 1994
23031994
Measuring praise and criticism: Inference of semantic orientation from association
PD Turney, ML Littman
ACM Transactions on Information Systems (TOIS) 21 (4), 315-346, 2003
20462003
Activity recognition from accelerometer data
N Ravi, N Dandekar, P Mysore, ML Littman
Aaai 5 (2005), 1541-1546, 2005
18762005
Packet routing in dynamically changing networks: A reinforcement learning approach
J Boyan, M Littman
Advances in neural information processing systems 6, 671-678, 1993
8761993
Acting optimally in partially observable stochastic domains
AR Cassandra, LP Kaelbling, ML Littman
Aaai 94, 1023-1028, 1994
8181994
Learning policies for partially observable environments: Scaling up
ML Littman, AR Cassandra, LP Kaelbling
Machine Learning Proceedings 1995, 362-370, 1995
7991995
Convergence results for single-step on-policy reinforcement-learning algorithms
S Singh, T Jaakkola, ML Littman, C Szepesvári
Machine learning 38 (3), 287-308, 2000
7312000
Graphical models for game theory
M Kearns, ML Littman, S Singh
arXiv preprint arXiv:1301.2281, 2013
7032013
Interactions between learning and evolution
D Ackley, M Littman
Artificial life II 10, 487-509, 1991
6761991
On the complexity of solving Markov decision problems
ML Littman, TL Dean, LP Kaelbling
arXiv preprint arXiv:1302.4971, 2013
6072013
Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes
AR Cassandra, ML Littman, NL Zhang
arXiv preprint arXiv:1302.1525, 2013
5832013
Friend-or-foe Q-learning in general-sum games
ML Littman
ICML 1, 322-328, 2001
5552001
Predictive representations of state
M Littman, RS Sutton
Advances in neural information processing systems 14, 1555-1561, 2001
5362001
Computerized cross-language document retrieval using latent semantic indexing
TK Landauer, ML Littman
US Patent 5,301,109, 1994
5041994
Algorithms for sequential decision making
ML Littman
Brown University, 1996
4971996
Unsupervised learning of semantic orientation from a hundred-billion-word corpus
PD Turney, ML Littman
arXiv preprint cs/0212012, 2002
4102002
PAC model-free reinforcement learning
AL Strehl, L Li, E Wiewiora, J Langford, ML Littman
Proceedings of the 23rd international conference on Machine learning, 881-888, 2006
4082006
Value-function reinforcement learning in Markov games
ML Littman
Cognitive systems research 2 (1), 55-66, 2001
4032001
The system can't perform the operation now. Try again later.
Articles 1–20