Follow
Ronald Ortner
Ronald Ortner
Verified email at unileoben.ac.at - Homepage
Title
Cited by
Cited by
Year
Near-optimal regret bounds for reinforcement learning
P Auer, T Jaksch, R Ortner
Advances in neural information processing systems 21, 2008
16172008
UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem
P Auer, R Ortner
Periodica Mathematica Hungarica 61 (1-2), 55-65, 2010
3832010
Logarithmic online regret bounds for undiscounted reinforcement learning
P Auer, R Ortner
Advances in neural information processing systems 19, 2006
3112006
Improved rates for the stochastic continuum-armed bandit problem
P Auer, R Ortner, C Szepesvári
International Conference on Computational Learning Theory, 454-468, 2007
2722007
Adaptively tracking the best bandit arm with an unknown number of distribution changes
P Auer, P Gajane, R Ortner
Conference on Learning Theory, 138-158, 2019
166*2019
Efficient bias-span-constrained exploration-exploitation in reinforcement learning
R Fruit, M Pirotta, A Lazaric, R Ortner
International Conference on Machine Learning, 1578-1586, 2018
1232018
A boosting approach to multiple instance learning
P Auer, R Ortner
European conference on machine learning, 63-74, 2004
1112004
Online regret bounds for undiscounted continuous reinforcement learning
R Ortner, D Ryabko
Advances in Neural Information Processing Systems 25, 2012
942012
Regret bounds for restless markov bandits
R Ortner, D Ryabko, P Auer, R Munos
International conference on algorithmic learning theory, 214-228, 2012
932012
Variational regret bounds for reinforcement learning
R Ortner, P Gajane, P Auer
Uncertainty in Artificial Intelligence, 81-90, 2020
712020
Regret bounds for restless Markov bandits
R Ortner, D Ryabko, P Auer, R Munos
Theoretical Computer Science 558, 62-76, 2014
592014
PAC-Bayesian analysis of contextual bandits
Y Seldin, P Auer, J Shawe-taylor, R Ortner, F Laviolette
Advances in neural information processing systems 24, 2011
582011
Regret bounds for reinforcement learning via markov chain concentration
R Ortner
Journal of Artificial Intelligence Research 67, 115-128, 2020
542020
Pareto front identification from stochastic bandit feedback
P Auer, CK Chiang, R Ortner, M Drugan
Artificial intelligence and statistics, 939-947, 2016
542016
A sliding-window algorithm for markov decision processes with arbitrarily changing rewards and transitions
P Gajane, R Ortner, P Auer
arXiv preprint arXiv:1805.10066, 2018
532018
Improved learning complexity in combinatorial pure exploration bandits
V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett
Artificial Intelligence and Statistics, 1004-1012, 2016
472016
Non-backtracking random walks and cogrowth of graphs
R Ortner, W Woess
Canadian Journal of Mathematics 59 (4), 828-844, 2007
462007
Improved regret bounds for undiscounted continuous reinforcement learning
K Lakshmanan, R Ortner, D Ryabko
International conference on machine learning, 524-532, 2015
452015
Pseudometrics for state aggregation in average reward Markov decision processes
R Ortner
Algorithmic Learning Theory: 18th International Conference, ALT 2007, Sendai …, 2007
412007
Adaptive aggregation for reinforcement learning in average reward Markov decision processes
R Ortner
Annals of Operations Research 208, 321-336, 2013
372013
The system can't perform the operation now. Try again later.
Articles 1–20