Roshan Shariff
Roshan Shariff
Verifierad e-postadress på ualberta.ca - Startsida
Titel
Citeras av
Citeras av
År
Conservative bandits
Y Wu, R Shariff, T Lattimore, C Szepesvári
International Conference on Machine Learning, 1254-1262, 2016
372016
Differentially private contextual linear bandits
R Shariff, O Sheffet
Advances in Neural Information Processing Systems, 4296-4306, 2018
182018
Discounted reinforcement learning is not an optimization problem
A Naik, R Shariff, N Yasui, RS Sutton
arXiv preprint arXiv:1910.02140, 2019
32019
Exploiting symmetries to construct efficient MCMC algorithms with an application to SLAM
R Shariff, A György, C Szepesvári
Artificial Intelligence and Statistics, 866-874, 2015
22015
Lunar Lander: A Continous-Action Case Study for Policy-Gradient Actor-Critic Algorithms
R Shariff, T Dick
RLDM, 2013
12013
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–5