Följ
Andrew Patterson
Andrew Patterson
Verifierad e-postadress på ualberta.ca - Startsida
Titel
Citeras av
Citeras av
År
Supervised autoencoders: Improving generalization performance with unsupervised regularizers
L Le, A Patterson, M White
Advances in neural information processing systems 31, 2018
2602018
The open diffusion data derivatives, brain data upcycling via integrated publishing of derivatives and reproducible open cloud services
P Avesani, B McPherson, S Hayashi, CF Caiafa, R Henschel, ...
Scientific data 6 (1), 69, 2019
912019
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, M Zaheer, A White, A Patterson, M White
International Joint Conference on Artificial Intelligence, 2018
532018
Gradient Temporal-Difference Learning with Regularized Corrections
S Ghiassian, A Patterson, S Garg, D Gupta, A White, M White
International Conference on Machine Learning, 2020
432020
General value function networks
M Schlegel, A Jacobsen, M Zaheer, A Patterson, A White, M White
arXiv preprint arXiv:1807.06763, 2018
392018
Online off-policy prediction
S Ghiassian, A Patterson, M White, RS Sutton, A White
arXiv preprint arXiv:1811.02597, 2018
302018
A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
A Patterson, A White, M White
Journal of Machine Learning Research 23 (1), 2022
162022
Empirical design in reinforcement learning
A Patterson, S Neumann, M White, A White
arXiv preprint arXiv:2304.01315, 2023
9*2023
Robust losses for learning value functions
A Patterson, V Liao, M White
IEEE transactions on pattern analysis and machine intelligence 45 (5), 6157-6167, 2022
82022
Learning macroscopic brain connectomes via group-sparse factorization
F Aminmansour, A Patterson, L Le, Y Peng, D Mitchell, F Pestilli, ...
Advances in Neural Information Processing Systems 32, 2019
52019
A temporal-difference approach to policy gradient estimation
S Tosatto, A Patterson, M White, R Mahmood
International Conference on Machine Learning, 21609-21632, 2022
22022
The cross-environment hyperparameter setting benchmark for reinforcement learning
A Patterson, S Neumann, A White, R Kumaraswamy, M White
12021
Discovery of Predictive Representations With a Network of General Value Functions
M Schlegel, A Patterson, A White, M White
12018
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
V Liu, P Nagarajan, A Patterson, M White
arXiv preprint arXiv:2312.02355, 2023
2023
A Gradient Critic for Policy Gradient Estimation
S Tosatto, A Patterson, M White, AR Mahmood
Sixteenth European Workshop on Reinforcement Learning, 2023
2023
When is Offline Hyperparameter Selection Feasible for Reinforcement Learning?
V Liu, P Nagarajan, A Patterson, M White
2022
When is Offline Policy Selection Feasible for Reinforcement Learning?
V Liu, P Nagarajan, A Patterson, M White
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–17