Martha White
Martha White
Verifierad e-postadress på ualberta.ca - Startsida
Titel
Citeras av
Citeras av
År
Off-Policy Actor-Critic
T Degris, M White, RS Sutton
Twenty-Ninth International Conference on Machine Learning, 2012
2372012
Convex multi-view subspace learning
M White, X Zhang, D Schuurmans, Y Yu
Advances in neural information processing systems, 1673-1681, 2012
1302012
An emphatic approach to the problem of off-policy temporal-difference learning
RS Sutton, AR Mahmood, M White
The Journal of Machine Learning Research 17 (1), 2603-2631, 2016
1122016
Estimating the class prior and posterior from noisy positives and unlabeled data
S Jain, M White, P Radivojac
Advances in neural information processing systems, 2693-2701, 2016
592016
Supervised autoencoders: Improving generalization performance with unsupervised regularizers
L Le, A Patterson, M White
Advances in Neural Information Processing Systems, 107-117, 2018
402018
Unifying task specification in reinforcement learning
M White
International Conference on Machine Learning, 2016
362016
Relaxed clipping: A global training method for robust regression and classification
Y Yu, M Yang, L Xu, M White, D Schuurmans
Advances in Neural Information Processing Systems 23, 2011
362011
Nonparametric semi-supervised learning of class proportions
S Jain, M White, MW Trosset, P Radivojac
arXiv preprint arXiv:1601.01944, 2016
352016
Convex Sparse Coding, Subspace Learning, and Semi-Supervised Extensions.
X Zhang, Y Yu, M White, R Huang, D Schuurmans
Proceedings of the AAAI Conference on Artificial Intelligence, 2011
342011
Meta-learning representations for continual learning
K Javed, M White
Advances in Neural Information Processing Systems, 1820-1830, 2019
332019
Recovering true classifier performance in positive-unlabeled learning
S Jain, M White, P Radivojac
arXiv preprint arXiv:1702.00518, 2017
282017
Optimal reverse prediction: a unified perspective on supervised, unsupervised and semi-supervised learning
L Xu, M White, D Schuurmans
Proceedings of the 26th International Conference on Machine Learning, 1137-1144, 2009
262009
A greedy approach to adapting the trace parameter for temporal difference learning
M White, A White
International Conference on Autonomous Agents & Multiagent Systems, 557-565, 2016
222016
Interval Estimation for Reinforcement-Learning Algorithms in Continuous-State Domains
M White, A White
Advances in Neural Information Processing Systems, 2433–2441, 2010
222010
Investigating practical, linear temporal difference learning
A White, M White
Autonomous Agents and Multiagent Sytems, 2016
212016
An off-policy policy gradient theorem using emphatic weightings
E Imani, E Graves, M White
Advances in Neural Information Processing Systems, 96-106, 2018
202018
Emphatic temporal-difference learning
AR Mahmood, H Yu, M White, RS Sutton
European Workshop on Reinforcement Learning, 2015
202015
Learning a Value Analysis Tool for Agent Evaluation.
M White, MH Bowling
International Joint Conference on Artificial Intelligence, 1976-1981, 2009
182009
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, M Zaheer, A White, A Patterson, M White
International Joint Conference on Artificial Intelligence, 2018
172018
Partition tree weighting
J Veness, M White, M Bowling, A György
2013 Data Compression Conference, 321-330, 2013
172013
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20