Han Wang
Verified email at ualberta.ca
Title
Cited by
Year
The in-sample softmax for offline reinforcement learning
C Xiao, H Wang, Y Pan, A White, M White
arXiv preprint arXiv:2302.14372, 2023
Cited by 27, Year 2023
Investigating the properties of neural network representations in reinforcement learning
H Wang, E Miahi, M White, MC Machado, Z Abbas, R Kumaraswamy, ...
Artificial Intelligence 330, 104100, 2024
Cited by 21, Year 2024
No more pesky hyperparameters: Offline hyperparameter tuning for RL
H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ...
arXiv preprint arXiv:2205.08716, 2022
Cited by 7, Year 2022
Measuring and mitigating interference in reinforcement learning
V Liu, H Wang, RY Tao, K Javed, A White, M White
Conference on Lifelong Learning Agents, 781-795, 2023
Cited by 3, Year 2023
Replay memory as an empirical MDP: Combining conservative estimation with experience replay
H Zhang, C Xiao, H Wang, J Jin, M Müller
The Eleventh International Conference on Learning Representations, 2023
Cited by 2, Year 2023
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Y Luo, Y Pan, H Wang, P Torr, P Poupart
arXiv preprint arXiv:2403.11062, 2024
Cited by 1, Year 2024
Offline Reinforcement Learning via Tsallis Regularization
L Zhu, MK Schlegel, H Wang, M White
Transactions on Machine Learning Research