Zheng Wen
DeepMind
Verified email at google.com
Title
Cited by
Year
A Tutorial on Thompson Sampling
D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen
arXiv, https://arxiv.org/pdf/1707.02038.pdf
Cited by: 185*
Generalization and exploration via randomized value functions
I Osband, B Van Roy, Z Wen
arXiv preprint arXiv:1402.0635, 2014
Cited by: 135; Year: 2014
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits
B Kveton, Z Wen, A Ashkan, C Szepesvári
International Conference on Artificial Intelligence and Statistics (AISTATS …, 2014
Cited by: 130; Year: 2014
Cascading bandits: Learning to rank in the cascade model
B Kveton, C Szepesvári, Z Wen, A Ashkan
ICML, 2015
Cited by: 127; Year: 2015
Optimal demand response using device based reinforcement learning
Z Wen, D O'Neill, HR Maei
IEEE Transactions on Smart Grid, 2014
Cited by: 116; Year: 2014
Deep exploration via randomized value functions
I Osband, D Russo, Z Wen, B Van Roy
Journal of Machine Learning Research, 2017
Cited by: 70; Year: 2017
Matroid bandits: Fast combinatorial optimization with learning
B Kveton, Z Wen, A Ashkan, H Eydgahi, B Eriksson
UAI 2014, 2014
Cited by: 66; Year: 2014
Efficient learning in large-scale combinatorial semi-bandits
Z Wen, B Kveton, A Ashkan
http://jmlr.org/proceedings/papers/v37/wen15.html, 2014
Cited by: 51; Year: 2014
Combinatorial cascading bandits
B Kveton, Z Wen, A Ashkan, C Szepesvári
Advances in Neural Information Processing Systems, 1450-1458, 2015
Cited by: 50; Year: 2015
Online influence maximization under independent cascade model with semi-bandit feedback
Z Wen, B Kveton, M Valko, S Vaswani
Advances in Neural Information Processing Systems, 3022-3032, 2017
Cited by: 47*; Year: 2017
DCM Bandits: Learning to Rank with Multiple Clicks
S Katariya, B Kveton, C Szepesvári, Z Wen
arXiv, 2016
Cited by: 47; Year: 2016
Optimal Greedy Diversity for Recommendation
A Ashkan, B Kveton, S Berkovsky, Z Wen
Cited by: 46; Year: 2015
Cascading bandits for large-scale recommendation problems
S Zong, H Ni, K Sung, NR Ke, Z Wen, B Kveton
arXiv preprint arXiv:1603.05359, 2016
Cited by: 42; Year: 2016
Online learning to rank in stochastic click models
M Zoghi, T Tunys, M Ghavamzadeh, B Kveton, C Szepesvári, Z Wen
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
Cited by: 38; Year: 2017
Adaptive submodular maximization in bandit setting
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
Advances in Neural Information Processing Systems, 2697-2705, 2013
Cited by: 33; Year: 2013
Recommendation system based on collaborative filtering
Z Wen
CS229 Lecture Notes, 2008
Cited by: 33; Year: 2008
Stochastic rank-1 bandits
S Katariya, B Kveton, C Szepesvári, C Vernade, Z Wen
arXiv preprint arXiv:1608.03023, 2016
Cited by: 27; Year: 2016
Efficient Exploration and Value Function Generalization in Deterministic Systems
Z Wen, B Van Roy
Advances in Neural Information Processing Systems, 3021-3029, 2013
Cited by: 24; Year: 2013
Diffusion independent semi-bandit influence maximization
S Vaswani, B Kveton, Z Wen, M Ghavamzadeh, L Lakshmanan, ...
Proceedings of the 34th International Conference on Machine Learning (ICML), 2017
Cited by: 22*; Year: 2017
On the disturbance response and external stability of a saturating static-feedback-controlled double integrator
Z Wen, S Roy, A Saberi
Automatica 44 (8), 2191-2196, 2008
Cited by: 21; Year: 2008