Aviv Tamar
Verified email at technion.ac.il - Homepage
Title | Cited by | Year
Multi-agent actor-critic for mixed cooperative-competitive environments
R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch
Advances in Neural Information Processing Systems, 6379-6390, 2017
Cited by: 434 | Year: 2017
Value iteration networks
A Tamar, Y Wu, G Thomas, S Levine, P Abbeel
Advances in Neural Information Processing Systems, 2154-2162, 2016
Cited by: 274 | Year: 2016
Bayesian reinforcement learning: A survey
M Ghavamzadeh, S Mannor, J Pineau, A Tamar
Foundations and Trends® in Machine Learning 8 (5-6), 359-483, 2015
Cited by: 140 | Year: 2015
Constrained policy optimization
J Achiam, D Held, A Tamar, P Abbeel
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
Cited by: 134 | Year: 2017
Policy gradients with variance related risk criteria
D Di Castro, A Tamar, S Mannor
arXiv preprint arXiv:1206.6404, 2012
Cited by: 75 | Year: 2012
Risk-sensitive and robust decision-making: a CVaR optimization approach
Y Chow, A Tamar, S Mannor, M Pavone
Advances in Neural Information Processing Systems, 1522-1530, 2015
Cited by: 69 | Year: 2015
Model-ensemble trust-region policy optimization
T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel
arXiv preprint arXiv:1802.10592, 2018
Cited by: 51 | Year: 2018
Optimizing the CVaR via sampling
A Tamar, Y Glassner, S Mannor
Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
Cited by: 45 | Year: 2015
Learning to route
A Valadarsky, M Schapira, D Shahaf, A Tamar
Proceedings of the 16th ACM Workshop on Hot Topics in Networks, 185-191, 2017
Cited by: 39 | Year: 2017
Learning plannable representations with causal InfoGAN
T Kurutach, A Tamar, G Yang, SJ Russell, P Abbeel
Advances in Neural Information Processing Systems, 8733-8744, 2018
Cited by: 32 | Year: 2018
Scaling up robust MDPs using function approximation
A Tamar, S Mannor, H Xu
International Conference on Machine Learning, 181-189, 2014
Cited by: 29 | Year: 2014
Temporal difference methods for the variance of the reward to go
A Tamar, D Di Castro, S Mannor
International Conference on Machine Learning, 495-503, 2013
Cited by: 26 | Year: 2013
Learning generalized reactive policies using deep neural networks
E Groshev, A Tamar, M Goldstein, S Srivastava, P Abbeel
2018 AAAI Spring Symposium Series, 2018
Cited by: 24 | Year: 2018
Learning robotic assembly from CAD
G Thomas, M Chien, A Tamar, JA Ojea, P Abbeel
2018 IEEE International Conference on Robotics and Automation (ICRA), 1-9, 2018
Cited by: 22 | Year: 2018
Generalized emphatic temporal difference learning: Bias-variance analysis
A Hallak, A Tamar, R Munos, S Mannor
Thirtieth AAAI Conference on Artificial Intelligence, 2016
Cited by: 21 | Year: 2016
Learning the variance of the reward-to-go
A Tamar, D Di Castro, S Mannor
The Journal of Machine Learning Research 17 (1), 361-396, 2016
Cited by: 21 | Year: 2016
Learning from the hindsight plan - episodic MPC improvement
A Tamar, G Thomas, T Zhang, S Levine, P Abbeel
2017 IEEE International Conference on Robotics and Automation (ICRA), 336-343, 2017
Cited by: 19 | Year: 2017
Policy gradient for coherent risk measures
A Tamar, Y Chow, M Ghavamzadeh, S Mannor
Advances in Neural Information Processing Systems, 1468-1476, 2015
Cited by: 19 | Year: 2015
Variance adjusted actor critic algorithms
A Tamar, S Mannor
arXiv preprint arXiv:1310.3697, 2013
Cited by: 16 | Year: 2013