Ishan Durugkar
Ishan Durugkar
Verifierad e-postadress på cs.utexas.edu - Startsida
Titel
Citeras av
Citeras av
År
Generative Multi-Adversarial Networks
I Durugkar, I Gemp, S Mahadevan
International Conference on Learning Representations, 2017, 2017
2732017
Go for a walk and arrive at the answer: Reasoning over paths in knowledge bases using reinforcement learning
R Das, S Dhuliawala, M Zaheer, L Vilnis, I Durugkar, A Krishnamurthy, ...
arXiv preprint arXiv:1711.05851, 2017
2432017
Cohort intelligence: a self supervised learning behavior
AJ Kulkarni, IP Durugkar, M Kumar
2013 IEEE international conference on systems, man, and cybernetics, 1396-1400, 2013
892013
Predictive off-policy policy evaluation for nonstationary decision problems, with applications to digital marketing
PS Thomas, G Theocharous, M Ghavamzadeh, I Durugkar, E Brunskill
Twenty-Ninth IAAI Conference, 2017
302017
Deep reinforcement learning with macro-actions
IP Durugkar, C Rosenbaum, S Dernbach, S Mahadevan
arXiv preprint arXiv:1606.04615, 2016
182016
TD learning with constrained gradients
I Durugkar, P Stone
82018
An imitation from observation approach to transfer learning with dynamics mismatch
S Desai, I Durugkar, H Karnan, G Warnell, J Hanna, P Stone
arXiv preprint arXiv:2008.01594, 2020
62020
Unmixing in the presence of nuisances with deep generative models
M Parente, I Gemp, I Durugkar
2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS …, 2017
52017
Balancing individual preferences and shared objectives in multiagent reinforcement learning
I Durugkar, E Liebman, P Stone
Good Systems-Published Research, 2020
42020
Reducing sampling error in batch temporal difference learning
B Pavse, I Durugkar, J Hanna, P Stone
International Conference on Machine Learning, 7543-7552, 2020
32020
Inverting variational autoencoders for improved generative accuracy
I Gemp, I Durugkar, M Parente, MD Dyar, S Mahadevan
arXiv preprint arXiv:1608.05983, 2016
22016
An imitation from observation approach to sim-to-real transfer
S Desai, I Durugkar, H Karnan, G Warnell, J Hanna, P Stone
arXiv e-prints, arXiv: 2008.01594, 2020
12020
Multi-Preference Actor Critic
I Durugkar, M Hausknecht, A Swaminathan, P MacAlpine
arXiv preprint arXiv:1904.03295, 2019
12019
Adversarial goal generation for intrinsic motivation
I Durugkar, P Stone
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
12018
Adversarial Intrinsic Motivation for Reinforcement Learning
I Durugkar, M Tec, S Niekum, P Stone
arXiv preprint arXiv:2105.13345, 2021
2021
HR-TD: A Regularized TD Method to Avoid Over-Generalization
I Durugkar, B Liu, P Stone
2018
REASONING OVER PATHS IN KNOWLEDGE BASES USING REINFORCEMENT LEARNING
R Das, S Dhuliawala, M Zaheer, L Vilnis, I Durugkar, A Krishnamurthy, ...
arXiv preprint arXiv:1711.05851, 2017
2017
Deep Generative Models for Spectroscopic Analysis on Mars
I Gemp, I Durugkar, M Parente, S Mahadevan
CoRR, 2016
2016
An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch
I Durugkar, S Desai, H Karnan, G Warnell, J Hanna, P Stone
UT Austin Villa 2019 Team Report
S Desai, I Durugkar, K Genter, H Goyal, J Hanna, E Liebman, J Manashe, ...
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20