Ishan Durugkar
Ishan Durugkar
Verifierad e-postadress på cs.utexas.edu - Startsida
Titel
Citeras av
Citeras av
År
Generative Multi-Adversarial Networks
I Durugkar, I Gemp, S Mahadevan
International Conference on Learning Representations, 2017, 2017
2452017
Go for a walk and arrive at the answer: Reasoning over paths in knowledge bases using reinforcement learning
R Das, S Dhuliawala, M Zaheer, L Vilnis, I Durugkar, A Krishnamurthy, ...
arXiv preprint arXiv:1711.05851, 2017
1952017
Cohort intelligence: a self supervised learning behavior
AJ Kulkarni, IP Durugkar, M Kumar
2013 IEEE international conference on systems, man, and cybernetics, 1396-1400, 2013
862013
Predictive Off-Policy Policy Evaluation for Nonstationary Decision Problems, with Applications to Digital Marketing.
PS Thomas, G Theocharous, M Ghavamzadeh, I Durugkar, E Brunskill
AAAI, 4740-4745, 2017
242017
Deep reinforcement learning with macro-actions
IP Durugkar, C Rosenbaum, S Dernbach, S Mahadevan
arXiv preprint arXiv:1606.04615, 2016
142016
TD learning with constrained gradients
I Durugkar, P Stone
62018
Balancing individual preferences and shared objectives in multiagent reinforcement learning
I Durugkar, E Liebman, P Stone
Good Systems-Published Research, 2020
32020
Reducing sampling error in batch temporal difference learning
B Pavse, I Durugkar, J Hanna, P Stone
International Conference on Machine Learning, 7543-7552, 2020
22020
Unmixing in the presence of nuisances with deep generative models
M Parente, I Gemp, I Durugkar
2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS …, 2017
22017
Inverting variational autoencoders for improved generative accuracy
I Gemp, I Durugkar, M Parente, MD Dyar, S Mahadevan
arXiv preprint arXiv:1608.05983, 2016
22016
An imitation from observation approach to sim-to-real transfer
S Desai, I Durugkar, H Karnan, G Warnell, J Hanna, P Stone
arXiv preprint arXiv:2008.01594, 2020
12020
An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch
S Desai, I Durugkar, H Karnan, G Warnell, J Hanna, P Stone, AI Sony
Advances in Neural Information Processing Systems 33, 2020
12020
Multi-Preference Actor Critic
I Durugkar, M Hausknecht, A Swaminathan, P MacAlpine
arXiv preprint arXiv:1904.03295, 2019
12019
Adversarial goal generation for intrinsic motivation
I Durugkar, P Stone
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
12018
HR-TD: A Regularized TD Method to Avoid Over-Generalization
I Durugkar, B Liu, P Stone
2018
REASONING OVER PATHS IN KNOWLEDGE BASES USING REINFORCEMENT LEARNING
R Das, S Dhuliawala, M Zaheer, L Vilnis, I Durugkar, A Krishnamurthy, ...
arXiv preprint arXiv:1711.05851, 2017
2017
Deep Generative Models for Spectroscopic Analysis on Mars
I Gemp, I Durugkar, M Parente, S Mahadevan
CoRR, 2016
2016
Balancing Individual Preferences and Shared Objectives in Multiagent Reinforcement Learning Download PDF
I Durugkar, E Liebman, P Stone
ON SAMPLING ERROR IN BATCH ACTION-VALUE PREDICTION ALGORITHMS
BS Pavse, JP Hanna, I Durugkar, P Stone
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–19