Pablo Hernandez-Leal
Pablo Hernandez-Leal
Researcher - Borealis AI
Verified email at borealisai.com - Homepage
Title
Cited by
Cited by
Year
A survey and critique of multiagent deep reinforcement learning
P Hernandez-Leal, B Kartal, ME Taylor
Autonomous Agents and Multi-Agent Systems 33 (6), 750-797, 2019
194*2019
A survey of learning in multiagent environments: Dealing with non-stationarity
P Hernandez-Leal, M Kaisers, T Baarslag, EM de Cote
arXiv preprint arXiv:1707.09183, 2017
1212017
Multi-label classification with Bayesian network-based chain classifiers
LE Sucar, C Bielza, EF Morales, P Hernandez-Leal, JH Zaragoza, ...
Pattern Recognition Letters 41, 14-22, 2014
1152014
Local energy markets: Paving the path toward fully transactive energy systems
F Lezama, J Soares, P Hernandez-Leal, M Kaisers, T Pinto, Z Vale
IEEE Transactions on Power Systems 34 (5), 4081-4088, 2018
982018
Stress modelling and prediction in presence of scarce data
A Maxhuni, P Hernandez-Leal, LE Sucar, V Osmani, EF Morales, ...
Journal of biomedical informatics 63, 344-356, 2016
402016
Efficiently detecting switches against non-stationary opponents
P Hernandez-Leal, Y Zhan, ME Taylor, LE Sucar, EM de Cote
Autonomous Agents and Multi-Agent Systems 31 (4), 767-789, 2017
252017
Towards a fast detection of opponents in repeated stochastic games
P Hernandez-Leal, M Kaisers
International Conference on Autonomous Agents and Multiagent Systems, 239-257, 2017
232017
Identifying and tracking switching, non-stationary opponents: A Bayesian approach
P Hernandez-Leal, ME Taylor, BS Rosman, LE Sucar, E Munoz de Cote
Association for the Advancement of Artificial Intelligence (AAAI), 2016
232016
Learning temporal nodes Bayesian networks
P Hernandez-Leal, JA Gonzalez, EF Morales, LE Sucar
International journal of approximate reasoning 54 (8), 956-977, 2013
232013
InstanceRank based on borders for instance selection
P Hernandez-Leal, JA Carrasco-Ochoa, JF Martínez-Trinidad, ...
Pattern recognition 46 (1), 365-375, 2013
232013
Agent modeling as auxiliary task for deep reinforcement learning
P Hernandez-Leal, B Kartal, ME Taylor
Proceedings of the AAAI Conference on Artificial Intelligence and …, 2019
212019
A framework for learning and planning against switching strategies in repeated games
P Hernandez-Leal, E Munoz de Cote, LE Sucar
Connection Science 26 (2), 103-122, 2014
192014
Learning against sequential opponents in repeated stochastic games
P Hernandez-Leal, M Kaisers
The 3rd Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2017
162017
Skynet: A top deep RL agent in the inaugural pommerman team competition
C Gao, P Hernandez-Leal, B Kartal, ME Taylor
arXiv preprint arXiv:1905.01360, 2019
152019
Uncertainty-aware action advising for deep reinforcement learning agents
FL Da Silva, P Hernandez-Leal, B Kartal, ME Taylor
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5792-5799, 2020
142020
An exploration strategy for non-stationary opponents
P Hernandez-Leal, Y Zhan, ME Taylor, LE Sucar, EM de Cote
Autonomous Agents and Multi-Agent Systems 31 (5), 971-1002, 2017
142017
Discovering human immunodeficiency virus mutational pathways using temporal Bayesian networks
P Hernandez-Leal, A Rios-Flores, S Ávila-Rios, G Reyes-Terán, ...
Artificial intelligence in medicine 57 (3), 185-195, 2013
122013
Terminal prediction as an auxiliary task for deep reinforcement learning
B Kartal, P Hernandez-Leal, ME Taylor
Proceedings of the AAAI Conference on Artificial Intelligence and …, 2019
112019
Action Guidance with MCTS for Deep Reinforcement Learning
B Kartal, P Hernandez-Leal, ME Taylor
Proceedings of the AAAI Conference on Artificial Intelligence and …, 2019
102019
Using Monte Carlo tree search as a demonstrator within asynchronous deep RL
B Kartal, P Hernandez-Leal, ME Taylor
arXiv preprint arXiv:1812.00045, 2018
102018
The system can't perform the operation now. Try again later.
Articles 1–20