Jean Harb

Cited by

	All	Since 2019
Citations	6289	5772
h-index	9	9
i10-index	9	9

Citations per year: 2017: 79; 2018: 264; 2019: 535; 2020: 799; 2021: 998; 2022: 1296; 2023: 1669; 2024: 474

Public access (based on funding mandates): 2 articles available, 0 articles not available

Co-authors

Doina Precup	DeepMind and McGill University	Verified email at cs.mcgill.ca
Pierre-Luc Bacon	University of Montreal	Verified email at mila.quebec
Pieter Abbeel	UC Berkeley | Covariant	Verified email at cs.berkeley.edu
Yi Wu	Institute for Interdisciplinary Information Sciences, Tsinghua University	Verified email at mail.tsinghua.edu.cn
Ryan Lowe	OpenAI	Verified email at openai.com
Igor Mordatch	Google DeepMind	Verified email at google.com
Aviv Tamar	Technion	Verified email at technion.ac.il

Jean Harb

OpenAI

Verified email at openai.com

Machine Learning, Reinforcement Learning, Deep Learning


Title	Authors	Venue	Cited by	Year
Multi-agent actor-critic for mixed cooperative-competitive environments	R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch	Advances in Neural Information Processing Systems 30, 2017	4579	2017
The option-critic architecture	PL Bacon, J Harb, D Precup	Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	1189	2017
Investigating recurrence and eligibility traces in deep Q-networks	J Harb, D Precup	arXiv preprint arXiv:1704.05495, 2017	231	2017
When waiting is not an option: Learning options with a deliberation cost	J Harb, PL Bacon, M Klissarov, D Precup	Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	153	2018
Learnings options end-to-end for continuous action tasks	M Klissarov, PL Bacon, J Harb, D Precup	arXiv preprint arXiv:1712.00004, 2017	57	2017
Policy evaluation networks	J Harb, T Schaul, D Precup, PL Bacon	arXiv preprint arXiv:2002.11833, 2020	39	2020
Waymax: An accelerated, data-driven simulator for large-scale autonomous driving research	C Gulino, J Fu, W Luo, G Tucker, E Bronstein, Y Lu, J Harb, X Pan, ...	Advances in Neural Information Processing Systems 36, 2024	17	2024
The Barbados 2018 list of open issues in continual learning	T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ...	arXiv preprint arXiv:1811.07004, 2018	13	2018
General policy evaluation and improvement by learning to identify few but crucial states	F Faccio, A Ramesh, V Herrmann, J Harb, J Schmidhuber	arXiv preprint arXiv:2207.01566, 2022	10	2022
Learning options in deep reinforcement learning	J Merheb-Harb	McGill University (Canada), 2016	1	2016
Asynchronous Advantage Option-Critic with Deliberation Cost	J Harb, PL Bacon, D Precup	RLDM, 2017		2017
