Hengyuan Hu

Cited by

	All	Since 2019
Citations	1929	1835
h-index	14	14
i10-index	15	15

560

280

140

420

2017201820192020202120222023202415 72 144 243 317 410 546 169

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Jakob FoersterAssociate Professor, University of OxfordVerified email at eng.ox.ac.uk
Adam LererFacebook AI ResearchVerified email at fb.com
Noam BrownResearch Scientist, OpenAIVerified email at cs.cmu.edu
Dorsa SadighStanford UniversityVerified email at cs.stanford.edu
Mike LewisFacebook AI ResearchVerified email at fb.com

Hengyuan Hu

Stanford University

Verified email at stanford.edu

reinforcement learning multi-agent


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Network trimming: A data-driven neuron pruning approach towards efficient deep architectures H Hu, R Peng, YW Tai, CK Tang arXiv preprint arXiv:1607.03250, 2016	1044	2016
“Other-Play” for Zero-Shot Coordination H Hu, A Lerer, A Peysakhovich, J Foerster International Conference on Machine Learning, 4399-4410, 2020	155	2020
Human-level play in the game of Diplomacy by combining language models with strategic reasoning Meta Fundamental AI Research Diplomacy Team (FAIR)†, A Bakhtin, ... Science 378 (6624), 1067-1074, 2022	152	2022
Simplified action decoder for deep multi-agent reinforcement learning H Hu, JN Foerster ICLR 2019, 2019	89	2019
Trajectory diversity for zero-shot coordination A Lupu, B Cui, H Hu, J Foerster International Conference on Machine Learning, 7204-7213, 2021	83	2021
Improving policies via search in cooperative partially observable games A Lerer, H Hu, J Foerster, N Brown Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7187-7194, 2020	70	2020
Hierarchical decision making by generating and following natural language instructions H Hu, D Yarats, Q Gong, Y Tian, M Lewis Advances in neural information processing systems 32, 2019	61	2019
Off-belief learning H Hu, A Lerer, B Cui, L Pineda, N Brown, J Foerster International Conference on Machine Learning, 4369-4379, 2021	59	2021
Polygames: Improved zero learning T Cazenave, YC Chen, GW Chen, SY Chen, XD Chiu, J Dehos, M Elsa, ... ICGA Journal 42 (4), 244-256, 2020	43	2020
Modeling strong and human-like gameplay with KL-regularized search AP Jacob, DJ Wu, G Farina, A Lerer, H Hu, A Bakhtin, J Andreas, N Brown International Conference on Machine Learning, 9695-9728, 2022	39	2022
Language instructed reinforcement learning for human-ai coordination H Hu, D Sadigh International Conference on Machine Learning, 13584-13598, 2023	33	2023
K-level Reasoning for Zero-Shot Coordination in Hanabi B Cui, H Hu, L Pineda, J Foerster Advances in Neural Information Processing Systems 34, 8215-8228, 2021	26	2021
Ridge rider: Finding diverse solutions by following eigenvectors of the hessian J Parker-Holder, L Metz, C Resnick, H Hu, A Lerer, A Letcher, ... Advances in Neural Information Processing Systems 33, 753-765, 2020	25	2020
Scalable online planning via reinforcement learning fine-tuning A Fickinger, H Hu, B Amos, S Russell, N Brown Advances in Neural Information Processing Systems 34, 16951-16963, 2021	14	2021
Adversarial Diversity in Hanabi B Cui, A Lupu, S Sokota, H Hu, DJ Wu, JN Foerster The Eleventh International Conference on Learning Representations, 2022	11	2022
Human-AI Coordination via Human-Regularized Search and Learning H Hu, DJ Wu, A Lerer, J Foerster, N Brown arXiv preprint arXiv:2210.05125, 2022	6	2022
A fine-tuning approach to belief state modeling S Sokota, H Hu, DJ Wu, JZ Kolter, JN Foerster, N Brown International Conference on Learning Representations, 2021	6	2021
Learned belief search: Efficiently improving policies in partially observable settings H Hu, A Lerer, N Brown, J Foerster arXiv preprint arXiv:2106.09086, 2021	5	2021
Toward Grounded Social Reasoning M Kwon, H Hu, V Myers, S Karamcheti, A Dragan, D Sadigh arXiv preprint arXiv:2306.08651, 2023	4	2023
Imitation Bootstrapped Reinforcement Learning H Hu, S Mirchandani, D Sadigh arXiv preprint arXiv:2311.02198, 2023	2	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors