Long Ouyang
OpenAI
Verified email at openai.com
Title
Cited by
Year
Training language models to follow instructions with human feedback
L Ouyang, J Wu, X Jiang, D Almeida, C Wainwright, P Mishkin, C Zhang, ...
Advances in neural information processing systems 35, 27730-27744, 2022
Cited by: 6148; Year: 2022
Learning to summarize from human feedback
N Stiennon, L Ouyang, J Wu, D Ziegler, R Lowe, C Voss, A Radford, ...
Advances in Neural Information Processing Systems 33, 3008-3021, 2020
Cited by: 1022; Year: 2020
GPT-4 technical report
J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ...
arXiv preprint arXiv:2303.08774, 2023
Cited by: 806; Year: 2023
WebGPT: Browser-assisted question-answering with human feedback
R Nakano, J Hilton, S Balaji, J Wu, L Ouyang, C Kim, C Hesse, S Jain, ...
arXiv preprint arXiv:2112.09332, 2021
Cited by: 680; Year: 2021
Recursively summarizing books with human feedback
J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano
arXiv preprint arXiv:2109.10862, 2021
Cited by: 186; Year: 2021
Improving image generation with better captions
J Betker, G Goh, L Jing, T Brooks, J Wang, L Li, L Ouyang, J Zhuang, ...
Computer Science. https://cdn.openai.com/papers/dall-e-3.pdf 2 (3), 8, 2023
Cited by: 177; Year: 2023
Training language models to follow instructions with human feedback, 2022
L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ...
URL https://arxiv.org/abs/2203.02155 13, 1, 2022
Cited by: 152; Year: 2022
Self-critiquing models for assisting human evaluators
W Saunders, C Yeh, J Wu, S Bills, L Ouyang, J Ward, J Leike
arXiv preprint arXiv:2206.05802, 2022
Cited by: 118; Year: 2022
Training language models to follow instructions with human feedback. arXiv
L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ...
arXiv preprint arXiv:2203.02155, 2022
Cited by: 65; Year: 2022
Training language models to follow instructions with human feedback. arXiv 2022
L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ...
arXiv preprint arXiv:2203.02155 10, 0
Cited by: 32
Practical optimal experiment design with probabilistic programs
L Ouyang, MH Tessler, D Ly, N Goodman
arXiv preprint arXiv:1608.05046, 2016
Cited by: 22; Year: 2016
Semantic coherence facilitates distributional learning
L Ouyang, L Boroditsky, MC Frank
Cognitive science 41, 855-884, 2017
Cited by: 19; Year: 2017
webppl-oed: A practical optimal experiment design system.
L Ouyang, MH Tessler, D Ly, ND Goodman
CogSci, 2018
Cited by: 8; Year: 2018
Learning to summarize from human feedback, 2020
N Stiennon, L Ouyang, J Wu, DM Ziegler, R Lowe, C Voss, A Radford, ...
URL https://arxiv.org/abs, 2009
Cited by: 8; Year: 2009
Fabular: Regression formulas as probabilistic programming
J Borgström, AD Gordon, L Ouyang, C Russo, A Ścibior, M Szymczak
Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of …, 2016
Cited by: 7; Year: 2016
Recursively summarizing books with human feedback, 2021
J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano
URL https://arxiv.org/abs/2109.10862, 0
Cited by: 7
Semantic coherence facilitates distributional learning of word meanings
L Ouyang, L Boroditsky, M Frank
Proceedings of the Annual Meeting of the Cognitive Science Society 34 (34), 2012
Cited by: 3; Year: 2012
Bayesian inference of regular expressions from human-generated example strings
L Ouyang
arXiv preprint arXiv:1805.08427, 2018
Cited by: 2; Year: 2018
Pedagogical learning
L Ouyang, MC Frank
arXiv preprint arXiv:1711.09401, 2017
Cited by: 1; Year: 2017
The Effect of Learning on Learning
L Ouyang
Stanford University, 2015
Year: 2015
Articles 1–20