Följ
Carol Chen
Carol Chen
Member of Technical Staff
Verifierad e-postadress på anthropic.com - Startsida
Titel
Citeras av
Citeras av
År
Constitutional ai: Harmlessness from ai feedback
Y Bai, S Kadavath, S Kundu, A Askell, J Kernion, A Jones, A Chen, ...
arXiv preprint arXiv:2212.08073, 2022
5992022
Toy models of superposition
N Elhage, T Hume, C Olsson, N Schiefer, T Henighan, S Kravec, ...
arXiv preprint arXiv:2209.10652, 2022
1452022
Towards measuring the representation of subjective global opinions in language models
E Durmus, K Nyugen, TI Liao, N Schiefer, A Askell, A Bakhtin, C Chen, ...
arXiv preprint arXiv:2306.16388, 2023
642023
Mitigating harm in language models with conditional-likelihood filtration
H Ngo, C Raterink, JGM Araújo, I Zhang, C Chen, A Morisot, N Frosst
arXiv preprint arXiv:2108.07790, 2021
292021
Question decomposition improves the faithfulness of model-generated reasoning
A Radhakrishnan, K Nguyen, A Chen, C Chen, C Denison, D Hernandez, ...
arXiv preprint arXiv:2307.11768, 2023
262023
Constitutional AI: harmlessness from AI feedback. 2022
Y Bai, S Kadavath, S Kundu, A Askell, J Kernion, A Jones, A Chen, ...
ArXiv preprint: https://arxiv. org/pdf/2212.08073. pdf, 0
12
Tamera Lanham, Tim Maxwell, Venkatesa Chandrasekaran, Zac Hatfield-Dodds, Jared Kaplan, Jan Brauner, Samuel R
A Radhakrishnan, K Nguyen, A Chen, C Chen, C Denison, D Hernandez, ...
Bowman, and Ethan Perez, 2023
112023
Constitutional AI: Harmlessness from AI feedback (arXiv: 2212.08073). arXiv
Y Bai, S Kadavath, S Kundu, A Askell, J Kernion, A Jones, A Chen, ...
82022
Constitutional AI: Harmlessness from AI Feedback, December 2022
Y Bai, S Kadavath, S Kundu, A Askell, J Kernion, A Jones, A Chen, ...
URL https://www. anthropic. com/constitutional. pdf, 0
7
Predicting twitter engagement with deep language models
M Volkovs, Z Cheng, M Ravaut, H Yang, K Shen, JP Zhou, A Wong, ...
Proceedings of the Recommender Systems Challenge 2020, 38-43, 2020
62020
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–10