Följ
Yacine Jernite
Yacine Jernite
Research Scientist, HuggingFace
Verifierad e-postadress på cs.nyu.edu - Startsida
Titel
Citeras av
Citeras av
År
Transformers: State-of-the-art natural language processing
T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, P Cistac, ...
Proceedings of the 2020 conference on empirical methods in natural language …, 2020
41202020
Huggingface's transformers: State-of-the-art natural language processing
T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, P Cistac, ...
arXiv preprint arXiv:1910.03771, 2019
27412019
Character-aware neural language models
Y Kim, Y Jernite, D Sontag, A Rush
Proceedings of the AAAI conference on artificial intelligence 30 (1), 2016
21012016
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
11242023
KILT: a benchmark for knowledge intensive language tasks
F Petroni, A Piktus, A Fan, P Lewis, M Yazdani, N De Cao, J Thorne, ...
arXiv preprint arXiv:2009.02252, 2020
3892020
ELI5: Long form question answering
A Fan, Y Jernite, E Perez, D Grangier, J Weston, M Auli
arXiv preprint arXiv:1907.09190, 2019
3892019
Creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning
S Horng, DA Sontag, Y Halpern, Y Jernite, NI Shapiro, LA Nathanson
PloS one 12 (4), e0174708, 2017
2822017
Starcoder: may the source be with you!
R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ...
arXiv preprint arXiv:2305.06161, 2023
2762023
Lysandre Debut, Stas Bekman, Pierric Cistac, Thibault Goehringer, Victor Mustar, François Lagunas, Alexander Rush, and Thomas Wolf. 2021. Datasets: A community library for …
Q Lhoest, AV Del Moral, Y Jernite, A Thakur, P Von Platen, S Patil, ...
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
1972021
Datasets: A community library for natural language processing
Q Lhoest, AV del Moral, Y Jernite, A Thakur, P von Platen, S Patil, ...
arXiv preprint arXiv:2109.02846, 2021
1802021
The gem benchmark: Natural language generation, its evaluation and metrics
S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, ...
arXiv preprint arXiv:2102.01672, 2021
1312021
The stack: 3 tb of permissively licensed source code
D Kocetkov, R Li, LB Allal, J Li, C Mou, CM Ferrandis, Y Jernite, M Mitchell, ...
arXiv preprint arXiv:2211.15533, 2022
1202022
SantaCoder: don't reach for the stars!
LB Allal, R Li, D Kocetkov, C Mou, C Akiki, CM Ferrandis, N Muennighoff, ...
arXiv preprint arXiv:2301.03988, 2023
1122023
Discourse-based objectives for fast unsupervised sentence representation learning
Y Jernite, SR Bowman, D Sontag
arXiv preprint arXiv:1705.00557, 2017
1052017
The bigscience roots corpus: A 1.6 tb composite multilingual dataset
H Laurençon, L Saulnier, T Wang, C Akiki, A Villanova del Moral, ...
Advances in Neural Information Processing Systems 35, 31809-31826, 2022
1012022
Quality at a glance: An audit of web-crawled multilingual datasets
J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ...
Transactions of the Association for Computational Linguistics 10, 50-72, 2022
862022
Stable bias: Analyzing societal representations in diffusion models
AS Luccioni, C Akiki, M Mitchell, Y Jernite
arXiv preprint arXiv:2303.11408, 2023
762023
Nisansa de Silva
J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ...
Sakine Çabuk Ballı, Stella Biderman, Alessia Battisti, Ahmed Baruwa, Ankur …, 2022
762022
Variable computation in recurrent neural networks
Y Jernite, E Grave, A Joulin, T Mikolov
arXiv preprint arXiv:1611.06188, 2016
722016
Huggingface’s transformers: State-of-the-art natural language processing. arXiv
T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, P Cistac, ...
arXiv preprint arXiv:1910.03771, 2019
682019
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20