Follow
Guillaume Wenzek
Guillaume Wenzek
Verified email at fb.com
Title
Cited by
Cited by
Year
Unsupervised cross-lingual representation learning at scale
A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ...
arXiv preprint arXiv:1911.02116, 2019
51492019
Beyond english-centric multilingual machine translation
A Fan, S Bhosale, H Schwenk, Z Ma, A El-Kishky, S Goyal, M Baines, ...
Journal of Machine Learning Research 22 (107), 1-48, 2021
6242021
CCNet: Extracting high quality monolingual datasets from web crawl data
G Wenzek, MA Lachaux, A Conneau, V Chaudhary, F Guzmán, A Joulin, ...
arXiv preprint arXiv:1911.00359, 2019
4812019
No language left behind: Scaling human-centered machine translation
MR Costa-jussŕ, J Cross, O Çelebi, M Elbayad, K Heafield, K Heffernan, ...
arXiv preprint arXiv:2207.04672, 2022
3962022
The flores-101 evaluation benchmark for low-resource and multilingual machine translation
N Goyal, C Gao, V Chaudhary, PJ Chen, G Wenzek, D Ju, S Krishnan, ...
Transactions of the Association for Computational Linguistics 10, 522-538, 2022
3022022
CCMatrix: Mining billions of high-quality parallel sentences on the web
H Schwenk, G Wenzek, S Edunov, E Grave, A Joulin
arXiv preprint arXiv:1911.04944, 2019
1882019
Trans-gram, Fast Cross-lingual Word-embeddings
J Coulmance, JM Marty, G Wenzek, A Benhalloum
Proceedings of the 2015 Conference on Empirical Methods in Natural Language …, 2015
1122015
Unsupervised cross-lingual representation learning at scale. CoRR abs/1911.02116 (2019)
A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ...
URL: http://arxiv. org/abs/1911.02116, 1911
691911
Generating fact checking briefs
A Fan, A Piktus, F Petroni, G Wenzek, M Saeidi, A Vlachos, A Bordes, ...
arXiv preprint arXiv:2011.05448, 2020
512020
Unsupervised cross-lingual representation learning at scale. arXiv 2019
A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ...
arXiv preprint arXiv:1911.02116, 1911
511911
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ...
arXiv preprint arXiv:2308.11596, 2023
342023
Facebook AI's WAT19 Myanmar-English translation task submission
PJ Chen, J Shen, M Le, V Chaudhary, A El-Kishky, G Wenzek, M Ott, ...
arXiv preprint arXiv:1910.06848, 2019
262019
Findings of the WMT 2021 shared task on large-scale multilingual machine translation
G Wenzek, V Chaudhary, A Fan, S Gomez, N Goyal, S Jain, D Kiela, ...
Proceedings of the Sixth Conference on Machine Translation, 89-99, 2021
182021
Findings of the WMT’22 shared task on large-scale machine translation evaluation for African languages
D Adelani, MMI Alam, A Anastasopoulos, A Bhagia, MR Costa-jussŕ, ...
Proceedings of the Seventh Conference on Machine Translation (WMT), 773-800, 2022
132022
Seamless: Multilingual Expressive and Streaming Speech Translation
L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, M Duppenthaler, ...
arXiv preprint arXiv:2312.05187, 2023
122023
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training?
S Zhang, V Chaudhary, N Goyal, J Cross, G Wenzek, M Bansal, ...
arXiv preprint arXiv:2204.14268, 2022
112022
Analyse d’opinions de tweets par réseaux de neurones convolutionnels
JM Marty, G Wenzek, E Schmitt, J Coulmance
Actes de DEFT, Caen, France: TALN, 2015
52015
stopes-Modular Machine Translation Pipelines
P Andrews, G Wenzek, K Heffernan, O Çelebi, A Sun, A Kamran, Y Guo, ...
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022
12022
Method for automatically constructing inter-language queries for a search engine
G Wenzek, J Coulmance, JM Marty
US Patent 11,055,370, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–19