Bowen Shi
Facebook AI Research
Verified email at meta.com
Title
Cited by
Year
Learning audio-visual speech representation by masked multimodal cluster prediction
B Shi, WN Hsu, K Lakhotia, A Mohamed
arXiv preprint arXiv:2201.02184, 2022
Cited by: 197, Year: 2022
Scaling speech technology to 1,000+ languages
V Pratap, A Tjandra, B Shi, P Tomasello, A Babu, S Kundu, A Elkahky, ...
Journal of Machine Learning Research 25 (97), 1-52, 2024
Cited by: 104, Year: 2024
Robust self-supervised audio-visual speech recognition
B Shi, WN Hsu, A Mohamed
arXiv preprint arXiv:2201.01763, 2022
Cited by: 86, Year: 2022
Offloading guidelines for augmented reality applications on wearable devices
B Shi, J Yang, Z Huang, P Hui
Proceedings of the 23rd ACM international conference on Multimedia, 1271-1274, 2015
Cited by: 80, Year: 2015
American Sign Language fingerspelling recognition in the wild
B Shi, AM Del Rio, J Keane, J Michaux, D Brentari, G Shakhnarovich, ...
2018 IEEE Spoken Language Technology Workshop (SLT), 145-152, 2018
Cited by: 77, Year: 2018
Voicebox: Text-guided multilingual universal speech generation at scale
M Le, A Vyas, B Shi, B Karrer, L Sari, R Moritz, M Williamson, V Manohar, ...
Advances in neural information processing systems 36, 2024
Cited by: 69, Year: 2024
Few-shot acoustic event detection via meta learning
B Shi, M Sun, KC Puvvada, CC Kao, S Matsoukas, C Wang
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 64, Year: 2020
Fingerspelling recognition in the wild with iterative visual attention
B Shi, AMD Rio, J Keane, D Brentari, G Shakhnarovich, K Livescu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Cited by: 64, Year: 2019
Scaling autoregressive multi-modal models: Pretraining and instruction tuning
L Yu, B Shi, R Pasunuru, B Muller, O Golovneva, T Wang, A Babu, B Tang, ...
arXiv preprint arXiv:2309.02591, 2023
Cited by: 53, Year: 2023
Comparative layer-wise analysis of self-supervised speech models
A Pasad, B Shi, K Livescu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 42, Year: 2023
A cross-task analysis of text span representations
S Toshniwal, H Shi, B Shi, L Gao, K Livescu, K Gimpel
arXiv preprint arXiv:2006.03866, 2020
Cited by: 37, Year: 2020
Fingerspelling detection in American Sign Language
B Shi, D Brentari, G Shakhnarovich, K Livescu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 27, Year: 2021
u-HuBERT: Unified mixed-modal speech pretraining and zero-shot transfer to unlabeled modality
WN Hsu, B Shi
Advances in Neural Information Processing Systems 35, 21157-21170, 2022
Cited by: 21, Year: 2022
Open-domain sign language translation learned from online video
B Shi, D Brentari, G Shakhnarovich, K Livescu
arXiv preprint arXiv:2205.12870, 2022
Cited by: 20, Year: 2022
Semi-supervised acoustic event detection based on tri-training
B Shi, M Sun, CC Kao, V Rozgic, S Matsoukas, C Wang
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 20, Year: 2019
Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition
B Shi, K Livescu
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
Cited by: 20, Year: 2017
Compression of acoustic event detection models with low-rank matrix factorization and quantization training
B Shi, M Sun, CC Kao, V Rozgic, S Matsoukas, C Wang
arXiv preprint arXiv:1905.00855, 2019
Cited by: 16, Year: 2019
MuAViC: A multilingual audio-visual corpus for robust speech recognition and robust speech-to-text translation
M Anwar, B Shi, V Goswami, WN Hsu, J Pino, C Wang
arXiv preprint arXiv:2303.00628, 2023
Cited by: 13, Year: 2023
Expresso: A benchmark and analysis of discrete expressive speech resynthesis
TA Nguyen, WN Hsu, A d'Avirro, B Shi, I Gat, M Fazel-Zarani, T Remez, ...
arXiv preprint arXiv:2308.05725, 2023
Cited by: 12, Year: 2023
Compression of acoustic event detection models with quantized distillation
B Shi, M Sun, CC Kao, V Rozgic, S Matsoukas, C Wang
arXiv preprint arXiv:1907.00873, 2019
Cited by: 12, Year: 2019