Language is not all you need: Aligning perception with language models S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... arXiv preprint arXiv:2302.14045, 2023 | 154 | 2023 |
Bilingual lexicon induction with semi-supervision in non-isometric embedding spaces B Patra, JRA Moniz, S Garg, MR Gormley, G Neubig arXiv preprint arXiv:1908.06625, 2019 | 113 | 2019 |
A survey of community question answering B Patra arXiv preprint arXiv:1705.04009, 2017 | 27 | 2017 |
A length-extrapolatable transformer Y Sun, L Dong, B Patra, S Ma, S Huang, A Benhaim, V Chaudhary, ... arXiv preprint arXiv:2212.10554, 2022 | 26 | 2022 |
On the representation collapse of sparse mixture of experts Z Chi, L Dong, S Huang, D Dai, S Ma, B Patra, S Singhal, P Bajaj, X Song, ... Advances in Neural Information Processing Systems 35, 34600-34613, 2022 | 21 | 2022 |
Foundation transformers H Wang, S Ma, S Huang, L Dong, W Wang, Z Peng, Y Wu, P Bajaj, ... arXiv preprint arXiv:2210.06423, 2022 | 16 | 2022 |
Constrained BERT BiLSTM CRF for understanding multi-sentence entity-seeking questions D Contractor, B Patra, P Singla Natural Language Engineering 27 (1), 65-87, 2021 | 14 | 2021 |
Invariant language modeling M Peyrard, SS Ghotra, M Josifoski, V Agarwal, B Patra, D Carignan, ... arXiv preprint arXiv:2110.08413, 2021 | 9 | 2021 |
Beyond english-centric bitexts for better multilingual language representation learning B Patra, S Singhal, S Huang, Z Chi, L Dong, F Wei, V Chaudhary, X Song arXiv preprint arXiv:2210.14867, 2022 | 7 | 2022 |
TorchScale: Transformers at scale S Ma, H Wang, S Huang, W Wang, Z Chi, L Dong, A Benhaim, B Patra, ... arXiv preprint arXiv:2211.13184, 2022 | 5 | 2022 |
On efficiently acquiring annotations for multilingual models JRA Moniz, B Patra, MR Gormley arXiv preprint arXiv:2204.01016, 2022 | 5 | 2022 |
ScopeIt: Scoping task relevant sentences in documents B Patra, V Suryanarayanan, C Fufa, P Bhattacharya, CC Lee Proceedings of the 28th International Conference on Computational …, 2020 | 5* | 2020 |
Compression and localization in reinforcement learning for atari games JRA Moniz, B Patra, S Garg arXiv preprint arXiv:1904.09489, 2019 | 4 | 2019 |
Artificial intelligence for identifying relevant content related to specific tasks P Bhattacharya, B Patra, CY Lee, V Suryanarayanan, CF Fufa US Patent 11,354,500, 2022 | 3 | 2022 |
Weakly supervised attention networks for entity recognition B Patra, JRA Moniz Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019 | 3 | 2019 |
The SUMEval 2022 Shared Task on Performance Prediction of Multilingual Pre-trained Language Models K Ahuja, A Anastasopoulos, B Patra, G Neubig, M Choudhury, ... Proceedings of the First Workshop on Scaling Up Multilingual Evaluation, 1-7, 2022 | 2 | 2022 |
To schedule or not to schedule: extracting task specific temporal entities and associated negation constraints B Patra, C Fufa, P Bhattacharya, C Lee arXiv preprint arXiv:2012.02594, 2020 | 2 | 2020 |
Everything you need to know about multilingual LLMs: Towards fair, performant and reliable models for languages of the world S Sitaram, M Choudhury, B Patra, V Chaudhary, K Ahuja, K Bali Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 1 | 2023 |
Language Model Decoding as Likelihood-Utility Alignment M Josifoski, M Peyrard, F Rajic, J Wei, D Paul, V Hartmann, B Patra, ... arXiv preprint arXiv:2210.07228, 2022 | 1 | 2022 |
Understanding complex multi-sentence entity seeking questions D Contractor, B Patra, PS Mausam, P Singla AAAI, 2019 | 1 | 2019 |