The rise and potential of large language model based agents: A survey Z Xi, W Chen, X Guo, W He, Y Ding, B Hong, M Zhang, J Wang, S Jin, ... arXiv preprint arXiv:2309.07864, 2023 | 263 | 2023 |
TextFlint: Unified multilingual robustness evaluation toolkit for natural language processing X Wang, Q Liu, T Gui, Q Zhang, Y Zou, X Zhou, J Ye, Y Zhang, R Zheng, ... Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 112* | 2021 |
Secrets of RLHF in large language models part I: PPO R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Q Liu, ... arXiv preprint arXiv:2307.04964, 2023 | 51* | 2023 |
Flooding-X: Improving BERT’s resistance to adversarial attacks via loss-restricted fine-tuning Q Liu, R Zheng, B Rong, J Liu, Z Liu, Z Cheng, L Qiao, T Gui, Q Zhang, ... Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022 | 27 | 2022 |
From shortcuts to triggers: Backdoor defense with denoised PoE Q Liu, F Wang, C Xiao, M Chen arXiv preprint arXiv:2305.14910, 2023 | 4 | 2023 |
Overview of argumentative text understanding for AI Debater challenge J Yuan, L Cheng, R He, Y Li, L Bing, Z Wei, Q Liu, C Shen, S Zhang, ... Natural Language Processing and Chinese Computing: 10th CCF International …, 2021 | 4 | 2021 |
Test-time backdoor mitigation for black-box large language models with defensive demonstrations W Mo, J Xu, Q Liu, J Wang, J Yan, C Xiao, M Chen arXiv preprint arXiv:2311.09763, 2023 | 3 | 2023 |
Characterizing the impacts of instances on robustness R Zheng, Z Xi, Q Liu, W Lai, T Gui, Q Zhang, XJ Huang, J Ma, Y Shan, ... Findings of the Association for Computational Linguistics: ACL 2023, 2314-2332, 2023 | 3 | 2023 |
PlugAT: A plug and play module to defend against textual adversarial attack R Zheng, R Bao, Q Liu, T Gui, Q Zhang, XJ Huang, R Xie, W Wu Proceedings of the 29th International Conference on Computational …, 2022 | 2 | 2022 |
Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors V Graf, Q Liu, M Chen arXiv preprint arXiv:2404.02356, 2024 | 1 | 2024 |
Monotonic Paraphrasing Improves Generalization of Language Model Prompting Q Liu, F Wang, N Xu, T Yan, T Meng, M Chen arXiv preprint arXiv:2403.16038, 2024 | 1 | 2024 |
Detecting Adversarial Samples through Sharpness of Loss Landscape R Zheng, S Dou, Y Zhou, Q Liu, T Gui, Q Zhang, Z Wei, XJ Huang, ... Findings of the Association for Computational Linguistics: ACL 2023, 11282-11298, 2023 | 1 | 2023 |
Learning to generate representations for novel words: Mimic the OOV situation in training X Xing, M Peng, Q Zhang, Q Liu, X Huang Natural Language Processing and Chinese Computing: 9th CCF International …, 2020 | 1 | 2020 |