PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization; Y Wang*, Z Yu*, Z Zeng, L Yang, C Wang, H Chen, C Jiang, R Xie, ...; arXiv preprint arXiv:2306.05087, 2023 | 73 | 2023 |
TextBox: A unified, modularized, and extensible framework for text generation; J Li, T Tang, G He, J Jiang, X Hu, P Xie, Z Chen, Z Yu, WX Zhao, JR Wen; arXiv preprint arXiv:2101.02046, 2021 | 24 | 2021 |
Exploring vision-language models for imbalanced learning; Y Wang, Z Yu, J Wang, Q Heng, H Chen, W Ye, R Xie, X Xie, S Zhang; International Journal of Computer Vision 132 (1), 224-237, 2024 | 15 | 2024 |
TextBox 2.0: A text generation library with pre-trained language models; T Tang, J Li, Z Chen, Y Hu, Z Yu, W Dai, Z Dong, X Cheng, Y Wang, ...; arXiv preprint arXiv:2212.13005, 2022 | 6 | 2022 |
ElitePLM: An empirical study on general language ability evaluation of pretrained language models; J Li, T Tang, Z Gong, L Yang, Z Yu, Z Chen, J Wang, WX Zhao, JR Wen; arXiv preprint arXiv:2205.01523, 2022 | 5 | 2022 |
KIEval: A Knowledge-grounded Interactive Evaluation Framework for Large Language Models; Z Yu, C Gao, W Yao, Y Wang, W Ye, J Wang, X Xie, Y Zhang, S Zhang; arXiv preprint arXiv:2402.15043, 2024 | 3 | 2024 |
Supervised knowledge makes large language models better in-context learners; L Yang, S Zhang, Z Yu, G Bao, Y Wang, J Wang, R Xu, W Ye, X Xie, ...; arXiv preprint arXiv:2312.15918, 2023 | 3 | 2023 |
CodeShell Technical Report; R Xie, Z Zeng, Z Yu, C Gao, S Zhang, W Ye; arXiv preprint arXiv:2403.15747, 2024 | 1 | 2024 |
LLMTune: Accelerate Database Knob Tuning with Large Language Models; X Huang, H Li, J Zhang, X Zhao, Z Yao, Y Li, Z Yu, T Zhang, H Chen, C Li; arXiv preprint arXiv:2404.11581, 2024 | | 2024 |
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models; Z Yu, C Gao, W Yao, Y Wang, Z Zeng, W Ye, J Wang, Y Zhang, S Zhang; arXiv preprint arXiv:2404.06003, 2024 | | 2024 |