A survey on evaluation of large language models
Y Chang, X Wang, J Wang, Y Wu, L Yang, K Zhu, H Chen, X Yi, C Wang, ...
ACM TIST, 2024
On the robustness of ChatGPT: An adversarial and out-of-distribution perspective
J Wang, X Hu, W Hou, H Chen, R Zheng, Y Wang, L Yang, H Huang, ...
ICLR 2023 Workshop, 2023
PromptBench: Towards evaluating the robustness of large language models on adversarial prompts
K Zhu, J Wang, J Zhou, Z Wang, H Chen, Y Wang, L Yang, W Ye, ...
arXiv preprint arXiv:2306.04528, 2023
HTML: Hierarchical Transformer-based Multi-task Learning for Volatility Prediction
L Yang, TLJ Ng, B Smyth, R Dong
WWW 2020, 2020
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization
Y Wang, Z Yu, Z Zeng, L Yang, C Wang, H Chen, C Jiang, R Xie, J Wang, ...
International Conference on Learning Representations (ICLR 2024), 2024
USB: A Unified Semi-supervised Learning Benchmark for Classification
Y Wang, H Chen, Y Fan, W Sun, R Tao, W Hou, R Wang, L Yang, Z Zhou, ...
NeurIPS 2022 Datasets and Benchmarks, 2022
Survey on factuality in large language models: Knowledge, retrieval and domain-specificity
C Wang, X Liu, Y Yue, X Tang, T Zhang, C Jiayang, Y Yao, W Gao, X Hu, ...
arXiv preprint arXiv:2310.07521, 2023
Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification
L Yang, EM Kenny, TLJ Ng, Y Yang, B Smyth, R Dong
COLING 2020, 2020
Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis
L Yang, J Li, P Cunningham, Y Zhang, B Smyth, R Dong
ACL 2021, 2021
Explainable Text-Driven Neural Network for Stock Prediction
L Yang, Z Zhang, S Xiong, L Wei, J Ng, L Xu, R Dong
CCIS 2018 (Best Paper Nomination), 2018
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
L Yang, S Zhang, L Qin, Y Li, Y Wang, H Liu, J Wang, X Xie, Y Zhang
ACL 2023 Findings, 2023
Multi-level attention-based neural networks for distant supervised relation extraction
L Yang, TLJ Ng, C Mooney, R Dong
AICS 2017, 2017
Fast-DetectGPT: Efficient zero-shot detection of machine-generated text via conditional probability curvature
G Bao, Y Zhao, Z Teng, L Yang, Y Zhang
International Conference on Learning Representations (ICLR 2024), 2024
MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction
J Li*, L Yang*, B Smyth, R Dong
CIKM 2020, 2020
Deepfake text detection in the wild
Y Li, Q Li, L Cui, W Bi, L Wang, L Yang, S Shi, Y Zhang
ACL 2024, 2024
A Rationale-Centric Framework for Human-in-the-loop Machine Learning
J Lu*, L Yang*, BM Namee, Y Zhang
ACL 2022, 2022
Leveraging BERT to Improve the FEARS Index for Stock Forecasting
L Yang, Y Xu, J Ng, R Dong
IJCAI 2019, 2019
NumHTML: Numeric-Oriented Hierarchical Transformer Model for Multi-task Financial Forecasting
L Yang, J Li, R Dong, Y Zhang, B Smyth
AAAI 2022, 2022
FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition
L Yang, L Yuan, L Cui, W Gao, Y Zhang
COLING 2022, 2022
Towards Fine-grained Causal Reasoning and QA
L Yang, Z Wang, Y Wu, J Yang, Y Zhang
arXiv preprint arXiv:2204.07408, 2022
