Följ
Alessandro Stolfo
Alessandro Stolfo
Verifierad e-postadress på ethz.ch - Startsida
Titel
Citeras av
Citeras av
År
Distilling Reasoning Capabilities into Smaller Language Models
K Shridhar*, A Stolfo*, M Sachan
ACL 2023 (Findings), 2023
178*2023
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis
A Stolfo, Y Belinkov, M Sachan
EMNLP 2023, 2023
712023
A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models
A Stolfo*, Z Jin*, K Shridhar, B Schölkopf, M Sachan
ACL 2023, 2022
522022
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Y Hou, J Li, Y Fei, A Stolfo, W Zhou, G Zeng, A Bosselut, M Sachan
EMNLP 2023, 2023
192023
Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?
A Opedal*, A Stolfo*, H Shirakami, Y Jiao, R Cotterell, B Schölkopf, ...
ICML 2024, 2024
152024
A Simple Unsupervised Approach for Coreference Resolution using Rule-based Weak Supervision
A Stolfo, C Tanner, V Gupta, M Sachan
Proceedings of the 11th Joint Conference on Lexical and Computational …, 2022
72022
Confidence Regulation Neurons in Language Models
A Stolfo*, B Wu*, W Gurnee, Y Belinkov, X Song, M Sachan, N Nanda
NeurIPS 2024, 2024
42024
Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
A Stolfo
NAACL 2024 (Findings), 2024
42024
Longtonotes: OntoNotes with Longer Coreference Chains
K Shridhar, N Monath, R Thirukovalluru, A Stolfo, M Zaheer, A McCallum, ...
EACL 2023 (Findings), 2022
32022
Improving instruction-following in language models through activation steering
A Stolfo, V Balachandran, S Yousefi, E Horvitz, B Nushi
arXiv preprint arXiv:2410.12877, 2024
22024
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–10