Hannah Cyberey

I’m a Computer Science PhD candidate at the University of Virginia, advised by Prof. David Evans and Prof. Yangfeng Ji. I’m broadly interested in AI safety and ethics. My primary research focuses on trustworthy natural language processing (NLP), addressing the robustness and fairness of language models. Currently, my work explores representation engineering methods for mitigating bias and countering censorship.
Email: hannahcyberey at virginia dot edu
news
Date | News |
---|---|
Apr 24, 2025 | Our paper “Do Prevalent Bias Metrics Capture Allocational Harms from LLMs?” is accepted to the Workshop on Insights from Negative Results in NLP. |
May 09, 2024 | I passed my PhD dissertation proposal defense! |
Apr 03, 2024 | Our paper “Addressing Both Statistical and Causal Gender Fairness in NLP Models” is accepted to NAACL 2024 Findings. |
Jun 07, 2023 | I’m co-leading the Causal Learning Reading Group at UVA this summer with Anshuman Suri. |
Oct 06, 2022 | Our paper “Balanced Adversarial Training: Balancing Tradeoffs between Fickleness and Obstinacy in NLP Models” is accepted to EMNLP 2022. |